Math 365, Elementary Statistics |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Lesson 7: EstimationIntroduction
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Use of Calculators (if you have a TI-83): Z-interval |
|---|
|
Problems on 7.1: Point and Interval Estimation
Exercise 7.1.1. Assume that you have a normal population with mean μ and standard deviation σ = 15. Suppose you have collected a sample of size 25 and the sample mean X was found to be 81.
Exercise 7.1.2. Assume that you have a normal
population with mean μ and standard
deviation σ = 9.8. Suppose you have
collected a sample of size 14 and the sample mean X
was found to be 151.1.
Exercise 7.1.3. The time taken by an athlete to run an event is normally distributed with mean μ and known standard deviation σ = 3.5 seconds. To estimate the mean μ, he ran 16 times and the sample mean was found to be X = 33 seconds.
Exercise 7.1.4. A population has normal distribution
with variance σ2 = 289.
How large a sample do we need to estimate the mean μ
within 3 units from the true value of μ,
with 90 percent confidence?
Solution
Exercise 7.1.5. The tuition X paid by a student
per semester in a university has a distribution with mean μ
and σ = $416. How large a sample should
you draw so that you are 95 percent sure that the true value of μ
will be within $10 of the sample mean x?
Solution
Let X be a normal random variable with mean μ and variance σ2. Unlike in the last section, in this section we assume that σ is not known, and we try to compute a confidence interval of μ. In the last section, the main tool (or fact) that we used was that
Z=(X-μ) √n/σ
has N(0,1) distribution. In this section, we use the distribution of
T=(X-μ) √n/S.
The distribution of T is known as t-distribution with degrees of freedom n-1, which we have not discussed. As we did for the N(0,1) random variable, we will now give the properties of t-distribution.
About t-distribution
Given a positive integer ν, there is a random variable T = tν that is said to have t-distribution with degrees of freedom ν. The useful properties of t-distribution are listed below:
P(T > tν, α) = α
where T has t-distribution with degrees of freedom ν.T=(X-μ) √n/S.
has t-distribution with degrees of freedom n-1.
So,
P(-tn-1,α/2 < (X-μ)√n/S < tn-1,α/2 ) = 1-α.
If we simplify, we get
P(X-E < μ < X+E)=1- α
where E=tn-1,α/2S/√n.
A (1-α)100
percent Confidence Interval for μ
Under the set up of the theorem, a (1-α)100 percent confidence interval for μ is given by
X-E <
μ < X+E
where E=tn-1,α/2s/ √n
E is also called the margin or error.
A Frequently Asked Question:To estimate μ, when do we use the ZInterval and when do we use the TInterval? Answer: We use the TInterval only when σ is not known.
| Use of Calculators (if you have a TI-83): T-interval |
|---|
|
Problems on 7.2: When σ Is Unknown
Exercise 7.2.1. Assume that we have normal
populations with mean μ and standard
deviation σ. We have a sample of size
n = 18 that has sample mean x = 170.5 and
standard deviation s = 13.3. Find the margin of error and compute a
99 percent confidence interval for μ.
Solution
Exercise 7.2.2. Suppose that the time taken
to complete a problem in a Math 365 test is normally distributed with
mean μ and standard deviation σ.
A sample of size 23 was taken, and sample mean and standard deviation
were found to be x = 4.7 and s = .47. Estimate
the mean time μ taken to complete a
problem using a 98 percent confidence interval.
Solution
Exercise 7.2.3. It is assumed that the lifetime
(in hours) of lightbulbs produced in a factory is normally distributed
with mean μ and standard deviation
σ. To estimate μ
the following data was collected on the lifetime of bulbs.
| 5110 | 4671 | 6441 | 3331 | 5055 | 5270 | 5335 | 4973 | 1837 |
| 7783 | 4560 | 6074 | 4777 | 4707 | 5263 | 4978 | 5418 | 5123 |
Compute a 95 percent confidence interval for μ.
Write down the formula for (1-α)100
percent confidence interval that you use here.
Solution
Exercise 7.2.4. To estimate the mean weight
(in pounds) of salmon in a river the following sample was collected:
| 34.7 | 33.8 | 38.2 | 20.3 | 27.8 | 45.3 | 43.1 | 37.3 | 32.5 | 32.3 |
| 31.8 | 41.5 | 44.5 | 29.2 | 25.3 | 29.6 | 39.5 | 29.1 | 37.3 |
Compute a 99 percent confidence interval for the sample mean μ.
Write down the formula for (1-α)100
percent confidence interval that you use here.
Solution
Exercise 7.2.5. Suppose we collect a sample
from a normal population of size n = 40 with sample mean X
= 18.6 and standard deviation s = 9.486. Construct a 95 percent confidence
interval for mean μ.
Solution
Exercise 7.2.6. The time taken by an athlete to run an event is normally distributed with mean μ and unknown standard deviation σ. To estimate the mean μ he ran 16 times and the sample mean was found to be X = 33 seconds and the sample standard deviation s = 3.5 seconds.
Let X be the normal random variable with mean μ and variance σ2. In this section, we will construct a confidence interval for σ2. We will take a sample X1,X2, …, Xn of size n from the X population. Let X be the sample mean and let S2 be the sample variance. To compute a confidence interval for σ2, we will be using the distribution of
U = (n-1)S2/σ2
The distribution of U is known as χ2 distribution with degrees of freedom n-1, which we have not discussed. Next we will give the properties of a χ2 random variable.
About χ2-distribution
Given a positive integer ν, there is a random variable χ2ν that is said to have χ2 distribution with degrees of freedom ν. The useful properties of χ2 distribution are listed below.
| P(U > χ2 | ) = α | |
| v, | α |
Theorem. Let X be a normal random variable with mean μ and variance σ2. Let X1,X2,…,Xn be a sample of size n from the X population. Then
T = (n-1)S2/σ2
has χ2 distribution with degrees of freedom n-1.
So,
P(χ2 n-1,1-α/2 < (n-1)S2/ σ2 < χ2 n-1,α/2 ) = 1-α.
If we simplify, we get
P(L < σ2
< U) = 1 - α
where
L = (n-1)S2/χ2n-1,α/2
U = (n-1)S2/χ2n-1,1-α/2
Theorem. Under the same set-up as in the above theorem, a (1-α)100 percent confidence interval for the variance σ2 is given by
l < σ2
< u
where
l = (n-1)s2/χ2n-1,α/2
u = (n-1)s2/χ2n-1,1-α/2
OR
| (n- 1)s2
χ2n- 1, [(α)/2] |
< σ 2 < | (n- 1)s2
χ2n- 1, 1- [(α)/2] |
. |
Use of Calculators: The TI-83 will not compute the confidence interval for σ2. If data is given, it is important to use the calculator to compute the sample variance s2.
Problems for 7.3: Confidence Interval for σ2
Exercise 7.3.1. Suppose that we have collected
a sample of size n = 26 from a normal population with mean μ
and variance σ2. The sample
variance was found to be s2 = 26.7. Compute a 95 percent
confidence interval for σ2.
Solution
Exercise 7.3.2. The following is sample data on the amount (in 1000 bushels) of wheat harvested by Kansas farmers in 2002.
| 206 | 300 | 200 | 385 | 280 |
| 600 | 225 | 933 | 320 | 260 |
Exercise 7.3.3. The following is data on monthly gas consumption (in ccf) during the winter months by a household.
| 154 | 222 | 264 | 257 | 127 |
| 228 | 240 | 393 | 278 | 140 |
Once again, let p be the population proportion of a certain attribute. We want to compute a confidence interval for p. We let
X = 1 if success
X = 0 if failure
where "success" means that the sample has the attribute.
So, X is a Bernoulli(p) random variable. We draw a sample X1,X2,…, Xn from the X population, let
X = X1+…+Xn
be the total number of success and
X=X/n
be the sample proportion of success. We have seen that, approximately, the sample proportion X has
N(μX,
σX)-distrubution
where μX = p
and σX
= √((p(1-p))/n).
Therefore,
P(-zα/2 < (X-p)/σX < zα/2 ) = 1-α.
In an attempt to compute a confidence interval for p we simplify and get
P(X-zα/2 σX < p < X+zα/2 σX) ) = 1-α.
Since p is unknown, this will not produce a confidence interval for p. But the sample proportion x of success is a point estimate of p. So we have an approximate (1-α)100 percent confidence interval for p given by
x-e <
p < x+e
where
e = zα/2 √(x(1-x)/n)
Following are some of the useful formulas and definitions that we may need.
e = zα/2 √(x(1-x)/n)
E = zα/2/√4n.
It can be checked that the margin of error e is always less or equal to the conservative margin of error E.
n = (zα/2/2E)2 , rounded to the higher integer.
Remark. In the days of Clinton's impeachment, we often heard TV newscasters read something like the following.
President Clinton has 64 percent approval rating. The poll has a margin of error plus or minus 3.1 percentage points. The poll surveyed 972 people.
They mean that the sample proportion x of people who "approve" President Clinton is 0.64. Normally they don't tell us the level of confidence they are using. Assuming that they are using a 95 percent confidence interval, they mean that
E = zα/2 /√4n = 1.96/√(4x972) = 0.031.
| Use of Calculators (if you have a TI-83): 1-PropZint |
|---|
|
Problems on 7.4: About the Population Proportion
Exercise 7.4.1 In a sample of 197 apples
from a lot, 19 were found to be sour. Set a 99 percent confidence interval
for the proportion p of sour apples in the lot.
Solution
Exercise 7.4.2. A new vaccine was tried on
147 randomly selected individuals, and it was determined that 97 of
them developed immunity. Find a 95 percent confidence interval for the
proportion p of individuals in the population for whom the vaccine would
help.
Solution
Exercise 7.4.3. Before a congressional election, a poll was conducted. Out of 887 randomly selected voters interviewed, 389 said that they would vote for Candidate A, and 359 said that they would vote for Candidate B.
Exercise 7.4.4. If a pollster wanted to estimate
the proportion p of Americans who think that the President should not
be impeached, how large a sample should he/she take so that the true
value of p will be within .02 of the sample proportion, with 99 percent
confidence?
Solution
Exercise 7.4.5. The proportion p of defective
lightbulbs produced by a machine needs to be estimated within .01 to
determine whether the machine needs to be replaced. How large a sample
should we take to do this with 90 percent confidence?
Solution
Exercise 7.4.6. In a poll released on October 28,1998, it was revealed that 60 percent of Americans wanted President Clinton rebuked but not impeached. The poll was conducted among 1,013 adults, and it had a margin of error of 3 percentage points.
Solution: News media polls use 95 percent confidence intervals. When they say "margin of error," they mean "conservative margin of error." The conservative margin of error E and level of confidence 1 - α are related by the formula E = zα/2 /√4n. For this problem E = .03, 1 - α =.95, and n =1,013. We can check zα/2 /√4n = 1.96/√(4x1013) = 0.03079.