Math 365, Elementary Statistics |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Lesson 8 : Comparing Two Populations Introduction
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Use of Calculators (if you have a TI-83): 2-SampZinterval |
|---|
|
Problems on 8.1: Confidence Interval of μ1 - μ2
Exercise 8.1.1. Suppose we have two normal
populations with means μ1,
μ2 and standard deviation
σ1, σ2
respectively. It is known that σ1
= 8.1 and σ2 = 11.3. A sample
of size m = 64 was collected from the first population, and the sample
mean was found to be x = 3.7. A sample of
size n = 99 was collected from the second population, and the sample
mean was found to be y = 4.1. Compute a 95
percent confidence interval for the difference of mean μ1-
μ2.
Solution
Exercise 8.1.2. The birth weight of babies in developed and developing countries are normally distributed with mean μ1, μ2 and standard deviation σ1, σ2, respectively. (My data is not real.) Given σ1 = 2.3 pounds and σ2 = 2.9 pounds. A sample of size m = 35 babies from the developed nations were collected and the sample mean birth weight was found to be x = 8.9 pounds. A sample of size n = 48 babies from the developing nations was collected and the sample mean birth weight was found to be y = 7.1 pounds.
Exercise 8.1.3. African elephants and Indian elephants are different in height, weight, and length of ear and tusk. It is natural to assume that all these are normally distributed. The mean height and standard deviation of African elephants are μ1, σ1 = 1.2 feet, respectively. The mean height and standard deviation of Indian elephants are μ2, σ2 = 1.1 feet, respectively. A sample of size 25 African elephants were collected and the sample mean height was found to be x = 10.9 feet. A sample of size 28 Indian elephants was collected and the sample mean height was found to be y = 9.1 feet.
As in the last section, we have two populations X, Y. We assume that X has N(μ1, σ1) distribution and Y has N(μ2, σ2) distribution. Unlike in the last section, we assume that σ1, σ2 are unknown. We try to find a confidence interval for μ1 - μ2.
We take a sample X1, X2, …, Xm of size m from the X population, and we take a sample Y1,Y2, …, Yn from the Y population. Following are some facts and notations.
σ1 = σ2 = σ.
And, we also assume that the X-sample and the Y-sample are mutually independent.
| 2 | |
| Let X and S | |
| X |
Sp2 =
[(m-1)SX2+(n-1)SY2 ]/
[m+n-2] =
[ ∑
(Xi-X)2
+ ∑ (Yj-Y
)2 ] / [m+n-2]
Although both SX2, SY2 are estimators of σ2, Sp2 is a better estimator for σ2 because it uses both the samples. One can see that Sp2 is a weighted average of SX2 and SY2.
T = [ (X - Y) - (μ1 -μ2) ] / [Sp√(1/m + 1/n) ]
has a t-distribution with m+n-2 degrees of freedom.
x-y-E < μ1- μ2 < x-y+E
whereE=tm+n-2,α/2 Sp √(1/m + 1/n)
| Use of Calculators (if you have a TI-83): 2-SampTint |
|---|
|
Problems on 8.2: When σ1 and σ2 Are Unknown
Exercise 8.2.1. Suppose that we are comparing two "similar" normal populations with means μ1, μ2 respectively and the populations both have standard deviation σ. We collected a sample of size m = 11 from the first population that produced a sample mean x = 13.2 and sample standard deviation s1 = 2.33. A sample of size n = 13 was collected from the second population that had sample mean y = 11.5 and sample variance s2 = 2.73.
Exercise 8.2.2. Suppose we have two normal populations with means μ1, μ2 and equal standard deviation σ. A sample of size m = 64 was collected from the first population and the sample mean and standard deviation were found to be x = 3.7, s1 = 9.2 . A sample of size n = 99 was collected from the second population and the sample mean and standard deviation were y = 4.1, s2 = 8.7.
Exercise 8.2.3. The birth weight of the babies
in developed and developing countries are normally distributed with
mean μ1,
μ2 and equal standard deviation
σ. (My data is not real.) Suppose the following data about
the birth weight from developed and developing nations were collected.
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Exercise 8.2.4. African elephants and Indian
elephants are different in height, weight, and length of ear and tusk.
It is natural to assume that all these are normally distributed. Assume
that the height of African and Indian elephants have an equal mean σ.
The mean heights of African elephants and Indian elephants are μ1,
μ2, respectively. Suppose
the following data were collected on the height of elephants from the
two continents (these are not real data).
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
In this section, we compute a confidence interval for the difference p1-p2 of two population proportions. An example follows.
Example. We would like to have an estimate for the difference between the proportion p1 of males who are making more than fifty thousand dollars annually and the proportion p2 of females who are making more than fifty thousand dollars annually. We construct a confidence interval for p1-p2.
Similarly, we might like to compare the proportion of defective items produced by an old machine and new machine in a factory.
Assume we have two populations. Let p1 be the proportion of Population 1 that has an attribute A and let p2 be the proportion of Population 2 that has the attribute A. We want to compute a confidence interval for p1-p2.
So, we take a sample of size m from Population 1 and let X be the number of sample members that have the attribute A and X=X/m be the sample proportion that has the attribute A. ( We may say that X is the number of "success" in this sample from Population 1 and X=X/m is the proportion of "success".) We take a sample from Population 2 of size n, which is independent of the other sample. Let Y be the number of sample members that has attribute A and Y=Y/n be the sample proportion that has the attribute A. (So, Y=Y/n is the sample proportion of "success" from Population 2.)
(Let me explain the context of the example above. We interview m males and X would be the number of males in this sample who make more than fifty thousand annually and X=X/m would be the proportion of the males in this sample who make more than fifty thousand annually. Similarly, we interview n females and Y=Y/n would be the proportion of females in this sample who make more than fifty thousand.)
We develop a confidence interval for p1-p2 as follows.
X=X/m
Y=Y/n
P(-zα/2<( (X- Y)-(p1-p2))/σ < zα/2 ) = 1-α
Theorem. An approximate (1-α)100 percent confidence interval for p1-p2 is given by
X-Y -E < p1-p2 < X-Y+E
where
E= Zα/2√( X(1-X)/m + Y(1-Y)/n )
| Use of Calculators (if you have a TI-83): 2-PropZint |
|---|
|
Problems on 8.3: Comparing Two Population Proportions.
Exercise 8.3.1. Suppose two independent samples
were collected from two populations. We want to compare the proportions
p1,p2 , respectively, of an attribute A present
in these two populations. Use 95 percent confidence interval to estimate
p1-p2. We are given that x = 55 had the attribute
A in a sample of size m = 117 from the first population and y = 37 had
the attribute A in a sample of size n = 79 from the second sample.
Solution
Exercise 8.3.2. To compare the proportions
p1,p2 of defective items produced by new and old
machines, respectively, samples were collected. In a sample of 57 items
from the new machine, 6 were found to be defective; and in a sample
of 41 items from the old, 9 were defective. Compute a 99 percent confidence
interval for p1-p2
Solution
Exercise 8.3.3. To compare the proportions
p1,p2 of men and women, respectively, who watch
football, data was collected. In a sample of 199 men, 83 said that they
watch football; and in a sample of 161 women, 51 said they watch football.
(These are not real data.) Construct a 99 percent confidence interval
for p1-p2.
Solution