Math 105, Topics in Mathematics |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Lesson 5: Normal DistributionIntroduction
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Scores | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Frequency | 1 | 2 | 6 | 10 | 16 | 13 | 9 | 8 | 5 | 2 | 1 |
The following is the bar graph of this data:

Example 5.0.2. Following is the class frequency table of weight (in pounds) of some babies:
| Class Intervals | 48-60 | 60-72 | 72-84 | 84-96 | 96-108 | 108-120 | 120-132 | 132-144 | 144-156 | 156-168 |
|---|---|---|---|---|---|---|---|---|---|---|
| Frequency | 15 | 24 | 41 | 67 | 119 | 184 | 142 | 26 | 5 | 2 |
The following is the bar graph of this data:
Example 5.0.3. The following is the frequency table of hourly wages of a group of workers:
| Wages | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Frequencies | 3 | 5 | 7 | 4 | 12 | 10 | 13 | 11 | 15 | 13 | 11 | 4 | 3 | 0 | 0 | 0 | 1 |
The following is the bar graph of this data :

Example 5.0.4. The following is the percentage frequency table of the weight of a large number of salmon:
| weight (in pounds) |
20- 24 |
25- 29 |
30- 34 |
35- 39 |
40- 44 |
45- 49 |
50- 54 |
55- 59 |
60- 64 |
65- 69 |
70- 74 |
75- 80 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Percentage Frequencies |
1.1 | 1.9 | 4.3 | 8.7 | 13.2 | 17.8 | 17.3 | 14.6 | 10.6 | 6.1 | 2.8 | 1.6 |
The following is the bar graph :

You can see that all these bar graphs fit into a nice bell-shaped curve. Some fit into the bell shape better than the other graphs.
Perhaps the most amazing fact in statistics is that bar graphs of almost all kinds of numerical data in nature fit nicely into a bell-shaped curve. How well a data set fits into a perfect bell-shaped curve largely depends on the size of the data. If the data set is large, then the bar graph or the histogram almost always fits into a perfect bell-shaped curve.
In this chapter we consider data sets whose bar graph or the histogram fits into a perfect bell-shaped curve. In fact, we talk about population data whose bar graph or histogram fits into a perfect bell-shaped curve. At this point it is helpful to recall the discussion on variables.
Recall the following definition of variables.
Definition: A variable is a characteristic of the population members that varies with each member of the population.
We do not distinguish between "variables" and "random variables" (as many textbooks do). I only want to mention that a variable, in the context of random experiments and sample spaces is called a random variable. A random variable X assigns a value to each outcome (and a variable assigns a value to each population member). In other words, here the sample space S takes the place of the population.
Example 5.1.1. Suppose we want to study the KU student population. Our random experiment is to select a KU student randomly. So, the sample space S, in this case, is the KU student population, and each student is an outcome. Now all the variables that we considered in the example of KU student population (in lesson 2) are also random variables. GPA, weight, height, and annual income of KU students are random variables.
Properties of Normal Curves:
Suppose X is a random variable that has normal distribution. Let us denote the mean (average) value of X by μ and the standard deviation by σ. Following are some properties of the normal curve of X.
μ = mean of X = median of X = center.
We say that the normal curve is symmetric around the mean.
z = (x-μ)/σ
is called the standardized value of X. We also call it the z-value of x.
(y-x)/σ - standard deviation apart
from each other.Example 5.2.1. Suppose it is known that the annual income X of the U.S. population is normally distributed with mean μ = $37,000.00 and standard deviation σ = $13,000.00. (My data is not real data.)
| The Solution |
|---|
| (y-x)/σ = (64,000-12,000)/13,000 = 4 standard deviations. |
| The Solution |
|---|
| (y-x)/σ = (77,000-64,000)/13,000 = 1 standard deviation. |
| The Solution |
|---|
| (y-x)/σ = (77,000 - 12,000)/13,000 = 5 standard deviations. |
Example 5.2.1-B.
The weight X salmon in a river has normal
distrubution with mean μ = 37 pounds and
standard deviation σ = 8 pounds.
The weight of a salmon you caught recently is 18 pounds.
The weight of a salmon that your friend caught is 30 pounds.
How many standard deviation apart are the these two fish?
Answer:
Suppose that a data set (or a random variable) has normal distribution. Then we have following facts:
|
Quartiles:
(Q3-μ)/σ = 0.675.
Q3= μ + 0.675 σ.
Q1= μ - 0.675 σ.
Example 5.2.2. Suppose scores X of students on certain test (like SAT) are approximately normally distributed with mean μ = 70 and standard deviation σ = 10 points. Suppose 1500 students take the test.
| The Solutions |
|---|
|
Example 5.2.3. Let us have the same set up as in the above problem. Suppose that 1500 students took the test.
| The Solutions |
|---|
|
Example 5.2.4. Let us have the same set up as above. Estimate the first and the third quartile.
| The Solutions |
|---|
|
Percentiles
For a number p between 0 and 100, your score in a test is (above) p-percentile, if p percent of the students who took the test did not do as well as you did. More formally:
Definition. For a variable X, the pth percentile is a number xp so that P(X < xp) = p/100. In other words, p percent of the data values are less than (or also equal to) xp. So
Example 5.2.5. The distribution of weight of six-month-old baby boys is approximately normal with mean μ = 17.25 pounds and standard deviation σ = 2 pounds.
| The Solutions |
|---|
We will first directly compute the quartiles.
|
Problems on 5.2: Probability (using The 68-95-99.7 Rule)
Exercise 5.2.1. Suppose that the weight X at birth of babies have normal distribution with mean μ = 115 ounce and standard deviation σ = 17 ounces. Use the 68-95-99.7 rule and do the following:
Exercise 5.2.2. Suppose that the length X of salmon in a river have normal distribution with mean μ = 17 inches and standard deviation σ = 4 inches. Use the 68-95-99.7 rule and do the following:
Problems on 5.2: Percentiles
Exercise 5.2.3. The distribution of weight for twelve-month-old baby girls is approximately normal with mean μ = 21 pounds and standard deviation σ = 2.2 pounds.
Exercise 5.2.4. The distribution of weights for one-month-old baby girls is approximately normal with mean μ = 8.75 pounds and standard deviation σ = 1.1 pounds.
Exercise 5.2.5. The distribution of weights for one-month-old baby boys is approximately normal with mean μ = 22.5 pounds and standard deviation σ = 2.2 pounds.
We have discussed both discrete and continuous random variables. We note here that a normal random variable X is a continuous random variable.
For a normal random variable X, we only used the 68-95-99 rule to compute the probability that X is within 1, 2, or 3 standard deviations σ from the mean μ. We will learn to use probability tables to compute the probability that X is within any two given numbers.
First, we can rephrase the definition of the normal distribution as follows:
Suppose that a random variable X (or a data set) has normal distribution with mean μ and standard deviation σ. Then the relative frequency histogram of the data fits into a a perfect bell-shaped curve as follows.
This bell-shaped curve is called a normal curve.
Now we have the following:
| P(a < X < b) = P(a ≤ X < b
) = P(a < X ≤ b) = P(a ≤ X ≤ b) = the area under the normal curve, above the x-axis (of X) and between the vertical lines x = a and x = b. |
||
|
| P( X ≤ b) = P( X <b) = the area under the normal curve (of X), above the x-axis and on the left side of the vertical line x = b. |
|
|
| P(a < X) = P(a ≤ X) = the area under the normal curve (of X), above the x-axis and on the right side of x = a. |
|
|
The Z-Table
Recall that the normal random variable with mean μ = 0 and standard deviation σ = 1 is called standard normal random variable (denoted by Z). Tables are available that you can use to compute the probability for a standard normal random variable. I have compiled a probability table for Z random variable. This table was compiled to specifically meet the needs of this class.

Computing Probability for a Normal Random Variable
Suppose X is a normal random variable with mean μ and standard deviation σ.
Z = (X - μ) / σ
is a standard normal random variable.
P(a < X < b)
= P((a-μ)/σ <(X-μ)/σ < (b - μ)/σ)
= P((a-μ)/σ < Z < (b - μ)/σ)
= The difference of the two numbers to be read from the z-table.
Example 5.3.1. The length X of life of some light bulbs produced in a factory is normally distributed with mean μ = 8640 hours and standard deviation σ\ = 1440 hours.
| The Solutions |
|---|
|
Example 5.3.2. The length X of a fish in a lake has normal distribution with mean 67 cm and standard deviation 21 cm. What proportion (i.e., probability) of fish have length between 44 cm and 110 cm?
| The Solutions |
|---|
Here we are asked to find P(44 < X < 110). We have
|
Example 5.3.3. The diameters of the pumpkins in my patch have normal distribution with mean 13 inches and standard deviation 4.5 inches. What proportion (i.e., probability) of pumpkins is above 22 inches?
| The Solutions |
|---|
Here we are asked to find P(22 < X).
|
Example 5.3.4. The amount of vegatable oil
X produced by a machine in a day is normally distributed with μ
= 130 liters and standard deviation σ = 25 liters. What is the
probability that a machine produces between 120 liters and 150 liters
each day?
Solution
| The Solutions |
|---|
Here we are asked to find P(120 < X < 150).
|
Problems on 5.3: Probability (using the Probability Table )
Exercise 5.3.1. Let Z be the standard normal random variable.
Experiment with the normal-animation.
Solution
Exercise 5.3.2. Let X be a normal random variable with mean μ = 3 and standard deviation σ = 1.5 .
Experiment with the normal-animation.
Experiment with the Solution
Exercise 5.3.3. The telephone company's data shows that length
X of their international calls has normal distribution with mean 11.5
minutes and standard deviation 4.3 minutes. The company decided to give
a special rate for the longest 20 percent calls. What percentage of
international calls last between 5 minutes and 15 minutes?
Exercise 5.3.4. The annual expenditure X of a student is approximately
normally distributed with mean μ = 11,000 dollars and standard
deviation σ = 1500 dollars. What percentage of students spend less
than 10,000 dollars?
Solution
Exercise 5.3.5. Suppose the annual production X of milk per cow
is normally distributed with μ = 5500 liters and standard deviation
σ = 150 liters. What percentage of cows have annual yield less
than 5155 liters?
Solution
Exercise 5.3.6. The weight X at birth of babies is normally distributed
with mean μ = 114 oz. and standard deviation σ = 18 oz. What
percentage of babies have birth weight below 141 oz.?
Solution