BW Chapter 6
Probability
Defining probability:
• For a situation in which several different outcomes are possible, the probability for any specific outcome is
defined as a fraction or a proportion of all the possible outcomes.
• If the possible outcomes are identified as A, B, C, D, and so on, then:
probability of A = (number of outcomes classified as A) / (total number of possible outcomes)
Probability values:
Random sampling:
• Probability usually involves a population of scores that can be displayed in a frequency distribution graph.
• Different portions of the graph represent different portions of the population.
• Make use of probability notation when stating the probability.
• Whenever a population is presented in a frequency distribution graph, it becomes possible to represent probabilities as proportions of the graph.
p (X > 4) = ? p (X < 5) = ?
p (X > 4) = 2/10 p (X < 5) = 8/10
p (X > 4) = 0.2 or 20% p (X < 5) = 0.8 or 80%
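The idea that a probability is a proportion of the population can be checked by simple counting. The sketch below is only an illustration: the list `population` is a hypothetical set of N = 10 scores chosen so that it reproduces the fractions above (the notes do not list the actual scores), and `proportion` is a helper name introduced here.

```python
from fractions import Fraction

# Hypothetical population of N = 10 scores (an assumption for illustration;
# the actual scores behind the graph are not listed in the notes).
population = [1, 1, 2, 3, 3, 4, 4, 4, 5, 6]

def proportion(scores, condition):
    """Return the fraction of scores that satisfy the condition."""
    matching = sum(1 for x in scores if condition(x))
    return Fraction(matching, len(scores))

p_greater_than_4 = proportion(population, lambda x: x > 4)  # scores 5 and 6
p_less_than_5 = proportion(population, lambda x: x < 5)     # the other 8 scores

print(float(p_greater_than_4), float(p_less_than_5))
```

Using `Fraction` keeps the result as an exact proportion of the population; converting to `float` gives the decimal form used in probability notation.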
• Normal distribution:
Symmetrical.
Highest frequency in the middle.
Frequencies taper off as you move toward either extreme.
• Normal shape can also be described by the proportions of area that are contained in each distribution section.
• Statisticians often identify the scores of a normal distribution by using z-scores.
• A normal distribution can be defined in terms of its proportions, i.e. a distribution is normal only if it has the correct proportions.
Example: the population distribution of SAT scores is normal with a mean of 500 and a standard deviation of 100.
Given this information about the population and the known proportions for a normal distribution, we can determine
the probabilities associated with specific samples. What is the probability of randomly selecting an individual from
this population who has an SAT score greater than 700?
p (X > 700) = ?
1. The probability question is translated into a proportion question: out of all the possible SAT scores, which
proportion is greater than 700?
2. The set of ‘all possible SAT scores’ is simply the population distribution. The mean is µ = 500, so the score X = 700
is to the right of the mean. Shade in the area to the right of 700 → this represents the proportion we are trying to find.
3. Identify the exact position of X = 700 by computing a z-score … and find that an SAT score of X = 700 is exactly
two standard deviations above the mean and corresponds to a z-score of z = +2.00.
z = (X - µ) / σ = (700 - 500) / 100 = 200 / 100 = 2.00
4. The proportion we are trying to determine may now be expressed in terms of its z-score.
p (z > 2.00) = ?
p (X > 700) = p (z > 2.00) = 2.28%.
According to the proportions shown in the graph of the normal distribution, every normal distribution has 2.28% of its
scores in the tail beyond z = +2.00, regardless of µ or σ. Therefore p (X > 700) = 2.28%.
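The four steps of the SAT example can also be checked numerically. This is a minimal sketch that assumes Python's standard-library `statistics.NormalDist` stands in for the unit normal table; the variable names are illustrative only.

```python
from statistics import NormalDist

mu, sigma = 500, 100   # SAT population parameters from the example
x = 700

# Step 3: locate X = 700 as a z-score.
z = (x - mu) / sigma

# Step 4: the proportion in the tail beyond z, taken from the standard normal.
p_tail = 1 - NormalDist().cdf(z)

# The same area computed directly on the X scale, without converting to z.
p_direct = 1 - NormalDist(mu, sigma).cdf(x)

print(f"z = {z:.2f}, p(X > {x}) = {p_tail:.4f}")
```

Both routes give the same area, which is the point of the z-score transformation: the tail proportion depends only on z, not on µ or σ.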
• Complete listing of the full range of z-scores and proportions is on SUNLearn in the unit normal table.
• The graph of the normal distribution shows proportions for only a few select z-score values.
• Column A: lists z-score values corresponding to different positions in a normal distribution.
Imagine a vertical line through the distribution → exact location of the line describes a z-score value.
The vertical line also separates the distribution into 2 sections: the body and the tail.
The body is the larger section and the tail is the smaller section.
Keep in mind in order to make full use of the unit normal table:
• The body always corresponds to the larger part of the distribution whether it is on the right or the left.
• The tail is always the smaller section whether it is on the right or the left.
• The normal distribution is symmetrical, so proportions on the right side are the same as the corresponding ones on the left.
• For a negative z-score, the tail is on the left and the body on the right → vice versa for positive z-scores.
• Unit normal table does not provide negative z-score values → find corresponding proportion of positive z.
• Proportions are always positive, even though the z-score signs (+ and –) change from one side to the other.
• Column C always lists the proportion in the tail whether it is the right tail or left tail.
• Unit normal table lists relationships between z-score locations and proportions in a normal distribution.
• Probability is equivalent to proportion, so the unit normal table can be used to look up probabilities in normal distributions.
• Unit normal tables can be used for finding proportions/probabilities for specific z-score values.
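To make the table layout concrete, the sketch below regenerates a few rows of a unit normal table. It assumes the usual column meanings (A = z-score, B = proportion in the body, C = proportion in the tail, D = proportion between the mean and z) and uses `statistics.NormalDist` in place of the printed table; `unit_normal_row` is a hypothetical helper name.

```python
from statistics import NormalDist

def unit_normal_row(z):
    """Return (z, body, tail, mean_to_z) for one row of a unit normal table."""
    z = abs(z)                   # the table lists positive z-scores only
    body = NormalDist().cdf(z)   # column B: the larger section
    tail = 1 - body              # column C: the smaller section
    mean_to_z = body - 0.5       # column D: area between the mean and z
    return z, body, tail, mean_to_z

print("  A       B       C       D")
for z in (0.30, 0.70, 1.00, 1.33, 2.00):
    a, b, c, d = unit_normal_row(z)
    print(f"{a:.2f}  {b:.4f}  {c:.4f}  {d:.4f}")
```

Taking `abs(z)` mirrors how the table is used for negative z-scores: by symmetry, the proportions are those of the corresponding positive z, and only the side on which the tail falls changes.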
What proportion of the normal distribution corresponds to z-score values greater than z = 1.00?
- Sketch the distribution and shade in the area you are trying to determine. In this case, the shaded portion is the tail
of the distribution beyond z = 1.00.
- To find this shaded area → look for z = 1.00 in column A to find the appropriate row in the unit normal table.
- Scan across the row to column C (tail) to find the proportion.
- Using the table in Appendix B, you should find that the answer is 0.1587.
- Notice that this problem could have been phrased as a probability question such as ‘for a normal distribution,
what is the probability of selecting a z-score value greater than z = +1.00?’. The answer is p (z > 1.00) = 0.1587 (or
15.87%).
For a normal distribution, what z-score separates the top 10% from the remainder of the distribution?
- To answer this question, we have sketched a normal distribution and drawn a vertical line that separates the
highest 10% (approximately) from the rest. The problem is to locate the exact position of this line.
- For this distribution, we know that the tail contains 0.1000 (10%) and the body contains 0.9000 (90%).
- To find the z-score value, you simply locate the row in the unit normal table that has 0.1000 in column C or 0.9000
in column B. For example, you can scan down the values in column C (tail) until you find a proportion of 0.1000.
- Note that you probably will not find the exact proportion, but you can use the closest value listed in the table. For
this example, a proportion of 0.1000 is not listed in column C but you can use 0.1003, which is listed.
- Once you have found the correct proportion in the table, simply read across the row to find the corresponding
z-score value in column A. For this example, the z-score that separates the extreme 10% in the tail is z = 1.28.
- At this point you must be careful because the table does not differentiate between the right-hand tail and the
left-hand tail of the distribution. The final answer could be either z = +1.28, which separates 10% in the right-hand tail,
or z = -1.28, which separates 10% in the left-hand tail.
- For this problem we want the right-hand tail (the highest 10%), so the z-score value is z = +1.28.
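Going from a proportion back to a z-score is the inverse of the table lookup. As a rough check, and assuming `statistics.NormalDist.inv_cdf` as a substitute for scanning the table, the cut-off for the top 10% can be computed as follows (variable names are illustrative):

```python
from statistics import NormalDist

top_proportion = 0.10

# The highest 10% is the right-hand tail, so 90% (the body) lies below the cut.
z_right = NormalDist().inv_cdf(1 - top_proportion)

# The mirror-image cut that separates the lowest 10% in the left-hand tail.
z_left = NormalDist().inv_cdf(top_proportion)

print(f"right-hand tail: z = {z_right:+.2f}")
print(f"left-hand tail:  z = {z_left:+.2f}")
```

Unlike the table, `inv_cdf` works with the cumulative proportion below the cut, so the sign of the answer comes out automatically.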
It is known that IQ scores form a normal distribution with µ = 100 and σ = 15. Given this information, what is the
probability (the question can also ask for the proportion) of randomly selecting an individual with an IQ score of less than 120?
z = (X - µ) / σ = (120 - 100) / 15 = 20 / 15 = 1.33
- an IQ score of X = 120 corresponds to a z-score of z = 1.33, so IQ scores < 120 correspond to z-scores < 1.33.
- find the z-score in the unit normal table and find the answer in column B.
- find that 1.33 corresponds to a proportion of p = 0.9082.
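The IQ example can be reproduced in a few lines. The sketch assumes `statistics.NormalDist` in place of the table and rounds z to two decimals to mimic the table lookup; the unrounded value is shown only to illustrate the small difference that rounding introduces.

```python
from statistics import NormalDist

mu, sigma = 100, 15   # IQ population parameters from the example
x = 120

z = (x - mu) / sigma          # 20 / 15
z_table = round(z, 2)         # the table lists z to two decimals: 1.33

# Proportion in the body (everything below z), as read from column B.
p_table = NormalDist().cdf(z_table)

# Same question answered without rounding z first.
p_unrounded = NormalDist(mu, sigma).cdf(x)

print(f"z = {z_table:.2f}, p(X < {x}) = {p_table:.4f}")
print(f"without rounding z: {p_unrounded:.4f}")
```

The two answers differ slightly in the fourth decimal place because 20/15 is not exactly 1.33.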
• The same process can be used to find the probability of selecting a score that is located between 2 specific values.
• Often easiest to solve using the information in column D of the unit normal table.
The highway department conducted a study measuring driving speeds on a local section of interstate highway.
They found an average speed of µ = 58 miles per hour with a standard deviation of σ = 10. The distribution was
approximately normal. Given this information, what proportion of the cars are traveling between 55 and 65 miles per hour?
Using probability notation, we can express the problem as: p (55 < X < 65) = ?
- determine the z-score corresponding to the X value at each end of the interval.
For X = 55: z = (X - µ) / σ = (55 - 58) / 10 = -3 / 10 = -0.30
For X = 65: z = (X - µ) / σ = (65 - 58) / 10 = 7 / 10 = 0.70
- the first area is the proportion between the mean and z = -0.30, and the second is the proportion between the mean and z = +0.70.
- find that these proportions are 0.1179 and 0.2580 respectively.
- the total proportion is found by adding these two sections to get 0.3759.
p (55 < X < 65) = p (-0.30 < z < +0.70) = 0.1179 + 0.2580 = 0.3759 (or 37.59%).
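The "between two values" calculation can be sketched as two areas measured from the mean, as in the steps above. This assumes `statistics.NormalDist` as a stand-in for the table; all variable names are illustrative.

```python
from statistics import NormalDist

speeds = NormalDist(mu=58, sigma=10)   # driving speeds from the example
std = NormalDist()                     # standard normal (z-score) distribution

z_low = (55 - speeds.mean) / speeds.stdev    # -0.30
z_high = (65 - speeds.mean) / speeds.stdev   # +0.70

# Area between each z-score and the mean (what the table lists per z).
below_mean = 0.5 - std.cdf(z_low)    # between z = -0.30 and the mean
above_mean = std.cdf(z_high) - 0.5   # between the mean and z = +0.70

total = below_mean + above_mean

# Cross-check: the same interval computed directly on the X scale.
direct = speeds.cdf(65) - speeds.cdf(55)

print(f"{below_mean:.4f} + {above_mean:.4f} = {total:.4f}")
```

Adding the two sections works here because the interval straddles the mean; when both scores lie on the same side of the mean, the smaller area is subtracted from the larger one instead.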
• It is possible to reverse the 2-step process discussed above, i.e. to move in the opposite (counter-clockwise) direction.
• Find the score (X value) corresponding to a specific proportion in the distribution.
• Begin with a specific proportion, use the unit normal table to find the corresponding z-score, then transform it into
an X value.
The distribution of commuting times for American workers is normal with a mean of µ = 24.3 minutes and a
standard deviation of σ = 10 minutes. For this example, we will find the range of values that defines the middle 90% of
the distribution.
- the 90% shown in the middle (0.9000) can be split in half with 0.4500 on each side of the mean.
- look up 0.4500 in the unit normal table → the exact proportion is not listed, but 0.4495 and 0.4505 are.
- 0.4505 corresponds to z = 1.65, so the z-score at the right boundary is z = +1.65 and at the left boundary is z = -1.65.
- using X = µ + zσ, the score at the right boundary is X = 24.3 + 1.65(10) = 24.3 + 16.5 = 40.8 and the score at the left boundary is X = 24.3 - 16.5 = 7.8.
- the middle 90% of the distribution corresponds to values between 7.8 and 40.8.
90% of American commuters spend between 7.8 and 40.8 minutes commuting to work each day.
The remaining 10% of commuters spend either less than 7.8 minutes or more than 40.8 minutes commuting.
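The reverse process (proportion → z-score → X value) can be sketched as below. It assumes `statistics.NormalDist.inv_cdf` in place of the table lookup, so the z-score is about 1.645 rather than the table's nearest listed value of 1.65, and the boundaries differ slightly from 7.8 and 40.8 as a result.

```python
from statistics import NormalDist

commute = NormalDist(mu=24.3, sigma=10)   # commuting times from the example

middle = 0.90
tail = (1 - middle) / 2                   # 0.05 left over in each tail

# z-score with 0.4500 between it and the mean, i.e. 0.95 below it.
z = NormalDist().inv_cdf(1 - tail)

# Transform the z-scores back into X values: X = mu + z * sigma.
lower = commute.mean - z * commute.stdev
upper = commute.mean + z * commute.stdev

print(f"z = ±{z:.3f}")
print(f"middle 90%: {lower:.1f} to {upper:.1f} minutes")
```

The small gap between these boundaries and the hand-calculated ones comes only from rounding z to the nearest table entry.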