Lecture 4
Lecture 4
Distributions
3
Random Variables (1 of 3)
A quantitative variable x will vary or change depending on
the particular outcome of the experiment being measured.
For example, suppose you toss a die and measure x, the
number observed on the upper face.
4
Random Variables (2 of 3)
DEFINITION
A variable x is a random variable if the value that it
assumes, corresponding to the outcome of an experiment,
is a chance or random event.
5
Random Variables (3 of 3)
It is important to distinguish between discrete and
continuous random variables because different techniques
are used to describe their distributions.
6
Probability Distributions
7
Probability Distributions (1 of 3)
We defined probability as the limiting value of the relative
frequency as the experiment is repeated over and over
again.
8
Probability Distributions (2 of 3)
DEFINITION
The probability distribution for a discrete random
variable is a formula, table, or graph that gives all the
possible values of x, and the probability p(x) associated
with each value.
9
Probability Distributions (3 of 3)
Requirements for a Discrete Probability Distribution
• 0 ≤ p(x) ≤ 1
• Σp(x) = 1
10
Example 5.1
Toss two fair coins and let x equal the number of heads
observed. Find the probability distribution for x.
11
Example 5.1 – Solution (1 of 6)
The simple events and their probabilities are listed in table.
Simple
Event Coin 1 Coin 2 P(Ei) x
E1 H H 1∕4 2
E2 H T 1∕4 1
E3 T H 1∕4 1
E4 T T 1∕4 0
12
Example 5.1 – Solution (2 of 6)
Since E1 = HH results in two heads, this simple event
results in the value x = 2. Similarly, the value x = 1 is
assigned to E2, and so on.
13
Example 5.1 – Solution (3 of 6)
For example, when x = 0, simple event E4 occurs, so that
and when x = 1,
14
Example 5.1 – Solution (4 of 6)
The values of x and their respective probabilities, p(x), are
listed in table. Notice that the probabilities add to 1.
Simple
x Events in x p(x)
0 E4 1∕4
1 E2,E3 1∕2
2 E1 1∕4
Σp(x) = 1
Probability Distribution for x (x = Number of Heads)
Table 5.2
15
Example 5.1 – Solution (5 of 6)
The probability distribution in Table 5.2 can be graphed in
several ways.
Figure 5.1
16
Example 5.1 – Solution (6 of 6)
The three values of the random variable x are located on
the horizontal axis, and the probabilities p(x) are located on
the vertical axis. Consider Figure 5.1(b), which looks like the
relative frequency histogram.
Since the width of each bar is 1, the area under the bar is
the probability of observing the particular value of x and the
total area under the bars equals 1.
17
The Mean and Standard Deviation for
a Discrete Random Variable
18
The Mean and Standard Deviation for a Discrete Random Variable (1 of 6)
19
The Mean and Standard Deviation for a Discrete Random Variable (2 of 6)
20
The Mean and Standard Deviation for a Discrete Random Variable (3 of 6)
Toss those two fair coins again, and let x be the number of
heads observed. We constructed this probability distribution
for x:
x 0 1 2
p(x) 1∕4 1∕2 1∕4
21
The Mean and Standard Deviation for a Discrete Random Variable (4 of 6)
DEFINITION
Let x be a discrete random variable with probability
distribution p(x). The mean or expected value of x is
given as
μ = E(x) = Σxp(x)
22
The Mean and Standard Deviation for a Discrete Random Variable (5 of 6)
23
The Mean and Standard Deviation for a Discrete Random Variable (6 of 6)
DEFINITION
Let x be a discrete random variable with probability
distribution p(x) and mean μ. The variance of x is
DEFINITION
The standard deviation σ of a random variable x is
equal to the positive square root of its variance.
24
Example 5.2 (1 of 2)
A “big-box” store sells a particular laptop, but has only four
in stock. The manager wonders what today’s demand for
this particular laptop will be.
25
Example 5.2 (2 of 2)
Find the mean, variance, and standard deviation of x. Is it
likely that five or more customers will want to buy the laptop
today?
26
Example 5.2 – Solution (1 of 4)
Table shows the values of x and p(x), along with the
individual terms used in the formulas for μ and
Table 5.3
27
Example 5.2 – Solution (2 of 4)
The sum of the values in the third column is
28
Example 5.2 – Solution (3 of 4)
The graph of the probability distribution is shown in figure.
Figure 5.2
29
Example 5.2 – Solution (4 of 4)
Since the distribution is approximately mound-shaped,
approximately 95% of all measurements should lie within two
standard deviations of the mean—that is,
30
Example 5.4
An insurance company needs to know how much to charge
for a $100,000 policy insuring an event against cancellation
due to inclement weather.
31
Example 5.4 – Solution (1 of 4)
Define:
x = insurance company’s gain
C = premium charged for the policy
32
Example 5.4 – Solution (2 of 4)
If the event is cancelled because of inclement weather, the
insurance company will receive $C, but will pay out
$100,000, so that the gain is (−100,000 + C) with probability
2 ∕ 100.
x = Gain p(x)
C 98 ∕ 100
−(−100,000 + C) 2 ∕ 100
33
Example 5.4 – Solution (3 of 4)
Since the company wants to set the premium C so that in
the long run, the average gain will be zero, set the expected
value of x equal to zero and solve for C.
E(x) = Σxp(x)
and C = $2,000.
34
Example 5.4 – Solution (4 of 4)
If the insurance company charged a premium of $2,000, the
average gain for a large number of similar policies would
equal zero.
35
5.2 The Binomial Probability
Distribution
DEFINITION
A binomial experiment is one that has these five
characteristics:
1. The experiment consists of n identical trials.
2. Each trial results in one of two outcomes. For lack of a
better name, one outcome is called a success, S, and
the other a failure, F.
3. The probability of success on a single trial is equal to p
and remains the same from trial to trial. The probability
of failure is equal to (1 − p) = q.
37
The Binomial Probability Distribution (2 of 9)
DEFINITION
4. The trials are independent.
5. We are interested in the binomial random variable x,
the number of successes in n trials, for x = 0,1,2, ..., n.
38
Example 5.5
Suppose there are approximately 1,000,000 adults in a
county and an unknown proportion p favors term limits for
politicians. A sample of 1000 adults will be chosen in such a
way that every one of the 1,000,000 adults has an equal
chance of being selected, and each adult is asked whether
he or she favors term limits.
39
Example 5.5 – Solution (1 of 3)
Does the experiment have the five binomial characteristics?
40
Example 5.5 – Solution (2 of 3)
3. The probability of success, p, is the probability that an
adult favors term limits. Does this probability remain the
same for each adult in the sample? For all practical
purposes, the answer is yes. For example, if 500,000
adults in the population favor term limits, then the
probability of a “success” when the first adult is chosen is
500,000 ∕ 1,000,000 = 1 ∕ 2. When the second adult is
chosen, the probability p changes slightly, depending on
the first choice. That is, there will be either 499,999 or
500,000 successes left among the 999,999 adults. In
either case, p is still approximately equal to 1 ∕ 2.
41
Example 5.5 – Solution (3 of 3)
4. The independence of the trials is guaranteed because of
the large group of adults from which the sample is
chosen. The probability of an adult favoring term limits
does not change depending on the responses of
previously chosen people.
5. The random variable x is the number of adults in the
sample who favor term limits.
42
The Binomial Probability Distribution (3 of 9)
Rule of Thumb
If the sample size is large relative to the population size—in
particular, if n ∕ N ≥ .05—then the resulting experiment is not
binomial.
44
The Binomial Probability Distribution (5 of 9)
45
Example 5.7
Find P(x = 2) for a binomial random variable with n = 10 and
p = .1.
Solution:
P(x = 2) is the probability of observing 2 successes and 8
failures in a sequence of 10 trials. You might observe the 2
successes first, followed by 8 consecutive failures:
S, S, F, F, F, F, F, F, F, F
46
Example 5.7 – Solution
Since p is the probability of success and q is the probability
of failure, this particular sequence has probability
47
The Binomial Probability Distribution (6 of 9)
48
The Binomial Probability Distribution (7 of 9)
Table 5.4
49
The Binomial Probability Distribution (8 of 9)
50
The Binomial Probability Distribution (9 of 9)
51
Example 5.11
A regimen consisting of a daily dose of vitamin C was tested
to see if it might be effective in preventing the common cold.
Ten people followed the prescribed regimen for a year, and
eight survived the winter without a cold.
52
Example 5.11 – Solution (1 of 5)
If you assume that the vitamin C regimen is ineffective, then
the probability p of surviving the winter without a cold is .5.
53
Example 5.11 – Solution (2 of 5)
1. The binomial formula:
54
Example 5.11 – Solution (3 of 5)
3. Output from MINITAB, MS Excel, or the TI-83 or TI-84
Plus: The outputs shown in figures give the cumulative
distribution function (cdf), which are the same
probabilities you found in the cumulative binomial tables.
Figure 5.5(a)
Figure 5.5(b)
55
Example 5.11 – Solution (4 of 5)
The probability density
function (pdf) gives the
individual binomial
probabilities, which you
found using the binomial
formula. The output from
the TI-84 Plus calculator
in Figure 5.5(c) shows the
binomial cdf in list L2 and
the pdf in list L3. TI-84 Plus screen captures for Example 5.11
Figure 5.5(c)
56
Example 5.11 – Solution (5 of 5)
Using the cumulative distribution function, calculate
P(x ≥ 8) = 1 − P(x ≤ 7)
= 1 − .94531 = .05469
57
Example 5.12
Would you rather take a multiple-choice or a full recall test?
If you have absolutely no knowledge of the material, you will
score zero on a full recall test. However, if you are given five
choices for each question, you have at least one chance in
five of guessing correctly!
58
Example 5.12 – Solution (1 of 3)
If x is the number of correct answers on the 100-question
exam, the probability of a correct answer, p, is one in five,
so that p = .2. Since the student is randomly selecting
answers, the n = 100 answers are independent, and the
expected score for this binomial random variable is
59
Example 5.12 – Solution (2 of 3)
Then, using your knowledge of variation from Tchebysheff’s
Theorem and the Empirical Rule, you can make these
statements:
• A large proportion of the scores will lie within two standard
deviations of the mean, or from 20 − 8 = 12 to 20 + 8 = 28.
• Almost all the scores will lie within three standard
deviations of the mean, or from 20 − 12 = 8 to 20 + 12 = 32.
60
Example 5.12 – Solution (3 of 3)
The “guessing” option gives the student a better score than
the zero score on the full recall test, but the student still will
not pass the exam.
61
5.3 The Poisson Probability Distribution
63
The Poisson Probability Distribution (2 of 7)
For example,
64
The Poisson Probability Distribution (3 of 7)
65
The Poisson Probability Distribution (4 of 7)
66
Example 5.13
The average number of traffic accidents on a certain section
of highway is two per week. Assume that the number of
accidents follows a Poisson distribution with μ = 2.
67
Example 5.13 – Solution (1 of 4)
1. The average number of accidents per week is μ = 2.
Therefore, the probability of no accidents on this
section of highway during a given week is
68
Example 5.13 – Solution (2 of 4)
The probability of at most three accidents during a 2-
week period is
69
Example 5.13 – Solution (3 of 4)
Therefore,
= .433471
Once the values for p(x) have been calculated, you can
use them to construct a probability histogram for the
random variable x.
70
Example 5.13 – Solution (4 of 4)
Graphs of the Poisson probability distribution for μ = .5, 2,
and 4 are shown in figure.
Figure 5.6
71
The Poisson Probability Distribution (5 of 7)
72
The Poisson Probability Distribution (6 of 7)
1 − P(x ≤ a)
73
The Poisson Probability Distribution (7 of 7)
74
Example 5.15
Suppose a life insurance company insures the lives of 5000
men aged 42. If actuarial studies show the probability that
any 42-year-old man will die in a given year to be .001, find
the exact probability that the company will have to pay x = 4
claims during a given year.
Solution:
The exact probability is given by the binomial distribution as
75
Example 5.15 – Solution
Calculating μ = np = (5000)(.001) = 5 and substituting into
the formula for the Poisson probability distribution, we have
76
5.4 The Hypergeometric Probability
Distribution
78
The Hypergeometric Probability Distribution (2 of 2)
79
Example 5.17
A case of wine has 12 bottles, 3 of which contain spoiled
wine. A sample of 4 bottles is randomly selected from the
case.
1. Find the probability distribution for x, the number of
bottles of spoiled wine in the sample.
80
Example 5.17 – Solution (1 of 2)
For this example, N = 12, n = 4, M = 3, and (N − M) = 9.
Then
81
Example 5.17 – Solution (2 of 2)
2. The mean is given by
82
Example 5.18
An industrial product is shipped in lots of 20. Testing to
determine whether an item is defective is costly; hence, the
manufacturer samples production rather than using a 100%
inspection plan.
The sampling plan calls for sampling five items from each
lot and rejecting the lot if more than one defective is
observed. (If the lot is rejected, each item in the lot is then
tested to isolate the defectives.) If a lot contains four
defectives, what is the probability that it will be accepted?
83
Example 5.18 – Solution (1 of 1)
Let x be the number of defectives in the sample. Then
N = 20, M = 4, (N − M) = 16, and n = 5. The lot will be
rejected if x = 2, 3, or 4. Then
84