Probability
Probability
Introduction
Although descriptive statistics consist of a set of
useful graphical and numerical methods, we are
particularly interested in developing statistical
inference about a population from a sample.
Our primary objective in this and the following
two chapters is to develop the probability-
based tools that are at the basis of statistical
inference.
However, probability can also play a critical role
in decision making, a subject we explore in
Chapter 22.
6-1 Random Experiment
A random experiment is an action or process
that leads to one of several possible outcomes.
The list of possible outcomes of an experiment
must be exhaustive and mutually exclusive.
Examples:
Illustration 1. Experiment: Flip a coin.
Outcomes: Heads and tails
Illustration 2. Experiment: Record marks on a
statistics test (out of 100).
Outcomes: Numbers between 0 and
100
Illustration 3. Experiment: Record grade on a
statistics test.
Outcomes: A, B, C, D, and F
6-1 Sample Space and Probability
Requirements
A sample space of a random experiment is a
list of all possible outcomes of the experiment.
The outcomes must be exhaustive and
mutually exclusive.
Using set notation, we represent the sample
space and its k outcomes as:
𝑆 = {𝑂! , 𝑂" , 𝑂# , … , 𝑂$ }
When we begin the task of assigning
probabilities to the outcomes, we must obey
two probability requirements:
1. The probability of any outcome must lie
between 0 and 1. That is,
𝑃(𝐴 and 𝐵)
𝑃(𝐴|𝐵 ) =
𝑃(𝐵)
𝑃(𝐴 and 𝐵)
𝑃 ( 𝐵 |𝐴 ) =
𝑃(𝐴)
𝑃(𝐴! and 𝐵! ) . 11
𝑃(𝐵! |𝐴! ) = = = .275
𝑃(𝐴! ) . 40
6-2d Independence
One of the objectives of calculating conditional
probability is to determine whether two events
are related.
In particular, we would like to know whether
they are independent events.
Two events A and B are said to be independent
if:
𝑃 (𝐴|𝐵) = 𝑃(𝐴)
or:
𝑃(𝐵|𝐴) = 𝑃(𝐵)
In other words, whether one event happens has
no influence on the likelihood that the other
event happens.
Example 6.1 – Determinants of Success
among Mutual Fund Managers – Part 3
Determine whether the event that the manager
graduated from a top-20 MBA program and the
event the fund outperforms the market are
independent events.
Solution:
A1 = Fund manager graduated from a top-20
MBA program
B1 = Fund outperforms the market
To determine whether A1 and B1 are
independent events, we need do calculate the
probability of A1 given B1:
𝑃(𝐴! and 𝐵! ) . 11
𝑃(𝐴! |𝐵! ) = = = .647
𝑃(𝐵! ) . 17
The marginal probability that a manager
graduated from a top-20 MBA program is:
𝑃(𝐴! ) = .40
Since the two probabilities are not equal, we
conclude that the two events are dependent.
6-2e Complement Rule and Union of Two
Events
The complement of event A is the event that
occurs when event A does not occur. The
C
complement of event A is denoted by A .
The complement rule defined here derives
from the fact that the probability of an event
and the probability of the event’s complement
must sum to 1.
𝑃(𝐴' ) = 1 − 𝑃(𝐴)
for any event A.
The union of events A and B is the event that
occurs when either A or B or both occur.
It is denoted as:
𝐴 or 𝐵
Example 6.1 – Determinants of Success
among Mutual Fund Managers – Part 4
Determine the probability that a randomly
selected fund outperforms the market, or the
manager graduated from a top-20 MBA
program.
Solution:
We want to compute the probability of the
union of two events:
P(A1 or B1)
The union A1 or B1 occurs when any of the
following joint events occurs:
• A1 and B1
• A2 and B1
• A1 and B2
• We can visualize the union A1 or B1
highlighting the corresponding three joint
probabilities in the table:
MBA MUTUAL FUND MUTUAL
PROGRAM OUTPERFORMS FUND DOES
MARKET NOT
OUTPERFORM
MARKET
𝑃(𝐴 and 𝐵)
𝑃 (𝐵 | 𝐴 ) =
𝑃(𝐴)
𝑃 (𝐴 and 𝐵) = 𝑃(𝐴)𝑃(𝐵)
Examples – Selecting Two Students
Example 6.5 – Without Replacemet
A graduate statistics course has seven male
and three female students. The professor
wants to select two students at random.
What is the probability that the two students
chosen are female?
Solution:
Let us define the following events:
A. the first chosen student is female.
B. the second chosen student is also female.
We need: 𝑃(𝐴 and 𝐵 ) = 𝑃(𝐴)𝑃(𝐵|𝐴)
# "
With:𝑃(𝐴) = and 𝑃(𝐵 |𝐴) =
!( )
# " *
Thus: 𝑃 (𝐴 and 𝐵) = = = .067
!( ) )(
# # )
Thus: 𝑃 (𝐴 and 𝐵) = = = .09
!( !( !((
𝑃(𝐴! or 𝐵! )
= 𝑃 (𝐴! 𝑎𝑛𝑑 𝐵" ) + 𝑃 (𝐴! 𝑎𝑛𝑑 𝐵! )
+ 𝑃(𝐴" 𝑎𝑛𝑑 𝐵! )
Example 6.7 – Applying the Addition Rule
In a large city, two newspapers are published,
the Sun and the Post.
The circulation departments report that 22% of
the city’s households have a subscription to
the Sun and 35% subscribe to the Post. A
survey reveals that 6% of all households
subscribe to both newspapers.
What proportion of the city’s households
subscribe to at least one newspaper?
Solution: We can reformulate the question
as: “what is the probability of selecting a
household at random that subscribes to the
Sun or the Post or both?”
𝑃(𝑆𝑢𝑛 or 𝑃𝑜𝑠𝑡)
= 𝑃(𝑆𝑢𝑛) + 𝑃 (𝑃𝑜𝑠𝑡 )
− 𝑃(𝑆𝑢𝑛 and 𝑃𝑜𝑠𝑡)
𝑃(𝑆𝑢𝑛 or 𝑃𝑜𝑠𝑡) = .22 + .35 − .06 = .51
Interpret: There is a 51% probability that a
randomly selected household subscribes to
one or the other or both papers.
6-3d Probability Trees
In a probabilty tree, the events in an
experiment are represented by branches,
which are lines linked to each other. Then, we
calculate the joint probabilities by multiplying
the probabilities on the linked branches.
Parallel branches from the same node are
mutually exclusive and can be added
together.
Consider again the example of the probability
of selecting two female students at random
from a graduate statistics course that has
seven male and three female students, either
without or with replacement.
*See notes for additional considerations.
Probability Tree for Exercise 6.5
𝑃(𝐴 and 𝐵)
𝑃 (𝐴 | 𝐵 ) =
𝑃(𝐵)
𝑃 (𝐴% |𝐵)
𝑃(𝐴% )𝑃(𝐵|𝐴% )
=
𝑃(𝐴! )𝑃(𝐵|𝐴! ) + 𝑃(𝐴" )𝑃(𝐵|𝐴" ) + ⋯ + 𝑃(𝐴$ )𝑃(𝐵|𝐴$ )
where:
B is the given event,
A1, A2,…, Ak are the events with known prior
probabilities P(A1), P(A2),…, P(Ak),
P(B|A1), P(B|A2),…, P(B|Ak) are the likelihood
probabilities,
P(Ai|B), with i = 1, 2,…, k are the posterior
probabilities we seek.
Example 6.9 – Solving with Bayes’s Law
Formula
Should an MBA Applicant Take a
Preparatory Course?
We define the events as follows:
A1 = GMAT score is 650 or more
A2 = GMAT score is less than 650
B = Student took preparatory course
The provided prior probabilities are:
P(A1) = .10 (probability student scored 650 or
more)
P(A2) = 1 − .10 = .90 (complement probability
– probability student scored less than 650)
The provided conditional (likelihood)
probabilities are:
P(B|A1) = .52 (probability student took prep
course among those scoring 650 or more)
P(B|A2) = .23 (probability student took prep
course among those scoring less than 650)
The Bayes’s Law formula yields the probability
a student scores 650 or more after taking the
prep course:
𝑃(𝐴! )𝑃(𝐵|𝐴! )
𝑃(𝐴! |𝐵) =
𝑃(𝐴! )𝑃(𝐵|𝐴! ) + 𝑃(𝐴" )𝑃(𝐵|𝐴" )
. 10(.52) . 052
= =
. 10(. 52) + .90(.23) . 052 + .207
= .201
6-5 Identifying the Correct Method
The key issue is to determine whether joint
probabilities are provided or are required:
Joint probabilities are provided
1. Compute marginal probabilities by adding
across rows and down columns.
2. Use the joint and marginal probabilities to
compute conditional probabilities.
3. Determine whether the events described by
the table are independent.
4. Apply the addition rule to compute the
probability that either of two events occur.
Joint probabilities are required
1. Apply probability rules or build a probability
tree.
2. Use the multiplication rule to calculate the
probability of intersections.
3. Apply addition and complement rules for
mutually exclusive events.
4. Compute posterior probability using the
Bayes’s Law.
Chapter Summary
• The first step in assigning probability is to
create an exhaustive and mutually exclusive
list of outcomes.
• The second step is to use the classical,
relative frequency, or subjective approach
and assign probability to the outcomes.
• A variety of methods are available to
compute the probability of other events.
These methods include probability rules
and trees.
• An important application of these rules is
Bayes’s Law, which allows us to compute
conditional probabilities from other forms
of probability.
Chapter 7: Random Variables and Discrete
Probability Distributions
7-1 Random Variables and Probability
Distributions
A random variable is a function or rule that
assigns a number to each outcome of an
experiment.
There are two types of random variables:
A discrete random variable takes on a
countable number of values.
• e.g., values on the roll of two dice: 2, 3, 4, ...,
12
A continuous random variable takes on an
uncountable number of values.
• e.g., time: 30.1 minutes? 30.0001 minutes?
30.0000001 minutes?
• We will consider continuous random
variables in Chapter 8.
A probability distribution is a table, formula,
or graph that describes the values of a
random variable and the probability
associated with these values.
• P(X = x) or simply P(x).
7-1a Discrete Probability Distributions
The probabilities of the values of a discrete
random variable may be derived by means of
probability tools such as tree diagrams or by
applying one of the definitions of probability.
These two conditions apply:
1 35.2
2 43.5
3 19.5
4 16.2
5 7.3
6 2.8
7 or more 1.6
Total 126.1
Number of P(x)
Persons
0 SCSCSC .512
𝐸 (𝑋 ) = 𝜇 = g 𝑥𝑃(𝑥)
/00 1
𝜎 = j𝜎 "
Example 7.3 – Describing the Population of
the Number of Persons per Household
Find the mean, variance, and standard
deviation for the population of the number of
persons per household in Example 7.1.
For this example, we will assume that the last
category is exactly seven persons.
The mean of X is:
𝐸 (𝑋) = 𝜇 = g 𝑥𝑃(𝑥)
/00 1
= 1𝑃(1) + 2𝑃(2) + ⋯ + 7𝑃(7)
= 1(. 279) + 2(. 345) + ⋯ + 7(. 013)
= 2.46
The variance of X is:
1. 𝐸 (𝑐 ) = 𝑐
2. 𝐸 (𝑋 + 𝑐 ) = 𝐸 (𝑋 ) + 𝑐
3. 𝐸 (𝑐𝑋) = 𝑐𝐸(𝑋)
where c is a constant.
Laws of Variance
1. 𝑉 (𝑐) = 0
2. 𝑉 (𝑋 + 𝑐) = 𝑉 (𝑋 )
3. 𝑉 (𝑐𝑋) = 𝑐 " 𝑉(𝑋)
Example 7.4 – Describing the Population of
Monthly Profit
The monthly sales at a computer store have a
mean of $25,000 and a standard deviation of
$4,000. Profit is calculated by multiplying sales
by 30% and subtracting a fixed cost of $6,000.
Question: find the mean and standard
deviation of monthly profit.
We can describe the relationship between
profit and sales as: Promit = .30 (Sales) −
6,000
Solution: expected value
The expected or mean profit is:
0 1 2
y P(Y=y)
= g g (𝑥 − 𝜇. )v𝑦 − 𝜇8 w𝑃(𝑥, 𝑦)
/00 1 /00 8
Coefficient of correlation:
𝜎.8
𝜌=
𝜎. 𝜎8
x+
Events P(x + y)
y
0 P(0,0) .12
4 P(2,2) .01
1. 𝐸 (𝑋 + 𝑌) = 𝐸 (𝑋) + 𝐸(𝑌)
2. 𝑉 (𝑋 + 𝑌 ) = 𝑉 (𝑋) + 𝑉 (𝑌) + 2 𝐶𝑂𝑉(𝑋, 𝑌)
If X and Y are independent, 𝐶𝑂𝑉(𝑋, 𝑌) = 0,
thus:
𝑉 (𝑋 + 𝑌) = 𝑉 (𝑋) + 𝑉 (𝑌 )
Applying the formulas to Example 7.5, we
obtain the same values shown for the
distribution of the sum of two variables:
𝑝 . (1 − 𝑝);<.
The combinatorial formula (*see notes for an
explanation of the factorial) yields the count
of branch sequences that produces x
successes and n – x failures:
𝑛!
𝐶.; =
𝑥! (𝑛 − 𝑥)!
Where n! = n(n-1)(n-2)…(2)(1).
For example, 3! = 3 x 2 x 1 = 6.
Incidentally, although it may not appear to be
logical, 0! = 1.
Combining the two components yields the
binomial probability distribution:
𝑛!
𝑃 (𝑥 ) = 𝑝 . (1 − 𝑝);<. for 𝑥
𝑥! (𝑛 − 𝑥 )!
= 1, 2, … , n
Example 7.9 – Pat Statsdud and the
Statistics Quiz
Pat Statsdud is a (not good) student taking a
statistics course. Pat’s exam strategy is to rely
on luck for the next quiz.
The quiz consists of 10 multiple-choice
questions. Each question has five possible
answers, only one of which is correct. Pat
plans to guess the answer to each question.
a. What is the probability that Pat gets no
answers correct?
b. What is the probability that Pat gets two
answers correct
Solution:
This is a binomial experiment because:
1. n = 10
2. Two outcomes: correct and incorrect
answer.
3. Probability of correct answer: p = 1/5 = .2.
4. Answers to questions are independent.
We can apply the binomial probability
distribution to answer both questions:
!(!
a. x = 0: 𝑃(0) = . 2( . 8(!(<() =
(!(!(<()!
1(1)(.8)!( = .1074
!(!
b. x = 2: 𝑃(2) = . 2" . 8(!(<") =
"!(!(<")!
)∙!(
(.2)" (.8)A
"
𝑃(𝑋 ≤ 𝑥 ) = g 𝑃(𝑋 = 𝑥)
B&(
𝑃(𝑋 ≥ 5) = 1 − 𝑃 (𝑋 ≤ 4) = 1 − .9672
= .0328
*See notes for instructions to calculate
binomial probabilities with Excel.
Cumulative Binomial Probabilities with n =
10 and p = .2
X P(X ≤ x)
0 .1074
1 .3758
2 .6778
3 .8791
4 .9672
5 .9936
6 .9991
7 .9999
8 1.0000
𝜇 = 𝑛𝑝
𝜎 " = 𝑛𝑝(1 − 𝑝)
𝜎 = j𝑛𝑝(1 − 𝑝)
Example 7.11: suppose that a professor has a
class full of students like Pat.
a. What is the mean mark? 𝜇 = 𝑛𝑝 =
10(. 2) = 2
b. What is the standard deviation? 𝜎 =
j𝑛𝑝(1 − 𝑝) = j10(. 2). 8) = 1.26
7-5 Poisson Distribution
A binomial random variable is the number of
successes in a set number of trials, whereas a
Poisson random variable is the number of
successes in an interval of time or specific
region of space.
Poisson Experiment
1. The number of successes that occur in any
interval is independent of the number of
successes that occur in any other interval.
2. The probability of a success in an interval is
the same for all equal-size intervals.
3. The probability of a success in an interval is
proportional to the size of the interval.
4. The probability of more than one success in
an interval approaches 0 as the interval
becomes smaller.
7-5 Poisson Probability Distribution
The probability that a Poisson random variable
assumes a value of x in a specific interval is:
𝑒 <C 𝜇$
𝑃 (𝑥 ) = for 𝑥 = 0, 1, 2, …
𝑘!
Where μ is the mean number of successes in
the interval or region and e is the base of the
natural logarithm (approximately 2.71828).
Incidentally, the variance of a Poisson random
variable is equal to its mean; that is, 𝜎 " = 𝜇.
Examples - Probability of the Number of
Typographical Errors in Textbooks …
A statistics instructor has observed that the
number of typos in new editions of textbooks is
Poisson distributed with a mean of 1.5 per 100
pages. The instructor randomly selects 100
pages of a new book.
Example 7.12 – … in 100 pages
We want to determine the probability that a
Poisson random variable with a mean of 1.5 is
equal to 0.
Using the formula for Poisson probability
distribution, with x = 0, and μ = 1.5, we get:
𝑒 <C 𝜇$ 𝑒 <!.E 1.5(
𝑃 ( 0) = = = 𝑒 <!.E = .2231
𝑘! 0!
The probability that in the 100 pages selected
there are no errors is .2231.
Example 7.13 – ... in a textbook of 400 pages
Calculate the probability that for a book of 400
pages there are (a) no typos, and (b) no more
than five typos.
If there are 1.5 error per 100 pages, then there
must be 4 x 1.5 = 6 errors per 400 pages of
textbook, or μ = 6.
a. Thus, the probability that there are no typos
is:
𝑒 <C 𝜇$ 𝑒 <* 6(
𝑃 (0 ) = = = 𝑒 <* = .002479
𝑘! 0!
b. And the probability that there five or less
typos is:
𝑃 (𝑋 ≤ 5)
= 𝑃(0) + 𝑃(1) + 𝑃 (2) + 𝑃(3) + 𝑃(4)
+ 𝑃 ( 5)
= .002479 + .01487 + .04462 + .08924
+ .1339 + .1606 = .4457
7-5a Poisson Table
Table 2 in Appendix B of the textbook provides
cumulative Poisson probabilities for selected
values of μ.
We can use this table to answer the question in
Example 7.13, part b, where we need P(X ≤ 5).
The table to the right shows the values of P(X ≤
5) for μ = 6.
We can use the table and the complement rule
to determine probabilities of the type P(X = x)
and P(X ≥ x).
For example, to find the probability that there
are exactly ten typos:
X P(X ≤ x)
0 .0025
1 .0174
2 .0620
3 .1512
4 .2851
5 .4457
6 .6063
7 .7440
8 .8472
9 .9161
10 .9574
11 .9799
12 .9912
13 .9964
Chapter Summary
• There are two types of random variables:
• A discrete random variable whose
values are countable.
• A continuous random variable that can
assume an uncountable number of
values.
• In this chapter, we defined the expected
value, variance, and standard deviation of a
population described by a discrete random
variable and represented by a discrete
probability distribution.
• We also introduced bivariate discrete
distributions based on an important
application in finance.
• Finally, we presented the two most
important discrete distributions: the
binomial and the Poisson distribution.
Chapter 8: Continuous Probability Distributions
8-1 Continuous Random Variable
A continuous random variable can assume an
uncountable number of values, and differs from
a discrete variable in several respects:
• We cannot list the possible values of a
continuous random variable because there
is an infinite number of them.
• Because there is an infinite number of
values, the probability of each individual
value is virtually 0.
• Thus, we can only calculate the probability
of a range of values.
The following requirements apply to a
probability density function f(x) whose range
is a ≤ x ≤ b.
1. f(x) ≥ 0 for all x between a and b.
2. The total area under the curve between a
and b is 1.
8-1 Probability Density Functions
Estimated Probability between 25 and 45
Years from the Histogram for Example 3.1
P(x1 ≤ X ≤ x2)
1 1
𝑓 (𝑥 ) = = when 2,000 ≤ 𝑥
5,000 − 2,000 3,000
≤ 5,000
a. 𝑃(2,500 ≤ 𝑥 ≤ 3,000) = (3,000 −
!
2,500) = .1667
#(((
!
b. 𝑃(𝑥 ≥ 4,000) = (5,000 − 4,000) =
#(((
.3333
c. 𝑃(𝑥 = 2,500) = 0
8-1b Using a Continuous Distribution to
Approximate a Discrete Distribution
In our definition, we distinguish between
discrete and continuous random variables by
noting whether the number of possible values is
countable or uncountable.
In practice, we use a continuous distribution to
approximate a discrete one when the number of
values the variable can assume is countable but
large.
Example: the values of weekly income,
expressed in dollars.
It is a countable variable, but because the
outcomes are so many, it is preferrable to
employ a continuous probability distribution to
determine the probabilities associated with the
variable.
8-2 Normal Distribution
The normal distribution is the most important of
all probability distributions because of its crucial
role in statistical inference.
Normal Density Function
The probability density function of a normal
random variable is:
1 ! .<C !
< F H
𝑓 (𝑥 ) = 𝑒 " G
𝜎√2𝜋
where e = 2.71828... and π = 3.14159...
The normal distribution is described by the
mean μ and the standard deviation σ.
a. Notice how a normal distribution is
symmetric about its mean and the random
variable ranges between –∞ and + ∞.
b. Changing the value of μ shifts the
distribution left or right.
c. Increasing the value of σ makes the
distribution wider.
𝑃(𝑍 > 𝑍I ) = 𝐴
To determine the value of Z associated to a
given probability, we need to use the table
backward, as shown in the next example.
In Chapter 4, we introduced percentiles, which
are measures of relative standing. The values of
ZA are the 100(1 - A)th percentiles of a standard
normal random variable.
For example, Z0.05 = 1.645, which means that
1.645 is the 95th percentile.
Example 8.4 – Finding Z.05
Find the value of a standard normal random
variable such that the probability that the
random variable is greater than this quantity is
5% (Z.05).
Solution
The probability that z is less than Z.05 must be
1 – .05 = .9500. To find Z.05, we locate .9500 on
the table.
As you can se from the figure to the right, there
are two values of Z that are equally close: .9495
and .9505.
Those two probabilities correspond to the Z-
values of 1.64 and 1.65, respectively.
Thus, we can say that: Z.05 = 1.645.
Example 8.5 – Finding –Z .05
Find the value of a standard normal random
variable such that the probability that the
random variable is less than this quantity is 5%
(−Z −.05).
Solution
Because the standard normal curve is symmetric
about 0, and we know that Z.05 = 1.645, we can
say that:
−Z.05 = −1.645.
=NORM.DIST([X],[μ],[σ],True)
Alternatively, type
=NORM.S.DIST([Z],True)
=NORMDIST(1100,1000,100,True)
Or
=NORM.S.DIST(1. 00,True)
=NORM.S.INV([1 - A])
=NORMSINV(.95)
=NORM.INV([1 – A],[μ],[σ])
=NORM.INV(.99,490,61)
which yields 632.
𝑓 (𝑥) = 𝜆𝑒 <L. 𝑥 ≥ 0
where e = 2.71828… and λ is the parameter of
the distribution. The image to the right show
the exponential distribution for different values
of λ.
The mean of an exponential random variable is
equal to its standard deviation:
μ = σ = 1/λ
Probability Associated with an Exponential
Random Variable
If X is an exponential random variable:
𝑃 (𝑋 > 𝑥 ) = 𝑒 <L.
𝑃(𝑋 < 𝑥 ) = 1 − 𝑒 <L.
𝑃 (𝑥! < 𝑋 < 𝑥" ) = 𝑒 <L.! − 𝑒 <L."
=EXPON.DIST([X],[λ],True)
=EXPON.DIST(20,.05,True)
=T.DIST([x],[ν],[True])
=T.DIST(1.812,10,TRUE) = 0.9500
To determine tA type:
=T.INV([1-A],[ν])
=T.INV(0.95,10)=1.812
=CHI.DIST([x],[ν])
For example, =
CHI.DIST(6.25,3) = .100
=CHI.INV([A],[ν])
For example,
=CHI.INV(.100,3) = 6.25
8-4c F Distribution
The F distribution is also positively skewed (see
image) and ranges between 0 and ∞. The
shape of the F distribution depends on two
numbers of degrees of freedom.
F Density Function
𝜈! + 𝜈" N" N" <"
𝛤 š › 𝜈! " 𝐹 "
𝑓 (𝐹 ) = 2
𝜈! 𝜈" •𝜈" Ž N" 9N! 𝐹
𝛤š ›𝛤š › 𝜈 𝐹 "
2 2 š1 + ! ›
𝜈"
>0
Where ν1 and ν2 are the numerator and
denominator degrees of freedom, respectively,
and Г the Gamma function (not defined here.)
The mean and variance of an F random variable
are:
𝜈"
𝐸 (𝐹 ) = 𝜈" > 2 and 𝑉 (𝐹 )
𝜈" − 2
2𝜈"" (𝜈! + 𝜈" − 2)
= 𝜈 >4
𝑣! (𝜈" − 2)" (𝜈" − 4) "
Note how the mean of the F distribution
depends only on ν2 and that for large ν2 it
approaches 1.
8-4c Determining F Probabilities and Values
Probabilities: just as it was for the Student t and
Chi-squared random variables, calculate
probabilities for the F random variable manually
is not practical either because we would need a
different table for each pair of ν1 and ν2.
Alternatively, we can use Microsoft Excel (*see
notes.)
Values: we define FA,ν1,ν2 as the value of F with ν1
and ν2 degrees of freedom such that the area to
its right under the curve is A; that is, P(F > FA,ν1,ν2)
= A (see Table 6 in Appendix B of the textbook.)
Values of F1−A,ν1,ν2 are unavailable, but it can be
shown that F1−A,ν1,ν2 = 1 / FA,ν1,ν2.
Example: find the value of F.05,5,7 in Table 6.
Locate the numerator degrees of freedom, 5,
across the
top and the denominator degrees of freedom,
7, down the
left column. The intersection is 3.97. Thus, F.05,5,7
= 3.97.
Remember that the order of the degrees of
freedom is
important, F.05,5,7 ≠ F.05,7,5.
Suppose that we want to determine the left tail
of an F
distribution with ν1 = 4 and ν2 = 8 such that the
area to its
right is .95. Thus, F.95,4,8 = 1 / F.05,4,8 = 1 / 6.04 =
0.166.
Excel Instructions:
Chapter Summary
• This chapter dealt with continuous random
variables and their distributions.
• We showed how to compute the probability
of a range of values as the area in the
interval under the density function.
• We introduced the most important
distribution in statistics and showed how to
compute the probability that a normal
random variable falls into any interval.
• Additionally, we demonstrated the inverse
problem of finding values of a normal
random variable given a probability.
• Next, we introduced the exponential
distribution, a distribution that is particularly
useful in several management science
applications.
• Finally, we presented three more
continuous random variables and their
probability density functions. The Student t,
chi-squared, and F distributions will be used
extensively in statistical inference.