Session 11-15: Dr. Anup Kumar
Session 11-15: Dr. Anup Kumar
In most of these experiments, we were not only interested in the particular outcome that
occurs but rather in some number associated with that outcomes. For example
(i) In tossing two dice, we may be interested in the sum of the numbers on the two dice.
(ii) In tossing a coin 50 times, we may want the number of heads obtained.
(iii) In the experiment of taking out four articles (one after the other) at random from a lot
of 20 articles in which 6 are defective, we want to know the number of defectives in
the sample of four.
4
Random Variables:
In all of the above experiments
We have a rule which assigns to each outcome of the experiment a single
real number. This single real number may vary with different outcomes of
the experiment. Hence, it is a variable.
Also its value depends upon the outcome of a random experiment. and,
hence, is called random variable
5
6
Example:1 Consider the experiment of tossing a coin two times in succession.
The sample space of the experiment is S = {HH, HT, TH, TT}.
If X denotes the number of heads obtained, then X is a random variable and for each
outcome, its value is as given below :
X(HH) = 2, X (HT) = 1, X (TH) = 1, X (TT) = 0.
Again if Y denote the number of tails for each outcome of the above sample space S.
Consider the experiment of tossing a fair coin three times independently. Define the random
variable X to be the number of heads obtained in the three tosses. A complete enumeration
of the value of X for each point in the sample space is
9
Question:1 An urn contains 5 red and 2 black balls. Two balls are
randomly drawn without replacement. Let X represent the number of
black balls. What are the possible values of X? Is X a random
variable ?
10
Continuous random variables take on values at every point over a
given interval.
11
Requirements for the
Probability Distribution of a Discrete Random Variable x
12
Discrete Probability Distribution Example
Probability Distribution
Values, x Probabilities, p(x)
0 1/4 = .25
1 2/4 = .50
2 1/4 = .25
13
Visualizing Discrete Probability Distributions
Listing Table
.25
.00 x
0 1 2
14
Summary Measures
1. Expected Value (Mean of probability distribution)
• Weighted average of all possible values
• = x p(x)
2. Variance
• Weighted average of squared deviation about
mean
• 2 = (x p(x)
3. Standard Deviation
● 2 15
Expected Value & Variance Solution*
= .71
16
Question:2 Two cards are drawn successively with replacement from a well-shuffled deck of 52
cards. Find the probability distribution of the number of aces.
17
18
Question: 3 The table shown below is a discrete distribution that contains the number of crises that
could occur during the day in a department and the probability that each number will occur. Find mean
and variance.
19
20
Question:4
Let X denote the number of hours you study during a randomly selected school day. The
probability that X can take the values x, has the following form, where k is some
unknown constant.
21
22
The Binomial Distribution
23
Binomial Distribution
n x n x n!
p ( x) p q p x (1 p ) n x
x x ! ( n x )!
25
Binomial Distribution
n!
Probability
p q
X n X
P( X ) for 0 X n
function X ! n X !
Mean value
n p
Variance and
standard
deviation
2
n pq
2
n pq
26
Binomial Probability Distribution
Question-1 Toss 1 coin 5 times in a row. Note number of tails. What’s
the probability of 3 tails? Find mean and standard deviation.
n!
p( x) p x (1 p ) n x
x !( n x )!
5!
p (3) .53 (1 .5) 5 3
3!(5 3)!
.3125
=np=5*.5=2.5
2 5 * .5 * .5 .125 27
Binomial Probability Table (Portion)
28
Binomial Distribution Characteristics
P(X)
1.0
n = 5 p = 0.1
Mean .5
np .0 X
0 1 2 3 4 5
Standard Deviation
npq .6
P(X) n = 5 p = 0.5
.4
.2
.0 X
0 1 2 3 4 5
29
Graphs of Selected Binomial Distributions
n = 4 PROBABILITY P = 0.5
1.000
X 0.1 0.5 0.9 0.900
0.800
0 0.656 0.063 0.000 0.700
0.600
1 0.292 0.250 0.004 0.500
P(X)
0.400
P = 0.1 P = 0.9
1.000 1.000
0.900 0.900
0.800 0.800
0.700 0.700
0.600 0.600
0.500 0.500
P(X)
P(X)
0.400 0.400
0.300 0.300
0.200 0.200
0.100 0.100
0.000 0.000
0 1 2 3 4 0 1 2 3 4
X X
30
Question:2 You’re a telemarketer selling service contracts for
Macy’s. You’ve sold 20 in your last 100 calls (p = .20). If you call
12 people tonight, what’s the probability of
A. No sales?
B. Exactly 2 sales?
C. At most 2 sales?
D. At least 2 sales?
31
Example:2
n = 12, p = .20
A. p(0) = .0687
B. p(2) = .2835
C. p(at most 2) = p(0) + p(1) + p(2) = .0687 + .2062 + .2835
= .5584
D. p(at least 2) = p(2) + p(3)...+ p(12)= 1 – [p(0) + p(1)]
= 1 – .0687 – .2062 = .7251
32
Question-3
A Gallup survey found that 65% of all financial consumers were very
satisfied with their primary financial institution. Suppose that 25
financial consumers are sampled and if the Gallup survey result still
holds true today, what is the probability that exactly 19 are very satisfied
with their primary financial institution?
Solution:
33
Question-:4
Solution:
34
Question:5
Purchasing magazine reported the result of a survey in which
buyers were asked a series of questions with regard to
Internet usages. One question asked was how they would use
the Internet if security and other issue could be resolved.
78% said they would use it for pricing information, 75% said
they would use it to send purchase orders, and 70% said they
would use it for purchase order acknowledgements.
Assume that these percentages hold true for all buyers. A
researcher randomly samples 20 buyers and asks them how
they would use the Internet if security and other issues could
be resolved.
35
Questions:5
• What is the probability that exactly 14 of these buyers would
use the Internet for pricing information?
36
a) n = 20 p = .78 x = 14
Solution:
20
C14 (.78)14(.22)6 = 38,760(.030855)(.00011338)
= .1356
b) n = 20 p = .75 x = 20
20
C20 (.75)20(.25)0 = (1)(.0031712)(1) = .0032
c) n = 20 p = .70 x < 12
Use table :
P(x=0) + P(x=1) + . . . + P(x=11) =
38
Examples of Poisson-type situations include the following:
39
Poisson Distribution: Other Applications
P( X )
X
e for X 0,1, 2,3,...
X!
where :
long run average
e 2.718282... (the base of natural logarithms)
41
Question:1
Bank customers arrive randomly on weekday afternoons at an average of 3.2
customers every 4 minutes. What is the probability of having
(a) 8 customers in a 4-minute interval on a weekday afternoon?
(b) 9 customers in a 4-minute interval on a weekday afternoon?
(c) 10 customers in a 4-minute interval on a weekday afternoon?
42
Question:2
A bank has an average random arrival rate of 3.2 customers every 4 minutes.
What is the probability of getting exactly 10 customers during an 8-minute
interval?
43
Approximating Binomial Problems by
the Poisson Distribution
Binomial problems with large sample sizes and small values of p, which
then generate rare events, are potential candidates for use of the Poisson
distribution.
44
Question-3
Q. Suppose the probability of a bank making a mistake in processing a
deposit is .0003. If 10,000 deposits (n) are audited, what is the probability
that more than 6 mistakes were made in processing deposits?
45
Question:4
Poisson Distribution: Using the Poisson Tables
𝑃 ( 𝑋 >5 )=0
47
Question:5
On Monday mornings, the First National Bank only has one teller window open
for deposits and withdrawals. Experience has shown that the average number of
arriving customers in a four-minute interval on Monday mornings is 2.8, and each
teller can serve more than that number efficiently. These random arrivals at this
bank on Monday mornings are Poisson distributed.
What is the probability that on a Monday morning exactly six customers will
arrive in a four-minute interval?
What is the probability that no one will arrive at the bank to make a deposit or
withdrawal during a four-minute interval?
48
Question:6
51
Doubt Session
1.Discussion on Project
2.Demo in Excel
3.Discussion on Probability Distribution (2-
Questions)
4.Discussion on Binomial Distribution (2-
Questions)
5.Discussion on Poisson Distribution (2-
Questions)
52
Probability Distribution
Question1. Find the probability distribution table and probability
distribution graphs for the number of heads if a coin tossed 5 times.
Answer In this question, X is the random variable X that
: represents the number of heads.
0.15
0.1
0.05
0
0 1 2 3 4 5
Random Variables (X)
54
X P(X) X*P(X) (X-Mean)^2 ((X-Mean)^2)*P(X)
0.00 0.03 0.00 6.23 0.19
1.00 0.16 0.16 2.24 0.35
2.00 0.31 0.62 0.25 0.08
3.00 0.31 0.94 0.25 0.08
4.00 0.16 0.62 2.26 0.35
5.00 0.03 0.16 6.27 0.19
Mean 2.50
Variance 1.25
56
Solution: 2. The expected payoff for a
$1 ticket in this game is
60.2 cents. If a person plays
the game for a long time, he
or she could expect to
average about 60 cents in
winnings.
58
Binomial Distribution
Question:1 According to Information Resources, which publishes data on market share
for various products, Oreos control about 10% of the market for cookie brands. Suppose 20
purchasers of cookies are selected randomly from the population. What is the probability
that fewer than four purchasers choose Oreos?
Solution:
For this problem, n = 20, p = .10, and P(x< 4)
1
Determining the probability of getting x < 4 involves summing the probabilities for x = 0,
1, 2, and 3.
If 10% of all cookie
purchasers prefer Oreos
and 20 cookie purchasers
are randomly
selected, about 86.7% of
the time fewer than four of
the 20 will select Oreos.
59
Question:2 In the past few years, outsourcing overseas has become more frequently used than ever
before by U.S. companies. However, outsourcing is not without problems. A recent survey by
Purchasing indicates that 20% of the companies that outsource overseas use a consultant. Suppose 5
companies that outsource overseas are randomly selected.
a. What is the probability that exactly five companies that outsource overseas use a consultant?
b. What is the probability that more than 4 companies that outsource overseas use a consultant?
c. What is the probability that none of the companies that outsource overseas use a consultant?
d. What is the probability that between 3 and 5 (inclusive) companies that outsource overseas use a
consultant?
e. Construct a graph for this binomial distribution. In light of the graph and the expected value,
explain why the probability results from parts (a) through (d) were obtained.
60
Solution:2
61
Poisson Distribution
Question:1 A medical researcher estimates that .00004 of the population has a rare blood
disorder. If the researcher randomly selects 100,000 people from the population,
What is the probability that three people will have the rare blood disorder?
What is the probability that more than 2 people will have the rare blood disorder?
The family did not make a trip to an amusement park last year.
The family took exactly one trip to an amusement park last year.
The family took two or more trips to amusement parks last year.
The family took three or fewer trips to amusement parks over a three-year period.
The family took exactly four trips to amusement parks during a six-year period.
63
Solution
:
λ=0.6 trip/year, Let X represents number of trips each year.
− 0.6
(a) P(X=0)= 𝑒 ( 0.6 ) ^ 0
=0.5488
− 0.6
0! ^ 1
(b) P(X=1)= 𝑒 ( 0.6 )
=0.3293
1!
(c) P(X=>2)=1-P(X<2)=1-[P(0)+P(1)]=0.1219
If the data follow Normal distribution then find the area left to the point
x=100 while the mean is 150 and standard deviation is 100
Answer
=Norm.Dist(100,150,100,1)
=0.3085
Question:2
If the data follow Normal distribution then find the area left to
the point x=100 while the mean is 150 and standard deviation is
100. Use the concept of standard normal distribution.
Answer
𝒙 −𝝁 𝟏𝟎𝟎 − 𝟏𝟓𝟎 =Norm.S.Dist(-0.5,1)
𝒛= = =− 𝟎 .𝟓
𝝈 𝟏𝟎𝟎
=0.3085
Question:2
In the given Normal distribution curve, Find the area right to the point
x=100 while the mean is 150 and standard deviation is 10
Answer:
P(X>100)=1-P(X=<100)
=1-Norm.Dist(100,150,10,1)
=0.9999997
Question:3
In the given Normal distribution curve, Find the area between x=100
and x=200 while the mean is 150 and standard deviation is 10
Answer: P(100<X<200)=P(X<200)-P(X<100)
=Norm.Dist(200,150,10,1)-Norm.Dist(100,150,10,1)
=0.9999994
Question:4
In the given Normal distribution curve, Find the area right to the point
x=100 while the mean is 150 and standard deviation is 10. Use the
concept of standard normal distribution.
𝑍=
𝜎 (
𝑥 − 𝜇 100 −150
=
10 )=−5
Answer: Norm.S.Dist(-5,1)=0.000000287
Question:5
In the given Normal distribution curve, Find the area between x=100 and
x=200 while the mean is 150 and standard deviation is 10. Use concept
of Standard Normal Distribution
Answer: Norm.D.Dist(5,1)-Norm.S.Dist(-5,1)=0.999999427
Question-6
1. If in the given standard normal distribution graph, coloured area is
.30, then find the value of z.
Answer:2 NORM.INV(0.30,100,10)
Question-7
If in the given normal distribution graph, coloured area is .60, then
find the value of z.
If in the given normal distribution graph with mean 100 and standard
deviation is 10, coloured area is .60, then find the value of x.
𝜇=494 , 𝜎=100
¿ 𝑃 ( 𝑋 >700 ) =1−𝑃 ( 𝑋 ≤700 )
¿
¿
STANDARD NORMAL DISTRIBUTION APPROACH
(Question-1)
𝑥 − 𝜇 700 − 494
𝑧= = =2.06
𝜎 100 𝑃 ( 𝑋 >700 )=𝑃 ( 𝑍 >2.06 )=1− 𝑃 ( 𝑍 ≤ 2.06 )
¿
¿
=1 − 𝑁𝑜𝑟𝑚 . 𝑆 . 𝐷𝑖𝑠𝑡 ( 2.06,1 )
=0.0197
Assignment Questions:2
NORMAL DISTRIBUTION APPROACH
(Question-2)
¿
What if the data (Population) is not normally distributed ?
Central Limit Theorem: The Central Limit Theorem states that
the distribution of the sample mean approaches to normal
distribution provided the sample size is more than 30.
(
𝑋 ∼ 𝑁 𝜇,
𝜎
√𝑛 )
As we increase the sample size, the graph of sample
mean converges to Normal Distribution
𝑋 ∼ 𝑁 𝜇,
(𝜎
√𝑛 )
Question:1 A population has a mean of 50 and a standard deviation of 10. If a random
sample of 64 is taken, what is the probability that the sample mean is
a. Greater than 52
b. Less than 51
c. Less than 47
P(=1-P(
P(
𝑿 −𝝁
𝝈 𝒁∼
𝝈
√𝒏
𝑋 𝑍
Question:1 A population has a mean of 50 and a standard deviation of 10. If a random sample of 64
is taken, what is the probability that the sample mean is Greater than 52. Use Standard Normal
Concept.
𝑥 −𝜇 52 −50 2
𝑧= = = =1.6
𝜎 10 10
√𝑛 √ 64 8
¿ 𝑃 ( 𝑥 >52 )= 𝑃 ( 𝑧 >1.6 ) =1− 𝑃 ( 𝑧 ≤ 1.6 )
=1 − 𝑁𝑜𝑟𝑚 . 𝑆 . 𝐷𝑖𝑠𝑡 ( 1.6,1 )
=1 −0.9452 =0.0548
Question:1 A population has a mean of 50 and a standard deviation of 10. If a
random sample of 64 is taken, what is the probability that the sample mean is Less
than 51. Use Standard Normal Concept.
𝒙 −𝝁 𝟓𝟏 −𝟓𝟎
𝒛= = =𝟎 . 𝟖𝟎
𝝈 𝟏𝟎
√𝒏 𝟖
𝑷 ( 𝒙 <𝟓𝟏 ) =𝑷 ( 𝒛 <𝟎 . 𝟖 ) =𝑵𝒐𝒓𝒎. 𝑺. 𝑫𝒊𝒔𝒕 (𝟎 . 𝟖 , 𝟏)=𝟎 .𝟓𝟑𝟗𝟖
Question:1A population has a mean of 50 and a standard deviation of 10. If a
random sample of 64 is taken, what is the probability that the sample mean is Less
than 47. Use Standard Normal Concept.
𝑥 − 𝜇 47 − 50
𝑧= = =− 2.40
𝜎 10
√𝑛 8
¿ 𝑃 ( 𝑥 <47 ) =𝑃 ( 𝑧 <−2.40 )=𝑁𝑜𝑟𝑚 . 𝑆 . 𝐷𝑖𝑠𝑡 (−2.40,1)=0.0082
Question: 1 Population is normally distributed, with a mean of 23.45 and a standard deviation of 3.8.
What is the probability of each of the following?
𝒙=𝟐𝟐
𝜎 3.8
𝜇=23.45 , 𝜎 =3.8 , = =1.20
√ 𝑛 √10
𝑃 ( 𝑥 >22 ) =1− 𝑃 ( 𝑥 ≤ 22 ) =1− 𝑁𝑜𝑟𝑚 . 𝐷𝑖𝑠𝑡 ( 22,23.45,1 .20,1 )
¿ 1 −.11346
¿ .88654
Question: 1 Population is normally distributed, with a mean of 23.45 and a standard deviation of 3.8. What is the
probability of each of the following?
𝒙=𝟐𝟔
𝜎 3.8
𝜇=23.45 , 𝜎=3.8 , = =1.9
√𝑛 √4
𝑃 ( 𝑥 >26 )=1 − 𝑃 ( 𝑥 ≤ 26 ) =1− 𝑁𝑜𝑟𝑚 . 𝐷𝑖𝑠𝑡 ( 26,23.45,1 .9,1 )
¿ 1− .9832
¿ .01679
Assignment Questions:2
Question: 2 Tompkins Associates reports that the mean clear height for
a Class A warehouse in the United States is 22 feet. Suppose clear heights
are normally distributed and that the standard deviation is 4 feet. A Class
A warehouse in the United States is randomly selected.
a. What is the probability that the clear height is greater than 17 feet?
b. What is the probability that the clear height is less than 13 feet?
c. What is the probability that the clear height is between 25 and 31 feet?
Question: 2a Tompkins Associates reports that the mean clear height for a Class A
warehouse in the United States is 22 feet. Suppose clear heights are normally distributed and
that the standard deviation is 4 feet. A Class A warehouse in the United States is randomly
selected. What is the probability that the clear height is greater than 17 feet?
𝒙=𝟏𝟕
𝜇=22 , 𝜎=4
𝑃 ( 𝑥 >17 )=1 − 𝑃 ( 𝑋 <17 )
¿
Question: 2 Tompkins Associates reports that the mean clear height for a Class A
warehouse in the United States is 22 feet. Suppose clear heights are normally
distributed and that the standard deviation is 4 feet. A Class A warehouse in the United
States is randomly selected. What is the probability that the clear height is less than 13
feet?
𝒙=𝟏𝟑
𝜇=22 , 𝜎 = 4
𝑃 ( 𝑥 <13 ) = 𝑃 (12,22,4,1 )
¿
Question: 2 Tompkins Associates reports that the mean clear height for a Class A
warehouse in the United States is 22 feet. Suppose clear heights are normally
distributed and that the standard deviation is 4 feet. A Class A warehouse in the United
States is randomly selected. What is the probability that the clear height is between 25
and 31 feet?
X=25 x=31
Assignment Questions:3
Question: 3 According to a report by Scarborough Research, the
average monthly household cellular phone bill is $60. Suppose local
monthly household cell phone bills are normally distributed with a
standard deviation of $11.35.
X=85
μ =60 , σ =11.35
¿
Question: 3b According to a report by Scarborough Research, the average monthly household
cellular phone bill is $60. Suppose local monthly household cell phone bills are normally
distributed with a standard deviation of $11.35. What is the probability that a randomly selected
monthly cell phone bill is between $45 and $70?
X=45 x=70
μ=60 , σ =11.35
P ( 45 < x <70 ) =P ( x< 70 ) − P ( x <45 )
¿
Statistical Inference
9-105
Business Statistics: Contemporary Decision Making, 3e, by
9-106
Black. © 2001 South-Western/Thomson Learning
What is Hypothesis Testing