Statistical Inference INF312 - Is - Lecture 02
Statistical Inference INF312 - Is - Lecture 02
-LECTURE 02
Dr. Mahmoud Mounir
[email protected]
The Normal Distribution
Note: This PowerPoint is only a summary and your main source should
be the book.
Introduction
❑ Normal Distribution.
Note: This PowerPoint is only a summary and your main source should be the book.
The Normal Distribution
Note: This PowerPoint is only a summary and your main source should be the book.
(b) Negatively skewed (c) Positively skewed
(a) Normal
Mean =Median=Mode
Note: This PowerPoint is only a summary and your main source should be the book.
❑A normal distribution is
a continuous ,
symmetric ,
bell shaped distribution of a variable.
µ Position parameter
σ shape parameter
Note: This PowerPoint is only a summary and your main source should be the book.
The mathematical equation for the normal distribution:
2
−( x− ) 2
2
e
y=
2
where
e ≈ 2 .718
π ≈ 3.14
µ ≈ population mean
σ ≈ population standard deviation
Note: This PowerPoint is only a summary and your main source should be the book.
1 (1) Different
means but same
standard
deviations.
Normal curves
with μ1 = μ2 and
2 σ1<σ2
Note: This PowerPoint is only a summary and your main source should be the book.
(3) Different
3 means and
different standard
deviations .
Note: This PowerPoint is only a summary and your main source should be the book.
Properties of the Normal Distribution
Note: This PowerPoint is only a summary and your main source should be the book.
Empirical Rule: Normal Distribution
Note: This PowerPoint is only a summary and your main source should be the book.
The Standard Normal
Distribution
Note: This PowerPoint is only a summary and your main source should be the book.
❑ The standard normal distribution is a normal
distribution with a mean of 0 and a standard deviation
of 1.
=0
=1
Note: This PowerPoint is only a summary and your main source should be the book.
❑ All Normal Distribution can be transformed into standard
Distribution.
or
Note: This PowerPoint is only a summary and your main source should be the book.
Empirical Rule: Standard Normal Distribution
=0
=1
Note: This PowerPoint is only a summary and your main source should be the book.
❑ The area under the standard normal distribution curve
that lies within
Note: This PowerPoint is only a summary and your main source should be the book.
2.To the right of any Z value
Note: This PowerPoint is only a summary and your main source should be the book.
3.Between any two Z values
= Q(a) – Q(b)
Note: This PowerPoint is only a summary and your main source should be the book.
Examples
a. Area to the left of z = 1.36: P( z< 1.36)
= 0.9131
Examples
b. Area to the left of z = -0.60: P(z < −0.60)
= 0.2743
Examples
c. Area to the right of z = 1.47: P(z > 1.47)
P(Z<1.99)=0.9767
Note: This PowerPoint is only a summary and your main source should be the book.
Column 1.9 9 row
Note: This PowerPoint is only a summary and your main source should be the book.
Example (2):
Find the area to the right of z=-1.16
P(Z>-1.16)=1-P(Z<-1.16)
=1-0.1230
= 0.8770
Note: This PowerPoint is only a summary and your main source should be the book.
Example (3):
Find the area between z=1.68 and z=-1.37
P(-1.37<Z<1.68)=P(Z<1.68)-P(Z<-1.37)
= 0.9535-0.0853
= 0.8682
Note: This PowerPoint is only a summary and your main source should be the book.
A Normal Distribution Curve as a Probability
Distribution Curve:
Note: This PowerPoint is only a summary and your main source should be the book.
Example (4):
Find probability for each
a) P(0<z<2.23)
b) P(z<1.65)
c) P(z>1.91)
a) P(0<Z<2.23)=P(Z<2.23)-P(Z<0)
=0.9898-0.5000
=0.4898
Note: This PowerPoint is only a summary and your main source should be the book.
b) P(Z<1.65)=0.9505
c) P(Z>1.91)=1-P(Z<1.91)
=1-0.9719
=0.0281
Note: This PowerPoint is only a summary and your main source should be the book.
Find the Z value that corresponds to given area
1 2
Z Z
we cannot find the area in the table Look up the area in table E to
find the z value
3 4
Z Z
we cannot find the area in the table
Look up the area in table E to
find the z value
Note: This PowerPoint is only a summary and your main source should be the book.
For exmples 5 to 12:
5 6
0.0188 0.9671
Z Z
z = -2.08 z = 1.84Z
7 8
0.0239 0.8962
Z
Z
1- 0.0239 = 0.9761 1- 0.8962 = 0.1038
z = 1.98 z = -1.26
Note: This PowerPoint is only a summary and your main source should be the book.
5 10 a
a
Z Z
0.5000 + a = ---- 0.5000 – a = ----
z = look in the table E z = look in the table E
11 12
0.4175
0.4066
Z Z
Note: This PowerPoint is only a summary and your main source should be the book.
Example (13):
Find the z value such that the area under the standard
normal distribution curve between 0 and the z value
is 0.2123
0.2123+0.5000=0.7123
Z=0.56
Note: This PowerPoint is only a summary and your main source should be the book.
Note: This PowerPoint is only a summary and your main source should be the book.
1) Find the z value to the left of the mean so that
98.87% of the area under the distribution curve lies to
the right of it.
Note: This PowerPoint is only a summary and your main source should be the book.
Example (1):
A survey by the National Retail Federation found that
women spend on average $146.21 for the Christmas
holidays. Assume the standard deviation is $29.44.
Find the percentage of women who spend less than
$160. Assume the variable is normally distributed.
Note: This PowerPoint is only a summary and your main source should be the book.
Solution: Step 3: Find the area using Z-table
FIND P (X < 160)
Step 1: Find the z value .
X − 160 − 146.21
Z = =
29.44
= 0.47 P(Z < 0.47) = 0.6808
Step 2: Draw the figure
68.08% of the women spend
less than 160$ at Christmas
time.
Note: This PowerPoint is only a summary and your main source should be the book.
Example (2):
Each month, an American household generates an
average of 28 pounds of newspaper for garbage or
recycling. Assume the standard deviation is 2 pounds.
If a household is selected at random, Find the
probability of its generating.
a) Between 27 and 31 pounds per month
b) More than 30.2 pounds per month
Assume the variable is approximately normally
distributed.
Note: This PowerPoint is only a summary and your main source should be the book.
Solution (a) :
FIND P (27 < X < 31)
Step 1: Find the two z values
X − 27 − 28
Z1 = = = −0.5
2
X − 31 − 28
Z2 = = = 1.5
2
Step 2: Draw the figure
Note: This PowerPoint is only a summary and your main source should be the book.
Step 3: Find the area using Z-table
P(-0.5 < Z < 1.5) = P(Z < 1.5) - P(Z < -0.5)
= 0.9332 - 0.3085 = 0.6247
Note: This PowerPoint is only a summary and your main source should be the book.
Solution (b) :
FIND P (X > 30.2)
Step 1: Find the z value
X − 30.2 − 28
Z = = = 1.1
2
0 1.1
Note: This PowerPoint is only a summary and your main source should be the book.
Step 3: Find the area using Z-table
Note: This PowerPoint is only a summary and your main source should be the book.
Example (3):
The American Automobile Association reports that
the average time it takes to respond to an emergency
call is 25 minutes. Assume the variable is
approximately normally distributed and the
standard deviation is 4.5 minutes. If 80 calls are
randomly selected, approximately how many will be
responded to in less than 15 minutes?
Note: This PowerPoint is only a summary and your main source should be the book.
Solution:
FIND P (X < 15)
Step 1: Find the z value
X − 15 − 25
Z = = = −2.22
4.5
-2.22 0
Note: This PowerPoint is only a summary and your main source should be the book.
Step 3: Find the area using Z-table
0.0132 × 80 = 1.056 ≈ 1
Note: This PowerPoint is only a summary and your main source should be the book.
Finding Data Values Given Specific Probabilities
Note: This PowerPoint is only a summary and your main source should be the book.
Example (4):
To qualify for a police academy, candidates must
score in the top 10% on a general abilities test. The
test has a mean of 200 and standard deviation of 20.
Find the lowest possible score to qualify. Assume the
test scores is normally distributed.
Note: This PowerPoint is only a summary and your main source should be the book.
Solution:
Step 1: Draw the figure
Note: This PowerPoint is only a summary and your main source should be the book.
Solution:
Step 1: Draw the figure
Note: This PowerPoint is only a summary and your main source should be the book.
Step 2 : Find the two z values . X = z +
P (Z > Z1) = 0.2 P (Z < Z2) = 0.2
P (Z < Z1) = 1-0.2 = 0.8
Z1 = 0.84 Z 2 = −0.84
Note: This PowerPoint is only a summary and your main source should be the book.
Example (6):
Given a normal distribution with a mean of 25, what
is the standard deviation if 18% of the values are
above 29?
Solution:
P (X > 29) = 0.18
P (X < 29) = 1- 0.18 = 0.82
𝑥−μ
P (Z < ) = 0.82
σ
29 −25
P (Z < ) = 0.82
σ
4
P (Z < ) = 0.82
σ
Note: This PowerPoint is only a summary and your main source should be the book.
From the Z- Table
4
= 0.92
σ
0.92 σ = 4
𝝈 = 4.35
Note: This PowerPoint is only a summary and your main source should be the book.
Example (7):
Given a normal distribution with a standard deviation
of 10, what is the mean if 21% of the values are below
50?
Solution:
Note: This PowerPoint is only a summary and your main source should be the book.
From the Z- Table
50 −μ
= -0.81
10
50 - μ = -8.1
𝝁= 58.1
Note: This PowerPoint is only a summary and your main source should be the book.
Example (8):
Given a normal distribution with 80% of the values
are above 125 and 90% of the values are above 110,
what are the mean and standard deviation of this
distribution?
Solution:
P (X > 125) = 0.8
P (X < 125) = 1- 0.8 = 0.2
𝑥−μ
P (Z < ) = 0.2
σ
125 −μ
P (Z < ) = 0.2
σ
Note: This PowerPoint is only a summary and your main source should be the book.
From the Z- Table
125 −μ
= -0.84 → (1)
σ
Note: This PowerPoint is only a summary and your main source should be the book.
P (X > 110) = 0.9
P (X < 110) = 1- 0.9 = 0.1
𝑥−μ
P (Z < ) = 0.1
σ
110 −μ
P (Z < ) = 0.1
σ
Note: This PowerPoint is only a summary and your main source should be the book.
From the Z- Table
110 −μ
= -1.28 → (2)
σ
Note: This PowerPoint is only a summary and your main source should be the book.
From Equations (1) and (2)
125 −μ
= -0.84
σ
𝟏𝟐𝟓 − 𝝁 = -0.84𝝈 → (1)
110 −μ
= -1.28
σ
𝟏𝟏𝟎 − 𝝁 = -1.28𝝈 → (2)
Subtract (2) from (1)
-15 = -0. 𝟒𝟒𝟐𝝈
𝝈 = 33.9
Substitute in (2)
𝟏𝟏𝟎 − 𝝁 = -1.28(33.9)
𝝁 = 153.5
Note: This PowerPoint is only a summary and your main source should be the book.
1-If X is normally distributed random variable with µ = 5 ,
σ = 4 , find the P(x> -1.4) ??
Discrete Probability
Distributions
Note: This PowerPoint is only a summary and your main source should be the book.
The Binomial Distribution
Mean, Variance and Standard deviation for
The Binomial Distribution
❑ Many types of probability problems have only
two possible outcomes or they can be reduced to
two outcomes.
Note: This PowerPoint is only a summary and your main source should be the book.
Notation for the Binomial Distribution
P(S) :The symbol for the probability of success
P(F) :The symbol for the probability of failure
p :The numerical probability of success
q :The numerical probability of failure
P(S) = p and P(F) = 1 – p = q
n :The number of trials
X :The number of successes
Note that X = 0, 1, 2, 3,...,n
Note: This PowerPoint is only a summary and your main source should be the book.
In a binomial experiment, the probability of exactly
X successes in n trials is
or
P( X ) = n Cx p q
X n− X
n!
P( X ) = p q
X n− X
( n - X )! X !
Note: This PowerPoint is only a summary and your main source should be the book.
A coin is tossed 3 times. Find the probability of getting
exactly 2 heads.
n=3
x= 2
Solution :
p=
Note: This PowerPoint is only a summary and your main source should be the book.
Example (2): Survey on Doctor Visits
A survey found that one out of five Americans say he or she
has visited a doctor in any given month. If 10 people are
selected at random, find the probability that exactly 3 will
have visited a doctor last month.
n = 10
Solution :
x= 3
n!
P( X ) = p X q n− X
( n - X )! X ! p=
3 7
10! 1 4
P ( 3) = = 0.201
7!3! 5 5
Note: This PowerPoint is only a summary and your main source should be the book.
Example (3): Survey on Employment
A survey from Teenage Research Unlimited (Northbrook, Illinois) found
that 30% of teenage consumers receive their spending money from part-
time jobs. If 5 teenagers are selected at random, find the probability that
at least 3 of them will have part-time jobs. n=5
Solution :
5! x= 3,4,5
P ( 3) = ( 0.30 ) ( 0.70 ) = 0.132
3 2
2!3!
5!
P ( 4) = ( 0.30 ) ( 0.70 ) = 0.028
4 1
1!4! p=0.30
5!
P ( 5) = ( 0.30 ) ( 0.70 ) = 0.002
5 0
0!5!
q=1-0.30 =0.70
P ( X 3 ) = 0.132
+0.028
+0.002
= 0.162
Note: This PowerPoint is only a summary and your main source should be the book.
Example (4): Survey on Employment
There are ten questions on a multiple – choice quiz each question with
five choices in each. Let X represents the number of questions a student
answers correctly.
1. What are the possible values of X?
2. Find the probability distribution.
3. What is the probability that a student will get 6 out of 10?
4. What is the probability that a student will pass the quiz?
5. Find the expected number and the standard deviation for the correct
answer.
n!
P( X ) = p X q n− X
( n - X )! X !
Note: This PowerPoint is only a summary and your main source should be the book.
Example (4): Survey on Employment
There are ten questions on a multiple – choice quiz each question with
five choices in each. Let X represents the number of questions a student
answers correctly.
1. What are the possible values of X?
2. Find the probability distribution.
3. What is the probability that a student will get 6 out of 10?
4. What is the probability that a student will pass the quiz?
5. Find the expected number and the standard deviation for the correct
answer. n = 10
x=0,1,2,3,4,5,6,7,
8,9,10
n!
P( X ) = p X q n− X
( n - X )! X ! p=0.1
q=1-0.1 =0.90
Note: This PowerPoint is only a summary and your main source should be the book.
Mean, Variance and Standard deviation
for the binomial
The mean , variance and SD of a variable that the
binomial distribution can be found by using the
following formulas:
Mean: = np
Variance: = npq 2
p=
Note: This PowerPoint is only a summary and your main source should be the book.
Example (5): Rolling a die
A die is rolled 360 times , find the mean , variance and
slandered deviation of the number of 4s that will be rolled .
n = 360
Solution :
p=
Note: This PowerPoint is only a summary and your main source should be the book.
Example (6):
Note: This PowerPoint is only a summary and your main source should be the book.
Solution (6.a) :
n!
P( X ) = p X q n− X n = 15
( n - X )! X !
x= 2
15!
𝑃 𝑥=2 = (0.2)2 (0.8)(15−2)
15 − 2 ! 2! p=0.2
15! q=1- 0.2 = 0.8
𝑃 𝑥=2 = 0.2 2 (0.8)13 = 0.2309
13! (2!)
= 23.09 %
Note: This PowerPoint is only a summary and your main source should be the book.
Solution (6.b) :
n!
P( X ) = p X q n− X n = 15
( n - X )! X !
x= 1,2, …, 15
p=0.2
15!
𝑃 𝑥=0 = (0.2)0 (0.8)(15−0) q=1- 0.2 = 0.8
15 − 0 ! 0!
15!
𝑃 𝑥=0 = 0.2 0 (0.8)15 = 0.0352
15! (0!)
𝑃 𝑋 ≥ 1 = 1 − 𝑃 𝑋 = 0 = 1 − 0.0352 = 0.9648 = 96.48%
Note: This PowerPoint is only a summary and your main source should be the book.
A coin is tossed 72 times. The standard deviation for the number of
heads that will be tossed is
A) 18
B) 4.24
C) 6
D) 36
A student takes a 6 question multiple choice quiz with 4 choices for
each question. If the student guesses at random on each question, what
is the probability that the student gets exactly 3 questions correct?
A) 0.088
B) 0.0512
C) 0.132
D) 0.022
The Poisson Distribution
Poissondistribution is for counts—if
events happen at a constant rate over
time, the Poisson distribution gives
the probability of X number of events
occurring in time T.
Poisson Mean and Variance
For a Poisson
Mean = random variable,
the variance and
mean are the
same!
◼ Variance and Standard
Deviation
=
2
=
where = expected number of hits in a
given time period
Poisson Distribution, example
The Poisson distribution models counts, such as the
number of new cases of SARS that occur in men in
New England next month.
The distribution tells you the probability of all
possible numbers of new cases, from 0 to infinity.
If X= # of new cases next month and X ~ Poisson (),
then the probability that X=k (a particular count) is:
k −
e
p( X = k ) =
k!
Example
For example, if new cases of West Nile
Virus in New England are occurring at a
rate of about 2 per month, then these are
the probabilities that: 0,1, 2, 3, 4, 5, 6, to
1000 to 1 million to… cases will occur in
New England in the next month:
Poisson Probability table
X P(X)
0 2 0 e −2 =.135
0!
1 2 1 e −2 =.27
k −
e
1!
2 2 e −2
p( X = k ) =
2 =.27
2!
3 2 3 e −2 =.18
k! 3!
4 2 4 e −2 =.09
4!
5
… …
Example: Poisson distribution
➢ Suppose that a rare disease has an incidence of 1 in 1000
person-years. Assuming that members of the population are
affected independently, find the probability of k cases in a
population of 10,000 (followed over 1 year) for k=0,1,2.
➢ The expected value (mean) = = .001*10,000 = 10
➢ 10 new cases expected in this population per year→
(10) 0 e − (10 )
P( X = 0) = = .0000454
0!
(10)1 e −(10 )
P( X = 1) = = .000454
1!
(10) 2 e −(10 )
P( X = 2) = = .00227
2!
more on Poisson…
“Poisson Process” (rates)
Note that the Poisson parameter can be given as
the mean number of events that occur in a defined
time period OR, equivalently, can be given as a
rate, such as =2/month (2 events per 1 month)
that must be multiplied by t=time (called a
“Poisson Process”) →
X ~ Poisson () k − t
( t ) e
P( X = k ) =
k!
E(X) = t
Var(X) = t
Example
For example, if new cases of West Nile in
New England are occurring at a rate of
about 2 per month, then what’s the
probability that exactly 4 cases will occur in
the next 3 months?
X ~ Poisson (=2/month)
(2 * 3) 4 e − ( 2*3) 6 4 e − ( 6 )
P(X = 4 in 3 months) = = = 0.134 = 13.4%
4! 4!
Exactly 6 cases?
(2 * 3) 6 e − ( 2*3) 66 e − ( 6 )
P(X = 6 in 3 months) = = = 0.16 = 16%
6! 6!
Practice problems (1)
a. If calls to your cell phone are a Poisson
process with a constant rate =2 calls per hour,
what’s the probability that, if you forget to turn
your phone off in a 1.5 hour movie, your phone
rings during that time?
k e −
Solution p( X = k ) =
k!
Solution
k e −
p( X = k ) =
k!
Practice problems (3)
Solution p( X = k ) =
k e −
k!
Practice problems (4)
Solution p( X = k ) =
k e −
k!