Lecture 2-Statistics The Normal Distribution and The Central Limit Theorem
Lecture 2-Statistics The Normal Distribution and The Central Limit Theorem
f(X) P (a ≤ X ≤ b)
= P (a < X < b)
(Note that the
probability of any
individual value is zero)
a b X
Probability is the area under the
density curve
f(X) P (a ≤ X ≤ b)
= P (a < X < b)
(Note that the
probability of any
individual value is zero)
a b X
The Normal Distribution
◼ ‘Bell Shaped’
◼ Symmetrical
f(X)
◼ Mean, Median and Mode
are Equal
Location is determined by the σ
mean, μ X
Spread is determined by the μ
standard deviation, σ
Mean
The random variable has an = Median
infinite theoretical range: = Mode
+ to −
The Normal Distribution
Density Function
Changing σ increases
or decreases the
σ spread.
μ X
Probability as
Area Under the Curve
The total area under the curve is 1.0, and the curve is
symmetric, so half is above the mean, half is below
f(X) P( − X μ) = 0.5
P(μ X ) = 0.5
0.5 0.5
μ X
P( − X ) = 1.0
Empirical Rules
μ ± 1σ encloses about
68.26% of X’s
σ σ
X
μ-1σ μ μ+1σ
68.26%
The Empirical Rule
(continued)
2σ 2σ 3σ 3σ
μ x μ x
95.44% 99.73%
**The beauty of the normal curve:
68% of
the data
X −μ
Z=
σ
The Z distribution always has mean = 0 and
standard deviation = 1
The Standardized Normal
Probability Density Function
◼ The formula for the standardized normal
probability density function is
1 −(1/2)Z2
f(Z) = e
2π
1
Z
0
Values above the mean have positive Z-values,
values below the mean have negative Z-values
Standard Normal Model
.5 .5
Z
-3 -2 -1 0 1 2 3
X
0 3 6 8 9 12
(X-6)/2
µ = 0 and = 1
.5 .5
Z
-3 -2 -1 0 1 2 3
Example
X − μ 200 − 100
Z= = = 2.0
σ 50
◼ This says that X = 200 is two standard
deviations (2 increments of 50 units) above
the mean of 100.
Comparing X and Z units
0 2.0 Z (μ = 0, σ = 1)
0 1.27 z
0 z
-2.24
A1 A2
z
-1.18 0 2.73
X
8.0
8.6
Finding Normal Probabilities
(continued)
◼ Let X represent the time it takes to download an image file from the
internet.
◼ Suppose X is normal with mean 8.0 and standard deviation 5.0. Find
P(X < 8.6)
X − μ 8.6 − 8.0
Z= = = 0.12
σ 5.0
μ=8 μ=0
σ = 10 σ=1
8 8.6 X 0 0.12 Z
X
8.0
8.6
Finding a Normal Probability
Between Two Values
Calculate Z-values:
X −μ 8 −8
Z= = =0
σ 5
8 8.6 X
X − μ 8.6 − 8 0 0.12 Z
Z= = = 0.12
σ 5 P(8 < X < 8.6)
= P(0 < Z < 0.12)
Probabilities in the Lower Tail
X
8.0
7.4
Probabilities in the Lower Tail
(continued)
Normal
Distribution µ=5
= 10
2.9 5 7.1 X
Example: P(X 8) = ?
Normal
Distribution
= 10
=5 8 X
Example
What is Zα if P(Z>Zα)=0.005
= 0.025
. = 0.05
Zα
Given a Normal Probability
Find the X Value
X = μ + Zσ
Finding the X value for a
Known Probability (continued)
Example:
◼ Let X represent the time it takes (in seconds) to
download an image file from the internet.
◼ Suppose X is normal with mean 8.0 and standard
deviation 5.0
◼ Find X such that 20% of download times are less than
X.
0.2000
? 8.0 X
? 0 Z
Find the Z value for
20% in the Lower Tail
1. Find the Z value for the known probability
◼ 20% area in the lower
tail is consistent with a
Z value of ?
0.2000
? 8.0 X
0 Z
Find the Z value for
20% in the Lower Tail
1. Find the Z value for the known probability
◼ 20% area in the lower
tail is consistent with a
Z value of -0.84
0.2000
? 8.0 X
-0.84 0 Z
Finding the X value
X = μ + Zσ
= 8.0 + ( −0.84 )5.0
= 3.80
X
90
60
30
-2 -1 0 1 2 Z
Normal Probability Plots (cont)
μ X = E (X ) = μ = population mean
Chap 7-57
Sample Mean for a Normal Population
σ
μX = μ and σX =
n
Sampling Distribution Properties
Normal Population
◼
μx = μ Distribution
μ x
(i.e. x is unbiased ) Normal Sampling
Distribution
(has the same mean)
μx
x
Sampling Distribution Properties
(continued)
As n increases, Larger
σx decreases sample size
Smaller
sample size
μ x
Sample Mean
for non-Normal Populations
the sampling
As the n↑ distribution of
sample the sample
size gets mean becomes
large almost normal
enough… regardless of
shape of
population
x
Sample Mean
if the Population is not Normal
(continued)
Population Distribution
Sampling distribution
properties:
Central Tendency
μx = μ
μ x
Sampling Distribution
Variation
σ
σx =
(becomes normal as n increases)
Larger
n Smaller
sample size
sample
size
μx x
The Importance of the Central
Limit Theorem
◼ When we select simple random samples of
size n, the sample means x will vary from
sample to sample. We can model the
distribution of these sample means with a
probability model that is …
N ,
n
How Large is Large Enough?
(X − μ X ) (X − μ)
Z= =
σX σ
n
Solution:
◼ Even if the population is not normally
distributed, the central limit theorem can be
used (n > 30)
◼ … so the sampling distribution of x is
approximately normal
◼ … with mean μ x = 8
σ 3
◼ …and standard deviation σ x = n = 36 = 0.5
Example
(continued)
Solution (continued):
7.8 - 8 X -μ 8.2 - 8
P(7.8 X 8.2) = P
3 σ 3
36 n 36
= P(-0.4 Z 0.4) = 0.6554 - 0.3446 = 0.3108