Engineering Probability & Statistics
Engineering Probability & Statistics
Make
Make Onthe
On thebasis
basisofof
generalizations
generalizations observationsof
observations of
aboutthe
about the aasample,
sample,aapart
part
characteristicsof
characteristics of ofaapopulation
of population
aapopulation...
population...
Literary Digest Poll (1936)
What is wrong
with the Poll?
5-2 Sample Statistics as Estimators of
Population Parameters
• Sample statistic? Population parameter?
- a numerical measure of a - a numerical measure of a
summary characteristic summary characteristic of a
of a sample. population.
Source: https://www.phamduytung.com/blog/2019-05-04-sampling-method/
Population Distribution, Random Sample
from Population, and Their Means
How do you
observe the
relationship
between means of
sample &
population?
5-3 Sampling Distribution
11 0.125
0.125 0.125
0.125 -3.5
-3.5 12.25
12.25 1.53125
1.53125 0 .2
22 0.125
0.125 0.250
0.250 -2.5
-2.5 6.25
6.25 0.78125
0.78125
3 0.125 0.375 -1.5 2.25 0.28125
3 0.125 0.375 -1.5 2.25 0.28125
4 0.125 0.500 -0.5 0.25 0.03125
4 0.125 0.500 -0.5 0.25 0.03125
55 0.125
0.125 0.625
0.625 0.5
0.5 0.25
0.25 0.03125
0.03125
66 0.125
0.125 0.750
0.750 1.5
1.5 2.25
2.25 0.28125
0.28125
P (X )
77 0.125
0.125 0.875
0.875 2.5
2.5 6.25
6.25 0.78125
0.78125 0 .1
88 0.125
0.125 1.000
1.000 3.5
3.5 12.25
12.25 1.53125
1.53125
1.000 4.500 5.25000
1.000 4.500 5.25000
0 .0
1 2 3 4 5 6 7 8
E(X) = = 4.5 X
V(X) = 2 = 5.25
SD(X) = = 2.2913
Sampling Distribution (Continued)
• There are 8*8 = 64 different but Each of these samples has a sample
equally-likely samples of size 2 mean. For example, the mean of
(n=2) that can be drawn (with the sample (1,4) is 2.5, and the
replacement) from a uniform mean of the sample (8,4) is 6.
population of the integers, 1 to 8:
Samples of Size 2 from Uniform (1,8) Sample Means from Uniform (1,8), n = 2
1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8
1 1,1 1,2 1,3 1,4 1,5 1,6 1,7 1,8 1 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5
2 2,1 2,2 2,3 2,4 2,5 2,6 2,7 2,8 2 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
3 3,1 3,2 3,3 3,4 3,5 3,6 3,7 3,8 3 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5
4 4,1 4,2 4,3 4,4 4,5 4,6 4,7 4,8 4 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0
5 5,1 5,2 5,3 5,4 5,5 5,6 5,7 5,8 5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5
6 6,1 6,2 6,3 6,4 6,5 6,6 6,7 6,8 6 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0
7 7,1 7,2 7,3 7,4 7,5 7,6 7,7 7,8 7 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5
8 8,1 8,2 8,3 8,4 8,5 8,6 8,7 8,8 8 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
Sampling Distribution (Continued)
The probability distribution of the sample mean is called the
sampling distribution of the sample mean.
Sampling Distribution of the Mean
S ampling Dis tributio n of the Me an
X P(X) XP(X) X-X (X-X)2 P(X)(X-X)2
P(X)
2.5 0.062500 0.156250 -2.0 4.00 0.250000
3.0 0.078125 0.234375 -1.5 2.25 0.175781 0.0 5
3.5 0.093750 0.328125 -1.0 1.00 0.093750
4.0 0.109375 0.437500 -0.5 0.25 0.027344
4.5 0.125000 0.562500 0.0 0.00 0.000000
0.0 0
5.0 0.109375 0.546875 0.5 0.25 0.027344 1.0 1 .5 2.0 2 .5 3.0 3.5 4.0 4.5 5 .0 5 .5 6 .0 6 .5 7 .0 7.5 8 .0
5.5 0.093750 0.515625 1.0 1.00 0.093750 X
6.0 0.078125 0.468750 1.5 2.25 0.175781
6.5 0.062500 0.406250 2.0 4.00 0.250000
7.0
7.5
0.046875
0.031250
0.328125
0.234375
2.5
3.0
6.25
9.00
0.292969
0.281250 E ( X ) = m X = 4.5
8.0 0.015625 0.125000 3.5 12.25 0.191406
V ( X ) = s 2X = 2.625
1.000000 4.500000 2.625000 SD( X ) = s X = 1.6202
Properties of the Sampling Distribution of
the Sample Mean
Uniform Distribution (1,8)
• Comparing the population 0.2
P(X)
0.1
symmetric. X
P(X)
with a smaller variance. 0.05
0.00
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
X
Nonnormal Population Distribution and
Normal Sampling Distribution of Sample
Mean When a Large Sample is Used
Relationships between Population Parameters and the
Sampling Distribution of the Sample Mean
V(X) 2
X
X
n
The standard deviation of the sample mean, known as the
standard error of the mean, is equal to the population standard
deviation divided by the square root of the sample size:
SD( X ) X
X
n
The Central Limit Theorem
Effects of Central Limit Theorem
The Central Limit Theorem
Mercury makes a 2.4 liter V-6 engine, the Laser XRi, used in
speedboats. The company’s engineers believe the engine delivers
an average power of 220 horsepower and that the standard
deviation of power delivered is 15 HP. A potential buyer intends to
sample 100 engines (each engine is to be run a single time). What
is the probability that the sample mean will be less than 217 HP?
Sampling from a Normal Population
When sampling from a normal population with mean and standard
deviation , the sample mean, X, has a normal sampling
distribution:
2
X ~ N (, )
n
Thismeans
This meansthat,
that,as
asthe
the
samplesize
sample sizeincreases,
increases,the
the
samplingdistribution
sampling distributionof
ofthe
the
samplemean
sample meanremains
remains
centeredon
centered onthe
thepopulation
population
mean,but
mean, butbecomes
becomesmore
more
compactlydistributed
compactly distributedaround
around
thatpopulation
that populationmean
mean
Student’s t Distribution
If the population standard deviation, , is unknown, replace with
the sample standard deviation, s. If the population is normal, the
resulting statistic: X t
s/ n
has a t distribution with (n-1) degrees of freedom.
•• The
Thettisisaafamily
familyof
ofbell-shaped
bell-shapedand
and
symmetricdistributions,
symmetric distributions,one onefor foreach
each
numberof
number ofdegree
degreeof offreedom.
freedom.
Standard normal
•• Theexpected
The expectedvalue
valueof ofttisis0.0.
t, df=20
•• Thevariance
The varianceof ofttisisgreater
greaterthanthan1,1,but
but t, df=10
approaches11as
approaches asthe
thenumber
numberof ofdegrees
degrees
offreedom
of freedomincreases.
increases. TheThettisisflatter
flatterand
and
hasfatter
has fattertails
tailsthan
thandoes
doesthe thestandard
standard
normal.
normal.
•• Thettdistribution
The distributionapproaches
approachesaastandardstandard
normalas
normal asthe
thenumber
numberof ofdegrees
degreesof of
freedomincreases.
freedom increases.
The Sampling Distribution of the Sample
Proportion, p̂
+ The sampling distribution of the sample
proportion is based on the binominal distribution
with parameters n and p, where n is the sample
size and p is the population proportion.
{
Bias
Consistency
n = 10 n = 100
The sample variance (the sum of the squared deviations from the
sample mean divided by (n-1) is an unbiased estimator of the
population variance.
æ
E ( s ) = Eç
2 å ( x - x )
2
ö
÷ =s2
è (n - 1) ø
æ å ( x - x ) 2ö
Eç ÷<s
2
è n ø
Degrees of freedom
s
2
=
å (x - x)
2
(n - 1)
End Lecture 6