MTPDF6 - Sampling Distribution and Point Estimation
MTPDF6 - Sampling Distribution and Point Estimation
Sampling
Distributions
MPS Department | FEU Institute of Technology
Subtopic 1
OBJECTIVES
Unbiased
Sample
Unbiased, representative
sample drawn at random
Democrats Republicans from the entire
Population
population.
Biased
People who have Sample Biased, unrepresentative
phones and/or cars
and/or are Digest sample drawn from
readers.
Democrats
people who have cars
Republicans
Population and/or telephones and/or
read the Digest.
Sample Statistics as Estimators of Population
Parameters
A sample statistic is a numerical
A population parameter is
measure of a summary a numerical measure of a
characteristic of summary characteristic of
a sample.
a population.
X X X X X X X
X X X X X X X
X X X X
Sample points
Sample mean (X )
Sampling Distributions
P(X)
5 0.125 0.625 0.5 0.25 0.03125 0.1
6 0.125 0.750 1.5 2.25 0.28125
7 0.125 0.875 2.5 6.25 0.78125
8 0.125 1.000 3.5 12.25 1.53125
0.0
1.000 4.500 5.25000 1 2 3 4 5 6 7 8
X
E(X) = = 4.5
V(X) = 2 = 5.25
SD(X) = = 2.2913
• There are 8*8 = 64 different but Each of these samples has a sample
equally-likely samples of size 2 mean. For example, the mean of the
that can be drawn (with
sample (1,4) is 2.5, and the mean of
replacement) from a uniform
population of the integers from 1 the sample (8,4) is 6.
to 8:
Samples of Size 2 from Uniform (1,8) Sample Means from Uniform (1,8), n = 2
1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8
1 1,1 1,2 1,3 1,4 1,5 1,6 1,7 1,8 1 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5
2 2,1 2,2 2,3 2,4 2,5 2,6 2,7 2,8 2 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
3 3,1 3,2 3,3 3,4 3,5 3,6 3,7 3,8 3 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5
4 4,1 4,2 4,3 4,4 4,5 4,6 4,7 4,8 4 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0
5 5,1 5,2 5,3 5,4 5,5 5,6 5,7 5,8 5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5
6 6,1 6,2 6,3 6,4 6,5 6,6 6,7 6,8 6 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0
7 7,1 7,2 7,3 7,4 7,5 7,6 7,7 7,8 7 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5
8 8,1 8,2 8,3 8,4 8,5 8,6 8,7 8,8 8 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
The probability distribution of the sample mean is called the
sampling distribution of the the sample mean.
Sampling Distribution of the Mean
Sampling Distribution of the Mean
X P(X) XP(X) X-X (X-X)2 P(X)(X-X)2
0.10
1.0 0.015625 0.015625 -3.5 12.25 0.191406
1.5 0.031250 0.046875 -3.0 9.00 0.281250
P(X)
2.0 0.046875 0.093750 -2.5 6.25 0.292969
0.05
2.5 0.062500 0.156250 -2.0 4.00 0.250000
3.0 0.078125 0.234375 -1.5 2.25 0.175781
3.5 0.093750 0.328125 -1.0 1.00 0.093750
4.0 0.109375 0.437500 -0.5 0.25 0.027344 0.00
4.5 0.125000 0.562500 0.0 0.00 0.000000 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
E ( X ) X 4.5
6.0 0.078125 0.468750 1.5 2.25 0.175781
6.5 0.062500 0.406250 2.0 4.00 0.250000
7.0
7.5
0.046875
0.031250
0.328125
0.234375
2.5
3.0
6.25
9.00
0.292969
0.281250 V ( X ) 2X 2.625
8.0 0.015625 0.125000 3.5 12.25 0.191406
SD( X ) X 1.6202
1.000000 4.500000 2.625000
Properties of the Sampling Distribution of
the Sample Mean
• Comparing the population 0.2
Uniform Distribution (1,8)
P(X)
0.1
P(X)
0.05
0.00
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
X
Relationships between Population Parameters and the Sampling
Distribution of the Sample Mean
The expected value of the sample mean is equal to the population mean:
E( X )
X X
The variance of the sample mean is equal to the population variance divided by
the sample size:
2
V(X) 2
X
X
n
The standard deviation of the sample mean, known as the standard error of
the mean, is equal to the population standard deviation divided by the square
root of the sample size:
SD( X ) X
X
n
Sampling from a Normal Population
When sampling from a normal population with mean and standard deviation , the
sample mean, X, has a normal sampling distribution:
2
X ~ N (, )
n
This means that, as the sample size Sampling Distribution of the Sample Mean
increases, the sampling distribution of
0.4
the sample mean remains centered on Sampling Distribution: n =16
the population mean, but becomes 0.3
Sampling Distribution: n =4
more compactly distributed around
f(X)
0.2
• IQ scores: population vs. sample In a large population of adults, the mean IQ is 112 with
standard deviation 20. Suppose 200 adults are randomly selected for a market research
campaign.
‡The distribution of the sample mean IQ is:
A) Exactly normal, mean 112, standard deviation 20
B) B) Approximately normal, mean 112, standard deviation 20
C) C) Approximately normal, mean 112 , standard deviation 1.414
D) D) Approximately normal, mean 112, standard deviation 0.1
Answer:
C) Approximately normal, mean 112 , standard deviation 1.414 Population distribution : N(µ =
112; σ = 20)
Sampling distribution for n = 200 is N(µ = 112; σ / √ n = 1.414)
Example
n=5
When sampling from a population 0.25
P(X)
0.15
0.10
mean and standard deviation n
P(X)
0.1
(n >30). Large n
0.4
0.3
f(X)
0.2
2
0.1
0.0
-
X
The Central Limit Theorem Applies to Sampling Distributions
from Any Population
Normal Uniform Skewed General
Population
n=2
n = 30
X X X X
https://slideplayer.com/slide/8348541/
Example
Mercury makes a 2.4 liter V-6 engine, the Laser XRi, used in speedboats. The
company’s engineers believe the engine delivers an average power of 220
horsepower and that the standard deviation of power delivered is 15 HP. A
potential buyer intends to sample 100 engines (each engine is to be run a single
time). What is the probability that the sample mean will be less than 217HP?
X 217
P( X 217) P
n n
217 220 217 220
P Z P Z
15 15
100 10
P ( Z 2) 0.0228
Student’s t Distribution
If the population standard deviation, , is unknown, replace with the sample standard
deviation, s. If the population is normal, the resulting statistic: X
t
s
has a t distribution with (n - 1) degrees of freedom. n
0 .4
P(X)
0 .2
number of trials, n.
0 .0
0 1 2
n=10,p=0.3
0.3
P(X)
0.1
0.0
0 1 2 3 4 5 6 7 8 9 10
X
n=15, p = 0.3
0.2
P(X)
p(1 p)
0.1
n 0.0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 X
0 1 2 3 4 5 6 7 8 9 10 1112 13 1415
15 15 15 15 15 15 15 15 15 1515 1515 15 1515 ^p
Degrees of Freedom
If only two data points and the sample mean are known:
x 14
x1=12 x2=14 x3=? x4=?
x 12 14 x x4
3
x= 14
n 4
12 14 x x4 56
3
Degrees of Freedom
The number of degrees of freedom is equal to the total number of measurements (these
are not always raw data points), less the total number of restrictions on the
measurements. A restriction is a quantity computed from the measurements.
The sample mean is a restriction on the sample measurements, so after calculating the
sample mean there are only (n-1) degrees of freedom remaining with which to calculate
the sample variance. The sample variance is based on only (n-1) free data points:
s2
(x x) 2
(n 1)
Example
x1 + x2 + x3 + x4 = 150,000
A fourth project’s budget can be determined from the total budget and the
individual budgets of the other three. For example, if:
x4=150,000-40,000-30,000-50,000=30,000
General Concept of
Point Estimation
MPS Department | FEU Institute of Technology
Subtopic 2
OBJECTIVES
{
Bias
Consistency
n = 10 n = 100
The values of the remaining two data points cannot be uniquely determined:
x 12 14 x x
x= 3 4 14
n 4
12 14 x x 56
3 4
Degrees of Freedom
The number of degrees of freedom is equal to the total number of
measurements (these are not always raw data points), less the total number of
restrictions on the measurements. A restriction is a quantity computed from the
measurements.
x1 + x2 + x3 + x4 = 150,000
A fourth project’s budget can be determined from the total budget and the individual budgets of
the other three. For example, if:
x4=150,000-40,000-30,000-50,000=30,000
Unbiased Estimators
Definition
General Concepts of Point Estimation
Example
General Concepts of Point Estimation
Example (continued)
General Concepts of Point Estimation
Variance of a Point Estimator
Definition
Example
General Concepts of Point Estimation
Example (continued)
General Concepts of Point Estimation
ˆ .
A biased estimator that has smaller ̂1variance than the unbiased estimator 2
Elementary Statistics by Bluman