Chapter 4. Distribution of Sample Statistics
Chapter 4. Distribution of Sample Statistics
December 7, 2023
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Statistical Analysis
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Statistical Analysis
Sample / Sample Statistics (x, s2 )
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Population
Definition
Population for various statistical studies is modeled as a random
variable, denoted by X. Here
Then
The number of objects in the population is called the population
size, denoted by N (is usually very large).
E(X) = µ: is called the population mean.
Var(X) = σ 2 : is called the population variance.
The population parameters, µ and σ 2 , are fixed but unknown numbers.
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Random Sample
Definition
A simple random sample is chosen by a process that randomly selects
a sample of n objects from a population in such a way that
Each member of the population has the same probability of being
selected,
The selection of one member is independent of the selection of
any other member.
Here, n is called the sample size.
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Random Sample
Select randomly n objects from the population. Let
X1 : the value obtained when we measure the 1st object.
X2 : the value obtained when we measure the 2nd object.
…
Xn : the value obtained when we measure the nth object.
Remark
The system of random variables X1 , X2 , . . . , Xn is called the random
sample. Especially X1 , X2 , . . . , Xn have the same probability
distribution as the population X. In particular,
E(Xi ) = E(X) = µ
and
Var(Xi ) = Var(X) = σ 2 . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Sample Mean
Definition
Let the random variable X1 , X2 , . . . , Xn denote a random sample from
a population. The sample mean value of these random variables is
defined as follows:
1∑
n
X1 + X2 + . . . + Xn
X= Xi =
n n
i=1
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Xi ∼ N(µ, σ 2 ), i = 1, 2, . . . , n.
Therefore
1 1
X = X1 + . . . + Xn
n n
1 1 1 1
∼ N( µ + . . . + µ, 2 σ 2 + . . . + 2 σ 2 )
n 2 n n n
σ
∼ N(µ, )
n . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Theorem
Let the random variable X1 , X2 , . . . , Xn denote a random sample from
a population. Suppose that X ∼ N(µ, σ 2 ). If the parent population
distribution is normal and, thus, the sampling distribution of the
sample means is normal, then the random variable
X−µ
Z= σ ∼ N(0, 1).
√
n
σ
√ is called the standard error of X, denoted by σX
n
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
The central limit theorem shows that the sum of n random variables
from any probability distribution will be approximately normally
distributed if n is large (given a largeenough sample size).
Theorem
Let X1 , X2 , . . . , Xn be a set of n independent random variables having
identical distributions with mean µ and variance σ 2 , and X as the
mean of these random variables. As n becomes large (n ≥ 25),
X−µ
Z= σ ∼ N(0, 1).
√
n
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Example
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Exercise
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Population Proportion
Definition
In a population, let A be a characteristic of interest. Then the
population proportion, denoted by P, is defined by
NA
P=
N
where
N: the population size;
NA : the number of items that have the characteristic A.
Note that: Since N is very large, the population proportion is fixed but
unknown.
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Sample Proportion
Sample Proportion
Properties
X
Let p̂ = be a sample proportion. Then
n
µp̂ = E(p̂) = P
P(1 − P)
σp̂2 = Var(p̂) =
n
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
P(1 − P)
p̂ ∼ N(P, )
n
or
p̂ − P
√ ∼ N(0, 1)
P(1 − P)
n . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Example
Example: A charity has found that 42% of all donors from last year
will donate again this year. A random sample of 300 donors from last
year was taken.
a. What is the standard error of the sample proportion who will donate
again this year?
b. What is the probability that more than half of these sample
members will donate again this year?
c. What is the probability that the sample proportion is between 0.40
and 0.45?
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Exercise
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Exercise
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Exercise
Ex 3: A company wants to estimate the proportion of people who are
likely to purchase electric shavers from those who watch the
nationally telecast baseball playoffs. A random sample obtained
information from 120 people who were identified as persons who
watch baseball telecasts. Suppose that the proportion of those likely
to purchase electric shavers in the population who watch the telecast
is 0.25.
a. The probability is 0.10 that the sample proportion watching the
telecast exceeds the population proportion by how much?
b. The probability is 0.05 that the sample proportion is lower than the
population proportion by how much?
c. The probability is 0.30 that the sample proportion differs from the
population proportion by how much? . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Sample Variance
Definition
Let X1 , X2 , . . . , Xn be a random sample of observation from a
population. The quantity
1 ∑
n
(X1 − X)2 + . . . + (Xn − X)2
S2 = (Xi − X)2 =
n−1 n−1
i=1
is called the sample variance, and its square root, is called the sample
standard deviation.
For example, calculate the sample variance of the following random
sample
5, 2, 3, 3, 6, 5, 4.
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Properties
(n − 1)S2
Let ∼ χ2(n−1) . Then
σ2
E(S2 ) = σ 2 .
2σ 4
Var(S2 ) = .
n−1
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Critical value
Definition
Let χ2v denote the ChiSquare Distribution with v degrees of freedom.
Given the probability α. The critical value, denoted by χ2v,α , is a value
such that
P(χ2v > χ2v,α ) = α.
Critical values can be found in Appendix Table 7.
Example
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Exercise
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Exercise
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
Exercise
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .