Review v5
Review v5
Hanoi, 2024
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
PROBABILITY
PROBABILITY
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
P (A ∩ B)
P (A|B) = .
P (B)
P (A ∩ B) = P (A)P (B).
PROBABILITY
Total probability rule (two events): For any events A and B,
P (B | E1 ) P (E1 )
P (E1 |B) =
P (B|E1 ) P (E1 )+P (B|E2 ) P (E2 )+ ... +P (B|Ek ) P (Ek )
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
RANDOM VARIABLES
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
f (xi ) = 1/n.
Binomial Distribution
Binomial Distribution
A random experiment consists of n Bernoulli trials such that:
The trials are independent.
Each trial results in only two possible outcomes, labeled as ”success”
and ”failure.”
The probability of a success in each trial, denoted as p, remains
constant.
The random variable X that equals the number of trials that result in a
success has a binomial random variable with parameters 0 < p < 1 and
n = 1, 2, . . . The probability mass function of X is
and
µ = E(X) = np and σ 2 = V (X) = np(1 − p).
Dr. Duong Thi Viet An PROBABILITY & STATISTICS Hanoi, 2024 15 / 54
Discrete Random Variables
Geometric Distribution
Geometric Distribution
In a series of Bernoulli trials (independent trials with constant probability p
of a success), let the random variable X denote the number of trials until
the first success. Then X is a geometric random variable with parameter
0 < p < 1 and
f (x) = (1 − p)x−1 p x = 1, 2, . . .
and
µ = E(X) = 1/p and σ 2 = V (X) = (1 − p)/p2 .
and
µ = E(X) = r/p and σ 2 = V (X) = r(1 − p)/p2 .
Hypergeometric Distribution
Hypergeometric Distribution
A set of N objects contains: K objects classified as successes, N − K objects
classified as failures, A sample of size n objects is selected randomly (without
replacement) from the N objects, where K ≤ N and n ≤ N .
Let the random variable X denote the number of successes in the sample. Then
X is a hypergeometric random variable and
x n−x
CK .CN −K
f (x) = n , x = max{0, n + K − N } to min{K, n}
CN
−n
and µ = E(X) = np and σ 2 = V (X) = np(1 − p) N N −1 ,
N −n
where p = K/N and is called the finite population correction factor .
N −1
Poisson Distribution
Poisson Distribution
The random variable X that equals the number of events in a Poisson
process is a Poisson random variable with parameter 0 < λ, and the
probability mass function of X is
e−λ λx
f (x) = x = 0, 1, 2, . . .
x!
and
µ = E(X) = λ, σ 2 = V (X) = λ.
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
1 −(x−µ)2
f (x) = √ e 2σ2 −∞<x<∞
2πσ
is a normal random variable with parameters µ, where −∞ < µ < ∞, and
σ > 0. Also,
E(X) = µ and V (X) = σ 2
and the notation N µ, σ 2 is used to denote the distribution.
µ=0 and σ2 = 1
and !
x − 0.5 − np
P (x ≤ X) = P (x − 0.5 ≤ X) ∼
=P p ≤Z
np(1 − p)
The approximation is good for np > 5 and n(1 − p) > 5.
Exponential Distribution
Exponential Distribution
The random variable X that equals the distance between successive events
of a Poisson process with mean number of events λ > 0 per unit interval is
an exponential random variable with parameter λ. The probability density
function of X is
and
1 1
µ = E(X) = and σ 2 = V (X) = .
λ λ2
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
Mean µ,
variance σ 2 known x̄ − zα/2 √σn ≤ µ ≤ x̄ + zα/2 √σn
Mean µ,
variance σ 2 unknown x̄ − tα/2,n−1 √sn ≤ µ ≤ x̄ + tα/2,n−1 √sn
p p
Proportion p p̂ − zα/2 p̂(1 − p̂)/n ≤ p ≤ p̂ + zα/2 p̂(1 − p̂)/n
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
Null hypothesis : H0 : µ = µ0
x̄ − µ0
Test statistic value: z0 = √
σ/ n
Alternative P -value Critical values Reject H0
Hypothesis
Null hypothesis: H0 : µ = µ0
Test statistic:
X̄ − µ0
T0 = √
S/ n
has t-distribution with n − 1 degrees of freedom.
Alternative P -value Critical values Reject H0
hypothesis
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
s
(n1 − 1)s21 + (n2 − 1)s22
where sp =
n1 + n2 − 2
2
s21 s22
n1 + n2
where v = (s21 /n1 )2 (s22 /n2 )2
n1 −1 + n2 −1
≤ p1 − p2
binominal parameters p1 − p2 q
p̂1 (1−p̂1 ) p̂2 (1−p̂2 )
≤ p̂1 − p̂2 + zα/2 n1 + n2
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
Content
1 Probability
2 Random Variables
3 Discrete Random Variables
4 Continuous Random Variable
5 Summary of One-Sample Confidence Interval Procedures
6 Summary of Tests on the Mean
7 Summary of Tests on a Population Proportion
8 Summary of Two-Sample Confidence Interval Procedures
9 Summary of Two-Sample Hypothesis-Testing Procedures
10 Test Hypothesis about the Slope and Intercept
n n Pn Pn n
X X ( i=1 xi ) ( i=1 yi ) X
Sxy = (xi − x̄)(yi − ȳ) = xi yi − = xi yi − nx̄ȳ
i=1 i=1
n i=1
n n Pn 2 n
X X ( i=1 xi ) X
Sxx = (xi − x̄)2 = (n − 1)s2x = x2i − = x2i − nx̄2
i=1 i=1
n i=1
n n Pn 2 n
X X ( i=1 yi ) X
Syy = (yi − ȳ)2 = (n − 1)s2y = yi2 − = yi2 − nȳ 2
i=1 i=1
n i=1
Regression Statistics
Multiple R 0.76211
R Square 0.58082
Adjusted R
Square 0.52842
Standard
Error 41.33032
Observations 10
ANOVA Significance X
df SS MS F F
Total 9 32600.5000
Standard Upper
Coefficients Error t Stat P-value Lower 95% 95%
Correlation coefficient
Sxy Sxy
R= √ =p
Sxx SST Sxx Syy
Test on slope
test on y-intercept
Null Hypothesis H0 : β1 = β1,0
H0 : β0 = β0,0
Alternative Hypothesis H1 : β1 6= β1,0
H1 : β0 6= β0,0
β̂1 − β1,0
β̂0 − β0,0
Test satistic T0 = T0 =
se(β̂1 )
se(β̂0 )
Reject H0 |t0 | > tα/2,n−2
|t0 | > tα/2,n−2
s
σ̂ 2
Estimated standard error of the slope is se(β̂1 ) =
sxx
Estimated standard error of the intercept is
s
x̄2
2
1
se(β̂0 ) = σ̂ +
n sxx
H0 : ρ = 0
Test statistic √
R n−2
T0 = √
1 − R2
has a t−distribution with n − 2 degrees of freedom if H0 is true.
Good luck!