AE 248: AI and Data Science: Prabhu Ramachandran 2024-03-01
AE 248: AI and Data Science: Prabhu Ramachandran 2024-03-01
Sampling Statistics
Prabhu Ramachandran
2024-03-01
Sample
Sample mean
𝑋 + ... + 𝑋𝑛
𝑋̄ = 1
𝑛
. . .
̄
• What is 𝐸[𝑋]?
̄
• What is Var(𝑋)?
1
The Central Limit Theorem
Sample Variance
𝑛
2
∑𝑖=1 (𝑋𝑖 − 𝑋)̄ 2
𝑆 =
𝑛−1
• Compute 𝐸[𝑆 2 ]
Sample Variance
Using:
𝑛 𝑛
• ∑𝑖=1 (𝑥𝑖 − 𝑥)̄ 2 = ∑𝑖=1 𝑥2𝑖 − 𝑛𝑥2̄
• 𝐸[𝑋 2 ] = 𝑉 𝑎𝑟(𝑋) + 𝐸[𝑋]2
• Show that: 𝐸[𝑆 2 ] = 𝜎2
• 𝑋̄ is normally distributed
• 𝐸[𝑋]̄ = 𝜇
• 𝑉 𝑎𝑟[𝑋]̄ = 𝜎2 /𝑛
𝑛 𝑛
∑(𝑥𝑖 − 𝑥)̄ 2 = ∑(𝑥𝑖 − 𝜇)2 − 𝑛(𝑥̄ − 𝜇)2
𝑖=1 𝑖=1
2
Distribution of 𝑆 2
𝑛 √ 2
𝑛
𝑋𝑖 − 𝜇
2
∑𝑖=1 (𝑋𝑖 − 𝑋)̄ 2 𝑛(𝑋̄ − 𝜇)
∑( ) = −[ ]
𝑖=1
𝜎 𝜎2 𝜎
. . .
• LHS: 𝜒2𝑛
• RHS, second term, 𝜒21
Corollary
√ (𝑋̄ − 𝜇)
𝑛 ∼ 𝑡𝑛−1
𝑆
. . .
√
• Recall: 𝑛𝑍/√𝜒2𝑛