All of Statistics Chapter 1 & 2
All of Statistics Chapter 1 & 2
All of Statistics Chapter 1 & 2
A Concise Course in
Statistical Inference
By Larry Wasserman
Toby Xu
UW Madison
05/29/07
Chapter 1 Overview
Sample Space
Disjoint or Mutually Exclusive
Probability Axioms/Basic Properties
Finite sample spaces
Independent Events
Conditional Probability
Baye’s Theorem
Sample Space
Sample Space Ω is the set of possible outcomes.
Ex: Toss fair coin twice Ω={HH,HT,TH,TT}
Points ω in Ω are called sample outcomes, realizations,
elements.
Subsets of Ω are called events.
Ex: Event that first toss is head is A={HH,HT}
Ac = complement of A (or NOT A)
∅ = complement of Ω
A∪ B = {ω ∈ Ω : ω ∈ A or ω ∈ B or ω ∈ both}
A∩ B = {ω ∈ Ω : ω ∈ A and ω ∈ B}
Disjoint or Mutually Exclusive
Two events (A and B) are mutually exclusive iff A∩ B = ∅
Monotone decreasing if A1 ⊃ A2 ⊃ A3 ⋯
Intro to Probability
P(A) = Probability Distribution or Probability Measure
Axioms:
1) P(A)≥0 for every A
2) P(Ω) = 1
If A1,A2…are disjoint then
∞ ∞
P (∪ A i ) = ∑ P ( A i )
i =1 i =1
Statistical Interference:
Frequentist
Bayesian Schools
Basic Properties of Probability
P(∅) = 0
A ⊂ B ⇒ P ( A) ≤ P (B )
0 ≤ P ( A) ≤ 1
A∩ B = ∅ ⇒ P ( A∪ B ) = P ( A) + P ( B ) For Disjoint Probabilities only
P(Ac)=1-P(A)
P ( A∪ B ) = P ( A) + P ( B ) − P ( AB )
A or B = (ABc) or (AB) or (AcB)
Probability on Finite Sample
Spaces
P(A)=A/Ω n n
= = 1
n n! 0 n
= N choose k n n
k k !( n − k )! =
k n − k
• N choose K is counting how many ways can we get a
k-subset out from a set with n elements.
•Ex: There’s 10 people in the Book Club; We want
groups of 2 people. How many possible
combinations?
10 10!
= = 45
2 2!(10 − 2)!
Independent Events
Two events are independent (does not directly affect the probability
of the other from happening) iff: P(AB) = P(A)P(B)
A set of events {Ai : i ε I} is independent if P (∩ A i ) = ∏ P ( A i )
i∈ J i∈ J
Assume A & B are disjoint events, each with +ve probability. Can
they be independent?
Ans: No because:
Independent = P(AB) = P(A)P(B)
But here P(AB)=φ=0 and P(A)P(B)>0
x P(X=x)
0 ¼
1 ½
2 ¼
Cumulative Distribution
Function
CDF = FX: R→[0,1] Theorem: let X and Y have
CDF F and G. If F(x) = G(x)
FX(x)=P(X≤x) for all x, then P(X∈ A) = P(Y ∈
Ex: Flip a fair coin twice A)
Theorem: CDF needs to
and let X be number of
satisfy:
heads F is non-decreasing: x1<x2,
also F(x1)≤F(x2)
0 x<0
1/4 F is normalized:
0 ≤ x<1
Fx ( x) = lim F ( x) = 0
3/4 1 ≤ x<2 x →−∞
P ( X ∈ A / Y = y ) = ∫ f ( x | y ) dx
A
Multivariate Distribution and IID
Samples
If X1,…,Xn are independent and each has the same marginal
distribution with CDF F, we say that X1,…,Xn are independent
and identically distributed and we write
X1,…Xn ~ F
Random sample size n from F
2 Important Multivariable
Distributions
Multinomial: multivariate version of a Binomial
X ~ Multinomial(n,p)
n
f ( x ) = P1^ ( x1)... Pk ^ ( xk )
x1 ... xk
Where
n n!
=
x1 ... xk x1!...xk !