Lecture 18
Lecture 18
3/31/2025
435-Spring-2025 correlation
Definition and properties
435-Spring-2025 correlation
Multinomial distribution
Let X1 , ..., Xn be iid with P(X1 = m) = pm , m = 1, ..., M. Denote
the frequency of category m in n trials
Nm = I{X1 =m} + · · · + I{Xn =m} . Then
n!
P(N1 = n1 , ..., NM = nM ) = pn1 · · · pnMM
n1 ! · · · nM ! 1
(why?) where n = n1 + · · · + nM .
Facts:
N1 + · · · + NM = n and p1 + · · · + pM = 1. Hence the free
parameters are n and pm , m = 1, ..., M − 1.
Bin(n, p) is a special case of multinomial distribution with
M = 2, p1 = p and p2 = q = 1 − p.
Nm ∼ Bin(n, pm ); Nk + Nm ∼ Bin(n, pk + pm ) and
(Nk , Nm , n − Nk − Nm ) follows a multinomial distribution with
parameters {n; pk , pm , 1 − pk − pm }.
435-Spring-2025 correlation
Negative correlation
Fact: Show that Cov(Nk , Nm ) = −n pk pm for k ̸= m in the
multinomial model.
Proof: The representation Nm = I{X1 =m} + · · · + I{Xn =m} and
the bi-linearity imply
Cov(Nk , Nm )
Xn Xn
= {E[I{Xi =k, Xj =m} ] − E[I{Xi =k} ] E[I{Xj =m} ]}
i=1 j=1
Xn X n
= {P(Xi = k, Xj = m)] − P(Xi = k) P(Xj = m)}
i=1 j=1
X X
= {· · · } + {· · · }
i̸=j i=j
= −n pk pm .
P
Why? Note the first double sum i̸=j {· · · } = 0.
435-Spring-2025 correlation
Examples of multinomial distribution
435-Spring-2025 correlation
Examples continued
435-Spring-2025 correlation