AI61004 Module4 Latentvariable
AI61004 Module4 Latentvariable
Adway Mitra
Center for Artificial Intelligence
Indian Institute of Technology Kharagpur
March 6, 2023
▶ Zi ∼ N (µ0 , Σ0 ), Xi ∼ N (WZi + µ, Σ)
▶ Observation X = {X1 , . . . , XN }, Parameters
θ = {W , µ0 , µ, Σ0 , Σ}, Latent Z = {Z1 , . . . , ZN }
▶ Posterior on latents pθ (Zi |Xi ) = N (µi , Σi ) where
Σi = (Σ−1 T
0 + W ΣW )
−1 and
µi = Σi (W Σ (Xi − µ) + Σ−1
T −1
0 µ0 )
▶ Marginal pθ (Xi ) = N (W µ0 + µ, Σ + W Σ0 W T )
▶ Log-Likelihood l(θ) = N
P
i=1 log (pθ (Xi ))
▶ Parameters can be estimated by maximum-likelihood
▶ Yi ∼ Cat(π), Zi ∼ N (µ0 , Σ0 )
▶ Xi |Zi , Yi = k ∼ N (Wk Zi + µk , Σk )
▶ Observation X = {X1 , . . . , XN }, Latent
Z = {Z1 , . . . , ZN , Y1 , . . . , YN }
▶ Parameters
θ = {π, W1 , . . . , WK , µ0 , µ1 , . . . , µK , Σ0 , Σ1 , . . . , ΣK }
▶ For simplicity, assume µ0 = 0, Σ0 = I , Σk = Σ
▶ Log-likelihood l(θ) = N
P
i=1 log (pθ (Xi ))
▶ Expected Complete Log-likelihood Er ,s (log (pθ ((X , Y , Z ))
where ri (Yi ) = pθ (Yi |Xi ) and si (Zi ) = pθ (Zi |Xi , Yi )
αt (Zt ) = prob(Zt , X1 , . . . , Xt )
X
= prob(Zt , Zt−1 , X1 , . . . , Xt )
Zt−1
X
= prob(Zt−1 , X1 , . . . , Xt−1 )prob(Zt , Xt |Zt−1 , X1 , . . . , Xt−1 )
Zt−1
X
= αt−1 (Zt−1 )prob(Zt , Xt |Zt−1 )
Zt−1
X
= αt−1 (Zt−1 )prob(Zt |Zt−1 )prob(Xt |Zt )
Zt−1
K K ,K
I (Z =k) I (Zt−1 =k,Zt =l)
Y Y
α1 (Z1 ) = πk 1 f (X1 , pk ), prob(Zt |Zt−1 ) = Akl
k=1 k,l=1
K T K ,K T
I (Z1 =k) I (Zt−1 =k,Zt =l)
Y Y Y Y
L(π, A, p) = πk × Akl × f (Xt , pZt )
k=1 t=2 k,l=1 t=1