
EM coding

Xingjie Shi

Dec 7, 23

Example
In this example, the mixture components are Gaussian distributions whose true means and variances are known to us only because we use them to simulate the data. We then use EM to find maximum likelihood estimates of the mixture proportions $\pi_k$, the component means $\mu_k$, and the component standard deviations $\sigma_k$.
Assume we have K = 2 components, so that:

$$X_i \mid Z_i = 0 \sim N(5,\ 1.5^2), \qquad X_i \mid Z_i = 1 \sim N(10,\ 2^2),$$

where 1.5 and 2 are the component standard deviations.

The true mixture proportions will be $P(Z_i = 0) = 0.25$ and $P(Z_i = 1) = 0.75$. First we simulate data from this mixture model:
# true mixture components (means and standard deviations)
mu.true    <- c(5, 10)
sigma.true <- c(1.5, 2)

# number of observations
n <- 10000

# determine Z_i (component labels), with P(Z_i = 1) = 0.75
Z <- rbinom(n, 1, 0.75)

# sample from the mixture model
X <- rnorm(n, mean = mu.true[Z + 1], sd = sigma.true[Z + 1])

hist(X, breaks = 15)

[Figure: Histogram of X (x-axis: X, y-axis: Frequency)]
Now we write a function to compute the log-likelihood for the incomplete data, assuming the parameters are
known. This will be used to determine convergence:
 
$$\ell(\theta) = \sum_{i=1}^{n} \log\left( \sum_{k=1}^{2} \pi_k \underbrace{N(x_i;\ \mu_k, \sigma_k^2)}_{\texttt{L[i,k]}} \right)$$

compute.log.lik <- function(L, w) {
  # weight each component density by its mixture proportion pi_k
  L[, 1] <- L[, 1] * w[1]
  L[, 2] <- L[, 2] * w[2]
  # sum the weighted densities within each row, then sum the logs over observations
  return(sum(log(rowSums(L))))
}

Given values for the component means and standard deviations, for each sample $X_i$ we can compute the component densities $P(X_i \mid Z_i = 0)$ and $P(X_i \mid Z_i = 1)$. We store these values in the columns of L:
l <- function(X, mu, sigma) {
  # L[i, k] holds the density of X_i under component k
  L <- matrix(NA, nrow = length(X), ncol = 2)
  L[, 1] <- dnorm(X, mean = mu[1], sd = sigma[1])
  L[, 2] <- dnorm(X, mean = mu[2], sd = sigma[2])
  return(L)
}
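As a quick sanity check (a minimal sketch reusing mu.true, sigma.true and the simulated X from above; the object name L.true is just illustrative), we can evaluate the log-likelihood at the true simulation parameters:

# component densities at the true means and standard deviations
L.true <- l(X, mu.true, sigma.true)
# log-likelihood at the true proportions P(Z_i = 0) = 0.25, P(Z_i = 1) = 0.75
compute.log.lik(L.true, w = c(0.25, 0.75))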

Finally, we implement the E and M steps in the EM.iter function below. The mixture.EM function is the driver, which checks for convergence by computing the log-likelihood at each step.
mixture.EM <- function(X, w.init, mu.init, sigma.init) {

  w.curr <- w.init
  mu.curr <- mu.init
  sigma.curr <- sigma.init

  # store log-likelihoods for each iteration
  log_liks <- c()
  L <- l(X, mu.curr, sigma.curr)
  ll <- compute.log.lik(L, w.curr)
  log_liks <- c(log_liks, ll)
  delta.ll <- 1e5

  # iterate until the increase in log-likelihood falls below the tolerance
  while(delta.ll > 1e-5) {
    oneIter <- EM.iter(X, w.curr, L)
    w.curr <- oneIter$w.next
    mu.curr <- oneIter$mu.next
    sigma.curr <- oneIter$sigma.next
    L <- l(X, mu.curr, sigma.curr)
    ll <- compute.log.lik(L, w.curr)
    log_liks <- c(log_liks, ll)
    delta.ll <- log_liks[length(log_liks)] - log_liks[length(log_liks) - 1]
  }
  return(list(w.curr, mu.curr, sigma.curr, log_liks))
}

EM.iter <- function(X, w.curr, L, ...) {

  K <- ncol(L)

  # E-step: compute E_{Z|X,w0}[I(Z_i = k)], the posterior responsibilities
  z_ik <- L
  for(i in 1:K) {
    z_ik[, i] <- w.curr[i] * z_ik[, i]
  }
  gam <- z_ik / rowSums(z_ik)
  Nk <- colSums(gam)

  # M-step: update proportions, means, and standard deviations
  w.next <- Nk / sum(Nk)
  mu.next <- colSums(gam * X) / Nk
  sigma.next <- numeric(K)
  for(i in 1:K) {
    sigma.next[i] <- sqrt(sum(gam[, i] * (X - mu.next[i])^2) / Nk[i])
  }
  list(w.next = w.next,
       mu.next = mu.next,
       sigma.next = sigma.next)
}
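For reference, these are the standard GMM updates that EM.iter implements, written in the notation used above ($\gamma_{ik}$ corresponds to gam[i, k] and $N_k$ to Nk in the code). The E-step computes the responsibilities

$$\gamma_{ik} = \frac{\pi_k\, N(x_i;\ \mu_k, \sigma_k^2)}{\sum_{j=1}^{K} \pi_j\, N(x_i;\ \mu_j, \sigma_j^2)}, \qquad N_k = \sum_{i=1}^{n} \gamma_{ik},$$

and the M-step re-estimates the parameters as

$$\pi_k \leftarrow \frac{N_k}{n}, \qquad \mu_k \leftarrow \frac{1}{N_k}\sum_{i=1}^{n} \gamma_{ik}\, x_i, \qquad \sigma_k \leftarrow \sqrt{\frac{1}{N_k}\sum_{i=1}^{n} \gamma_{ik}\,(x_i - \mu_k)^2}.$$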
# perform EM
ee <- mixture.EM(X, w.init = c(0.5, 0.5), mu.init = c(4, 12), sigma.init = c(2, 2))
print(paste("proportion = (", round(ee[[1]][1],2), ",", round(ee[[1]][2],2), ")", sep=""))

## [1] "proportion = (0.23,0.77)"


print(paste("mu = (", round(ee[[2]][1],2), ",", round(ee[[2]][2],2), ")", sep=""))

## [1] "mu = (4.98,9.94)"


print(paste("sigma = (", round(ee[[3]][1],2), ",", round(ee[[3]][2],2), ")", sep=""))

## [1] "sigma = (1.55,2.02)"

Finally, we inspect the evolution of the log-likelihood and note that it strictly increases:
plot(ee[[4]], ylab='marginal log-likelihood', xlab='iteration')
[Figure: marginal log-likelihood (y-axis, roughly -28000 to -26000) plotted against iteration (x-axis, 0 to about 150)]
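As a numerical counterpart to the plot (a minimal sketch using the stored log-likelihoods in ee[[4]]), we can verify that the sequence never decreases:

# TRUE if each EM iteration increased (or left unchanged) the log-likelihood
all(diff(ee[[4]]) >= 0)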

HW
Please provide an EM algorithm for a GMM with an arbitrary number of components and dimensions.
