
Mixture Models and EM Algorithm

S. Sumitra

Clustering problems can be solved by applying a model-based approach, which
consists of using certain models for the clusters and attempting to optimize the fit
between the data and the model. Each cluster (component) can be mathematically
represented by a parametric distribution, e.g., Gaussian (continuous) or Poisson
(discrete). The entire data set is therefore modelled by a mixture of these distributions.
An individual distribution used to model a specific cluster is often referred to
as a component distribution.
Let there be k clusters. Let the random variable C denote the component, with
values 1, \ldots, k. Here we are considering Gaussian mixture models, so
x_j/(C = i) \sim \mathcal{N}(\mu_i, \Sigma_i), where \mu_i and \Sigma_i are the mean and covariance matrix of the ith class.
A data point is generated by first choosing a component and then generating a
sample from that component. By the total probability theorem,

p(x) = \sum_{i=1}^{k} p(C = i)\, p(x/C = i)    (1)

[p(C = i) is analogous to p(y = i) in Gaussian discriminant analysis.]
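
As a small illustration of the generative story and Eq. (1), the following Python sketch draws points from a two-component Gaussian mixture and evaluates p(x) by the total probability rule. The parameter values and function names (sample, mixture_pdf) are illustrative, not taken from the notes, and numpy and scipy are assumed to be available.

# Sketch: sampling from a 2-component Gaussian mixture and evaluating p(x)
# via the total probability rule (Eq. 1). Parameter values are illustrative.
import numpy as np
from scipy.stats import multivariate_normal

rng = np.random.default_rng(0)

w = np.array([0.3, 0.7])                              # component weights p(C = i)
mu = [np.array([0.0, 0.0]), np.array([3.0, 3.0])]     # component means
Sigma = [np.eye(2), np.array([[1.0, 0.5], [0.5, 1.0]])]  # component covariances

# Generative process: first choose a component, then sample from it.
def sample(n):
    comps = rng.choice(len(w), size=n, p=w)
    return np.array([rng.multivariate_normal(mu[c], Sigma[c]) for c in comps])

# Mixture density p(x) = sum_i p(C = i) p(x / C = i).
def mixture_pdf(x):
    return sum(w[i] * multivariate_normal(mu[i], Sigma[i]).pdf(x)
               for i in range(len(w)))

X = sample(5)
print(mixture_pdf(X))   # p(x) evaluated at each sampled point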


To determine to which cluster each x_j belongs, p(C = i/x_j) has to be found. Now

p(C = i/x_j) = p_{ij} = \frac{p(C = i)\, p(x_j/C = i)}{p(x_j)}, \quad i = 1, 2, \ldots, k, \; j = 1, 2, \ldots, N    (2)

Hence \sum_{i=1}^{k} p_{ij} = 1. Let w_i = p(C = i), i = 1, 2, \ldots, k. Therefore the unknown
parameters of a mixture of Gaussians are w_i, \mu_i and \Sigma_i.
The EM algorithm can be applied to determine the unknown parameters. The
EM algorithm has two main steps: the E step and the M step. In the E step, it assumes
values for the model parameters (that is, w_i, \mu_i and \Sigma_i) and finds p(C = i/x_j), i = 1, 2, \ldots, k,
j = 1, 2, \ldots, N. In the M step, it updates the parameters of the model. The process
iterates until convergence.

E step
In the E step, compute the probabilities p_{ij}, i = 1, 2, \ldots, k, j = 1, 2, \ldots, N.
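
A minimal sketch of the E step, assuming the mixture parameters are stored in numpy arrays as in the earlier snippet: it computes the membership probabilities p_{ij} = p(C = i/x_j) of Eq. (2). The function name e_step is a hypothetical helper, not named in the notes.

# Sketch of the E step: compute p_ij = p(C = i / x_j) for all i, j (Eq. 2).
import numpy as np
from scipy.stats import multivariate_normal

def e_step(X, w, mu, Sigma):
    N, k = X.shape[0], len(w)
    p = np.zeros((k, N))
    for i in range(k):
        # numerator of Eq. (2): p(C = i) p(x_j / C = i)
        p[i] = w[i] * multivariate_normal(mu[i], Sigma[i]).pdf(X)
    p /= p.sum(axis=0, keepdims=True)   # divide by p(x_j); each column now sums to 1
    return p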

M step
Compute the new mean, covariance and component weights as follows:
\mu_i = \frac{\sum_{j=1}^{N} p_{ij}\, x_j}{\sum_{j=1}^{N} p_{ij}}

[For a sure event, \mu_i = \frac{\sum_j 1\{x_j \in C = i\}\, x_j}{\sum_j 1\{x_j \in C = i\}}. Here, we don't know whether x_j is in
component i; we only know p(C = i/x_j).]

\Sigma_i = \frac{\sum_{j=1}^{N} p_{ij}\,(x_j - \mu_i)(x_j - \mu_i)^T}{\sum_{j=1}^{N} p_{ij}}

w_i = \frac{\sum_{j=1}^{N} p_{ij}}{N}
[Compare these formulas with those of Gaussian discriminant analysis]
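
The corresponding M step as a sketch, with the same array conventions as above: given the membership probabilities p_{ij} from the E step, it re-estimates the means, covariances and component weights by the weighted averages just stated. The helper name m_step is hypothetical.

# Sketch of the M step: re-estimate mu_i, Sigma_i, w_i from the p_ij.
import numpy as np

def m_step(X, p):
    k, N = p.shape
    counts = p.sum(axis=1)                 # effective number of points per component
    w = counts / N                         # new component weights
    mu = (p @ X) / counts[:, None]         # new weighted means
    Sigma = []
    for i in range(k):
        d = X - mu[i]                      # deviations of every point from mu_i
        Sigma.append((p[i, :, None] * d).T @ d / counts[i])   # weighted covariance
    return w, mu, np.array(Sigma)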
The algorithm can be summarized as follows:

Algorithm 1 EM algorithm
Initialize \mu_i, \Sigma_i, w_i, i = 1, 2, \ldots, k
Iterate until convergence:
  E Step
  for i = 1 to k do
    for j = 1 to N do
      calculate p(x_j/C = i) = \frac{1}{(2\pi)^{n/2} |\Sigma_i|^{1/2}} \exp\left(-\frac{1}{2}(x_j - \mu_i)^T \Sigma_i^{-1} (x_j - \mu_i)\right)
      calculate p_{ij} = \frac{p(x_j/C = i)\, w_i}{\sum_{l=1}^{k} p(x_j/C = l)\, w_l}
    end for
    p_i = \sum_{j=1}^{N} p_{ij}
  end for
  M Step
  for i = 1 to k do
    calculate \mu_i = \frac{\sum_{j=1}^{N} p_{ij}\, x_j}{p_i}
    calculate \Sigma_i = \frac{\sum_{j=1}^{N} p_{ij}\,(x_j - \mu_i)(x_j - \mu_i)^T}{p_i}
    set w_i = \frac{p_i}{N}
  end for
end
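
Putting the two steps together, a sketch of the full iteration of Algorithm 1, reusing the e_step and m_step helpers defined in the earlier snippets. The random initialization and the stopping test on the change in the means are illustrative choices, not prescribed by the notes.

# Sketch of the full EM loop (Algorithm 1), reusing e_step and m_step above.
import numpy as np

def em(X, k, n_iter=100, tol=1e-6, seed=0):
    rng = np.random.default_rng(seed)
    N, n = X.shape
    # Illustrative initialization: k random data points as means, identity covariances.
    mu = X[rng.choice(N, size=k, replace=False)]
    Sigma = np.array([np.eye(n) for _ in range(k)])
    w = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        p = e_step(X, w, mu, Sigma)               # E step: membership probabilities p_ij
        w_new, mu_new, Sigma_new = m_step(X, p)   # M step: updated parameters
        converged = np.abs(mu_new - mu).max() < tol   # illustrative convergence test
        w, mu, Sigma = w_new, mu_new, Sigma_new
        if converged:
            break
    return w, mu, Sigma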

