
Clustering and Gaussian Mixture Model


Dr. Sayak Roychowdhury
Department of Industrial & Systems Engineering,
IIT Kharagpur
Reference
• Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
Application of K-means Clustering
• Image segmentation and compression
• The goal of segmentation is to partition an image into regions, each of which has a reasonably homogeneous visual appearance or corresponds to objects or parts of objects.
• Each pixel in an image is a point in a 3-dimensional space comprising the intensities of the RGB channels.
• After running K-means to convergence for a particular value of K, the segmented image is obtained by re-drawing the image, replacing each pixel vector with the {R, G, B} intensity triplet given by the centre $\mu_k$ to which that pixel has been assigned.
• Data compression: K-means can be used for lossy data compression.
• Each data point is approximated by the nearest cluster centre $\mu_k$.
• This framework is often called vector quantization, and the vectors $\mu_k$ are called code-book vectors (a short code sketch follows below).
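A minimal sketch of this vector-quantization idea, assuming scikit-learn and Pillow are available; the file names "image.png" and "segmented.png" are placeholders:

```python
# A minimal sketch (not part of the original slides): K-means colour
# quantization of an RGB image. "image.png" is a placeholder file name.
import numpy as np
from sklearn.cluster import KMeans
from PIL import Image

K = 8                                         # number of code-book vectors
img = np.asarray(Image.open("image.png").convert("RGB"), dtype=float) / 255.0
h, w, _ = img.shape

pixels = img.reshape(-1, 3)                   # each pixel is a point in RGB space
kmeans = KMeans(n_clusters=K, n_init=10, random_state=0).fit(pixels)

# Replace every pixel with the centre mu_k it was assigned to (vector quantization)
compressed = kmeans.cluster_centers_[kmeans.labels_].reshape(h, w, 3)
Image.fromarray((compressed * 255).astype(np.uint8)).save("segmented.png")
```

Only the K code-book vectors and the per-pixel labels need to be stored, which is the source of the compression.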
Image Segmentation with K-means

[Figure: image segmentation with K-means. Source: Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.]
Gaussian Distribution
• Univariate Gaussian distribution:

$$f(x \mid \mu, \sigma) = \frac{1}{\sigma\sqrt{2\pi}} \exp\!\left(-\frac{1}{2\sigma^2}(x-\mu)^2\right)$$

• Multivariate Gaussian distribution ($p$-dimensional):

$$f(x \mid \mu, \Sigma) = \frac{1}{(2\pi)^{p/2}\,|\Sigma|^{1/2}} \exp\!\left(-\frac{1}{2}(x-\mu)^T \Sigma^{-1} (x-\mu)\right)$$
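A short sketch of evaluating these two densities numerically, assuming NumPy and SciPy are available (the numbers are arbitrary example values):

```python
# A minimal sketch (not from the slides): evaluating the univariate and
# multivariate Gaussian densities with SciPy.
import numpy as np
from scipy.stats import norm, multivariate_normal

# Univariate: f(x | mu, sigma)
x = 1.5
mu, sigma = 0.0, 2.0
print(norm.pdf(x, loc=mu, scale=sigma))

# Multivariate: f(x | mu, Sigma) for p = 2
x_vec = np.array([1.0, -0.5])
mu_vec = np.zeros(2)
Sigma = np.array([[2.0, 0.3],
                  [0.3, 1.0]])
print(multivariate_normal.pdf(x_vec, mean=mu_vec, cov=Sigma))
```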
Gaussian Mixture

[Figure omitted. Source: Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.]
Gaussian Mixture

[Figure: the three Gaussian distributions that generated the data points. Source: Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.]
Gaussian Mixture

[Figure: the three Gaussian distributions that generated the data points, and the clustering obtained from the estimated posterior probabilities of the clusters under the GMM. Source: Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.]
Maximum Likelihood for Parameter Estimation

$$\ln f_k(x \mid \mu_k, \Sigma_k) = -\frac{1}{2}\ln|\Sigma_k| - \frac{1}{2}(x-\mu_k)^T \Sigma_k^{-1}(x-\mu_k) - \frac{p}{2}\ln(2\pi)$$

Differentiating and equating to 0:

$$\hat{\mu}_k = \frac{\sum_{g_i = k} x_i}{N_k}, \qquad \hat{\Sigma}_k = \frac{\sum_{g_i = k} (x_i - \hat{\mu}_k)(x_i - \hat{\mu}_k)^T}{N_k}$$

where $N_k$ is the number of data points in the $k$th cluster and $g_i = k$ indicates that $x_i$ is assigned to cluster $k$.
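A short sketch of these per-cluster maximum-likelihood estimates, assuming X is an (N, p) NumPy array and g holds hard cluster labels (e.g. obtained from K-means); the function name is hypothetical:

```python
# A minimal sketch: per-cluster ML estimates of mean and covariance,
# given hard assignments g in {0, ..., K-1}.
import numpy as np

def per_cluster_mle(X, g, K):
    """Maximum-likelihood mean and covariance of each cluster."""
    mus, Sigmas, Ns = [], [], []
    for k in range(K):
        Xk = X[g == k]                  # points with g_i = k
        Nk = len(Xk)
        mu_k = Xk.mean(axis=0)          # mu_hat_k = sum_{g_i=k} x_i / N_k
        diff = Xk - mu_k
        Sigma_k = diff.T @ diff / Nk    # Sigma_hat_k = sum (x_i - mu)(x_i - mu)^T / N_k
        mus.append(mu_k)
        Sigmas.append(Sigma_k)
        Ns.append(Nk)
    return np.array(mus), np.array(Sigmas), np.array(Ns)
```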


Gaussian Mixture
• Linear superposition of Gaussians:
$$f(x) = \sum_{k=1}^{K} w_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k)$$

Normalization and positivity of the weights (mixing coefficients):

$$0 \le w_k \le 1, \qquad \sum_{k=1}^{K} w_k = 1$$

• Log-likelihood:

$$\ln f(X \mid \mu, \Sigma, W) = \sum_{i=1}^{N} \ln f(x_i) = \sum_{i=1}^{N} \ln \sum_{k=1}^{K} w_k \, \mathcal{N}(x_i \mid \mu_k, \Sigma_k)$$
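A short sketch of this log-likelihood computation, assuming the mixture parameters are given as arrays and SciPy is used for the component densities; the function name is hypothetical:

```python
# A minimal sketch (not from the slides): log-likelihood of a Gaussian mixture,
# ln f(X | mu, Sigma, W) = sum_i ln sum_k w_k N(x_i | mu_k, Sigma_k).
import numpy as np
from scipy.stats import multivariate_normal

def gmm_log_likelihood(X, weights, means, covs):
    # densities[i, k] = w_k * N(x_i | mu_k, Sigma_k)
    densities = np.column_stack([
        w * multivariate_normal.pdf(X, mean=m, cov=S)
        for w, m, S in zip(weights, means, covs)
    ])
    return np.sum(np.log(densities.sum(axis=1)))
```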
Responsibilities
• The mixing coefficients can be thought of as prior probabilities
• For a given value of ‘x’, the posterior probabilities can be calculated, which are
also called “responsibilities”
• Using Bayes' rule:

$$\gamma_k(x) = f(k \mid x) = \frac{f(x \mid k)\, f(k)}{f(x)} = \frac{w_k f_k(x)}{\sum_{l} w_l f_l(x)} = \frac{w_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k)}{\sum_{l=1}^{K} w_l \, \mathcal{N}(x \mid \mu_l, \Sigma_l)}$$

where $w_k = N_k / N$.

$\gamma_k(x)$ can also be interpreted as the expected value of the latent cluster-indicator variable, which is what the E-step of EM computes.
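A short sketch of the responsibility calculation, again assuming SciPy for the component densities; the function name is hypothetical:

```python
# A minimal sketch: responsibilities
# gamma_k(x) = w_k N(x | mu_k, Sigma_k) / sum_l w_l N(x | mu_l, Sigma_l).
import numpy as np
from scipy.stats import multivariate_normal

def responsibilities(X, weights, means, covs):
    weighted = np.column_stack([
        w * multivariate_normal.pdf(X, mean=m, cov=S)
        for w, m, S in zip(weights, means, covs)
    ])                                           # shape (N, K)
    return weighted / weighted.sum(axis=1, keepdims=True)
```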


Expectation Maximization (EM) Algorithm
• The EM algorithm is an iterative optimization technique.
• Expectation step (E-step): for the given parameter values, compute the expected values of the latent variables.
• Maximization step (M-step): update the parameters of the model based on the computed expected values of the latent variables.
Expectation Maximization (EM) Algorithm
• Given a Gaussian mixture model, the goal is to maximize the likelihood function with respect to the means, covariances, and mixing coefficients.
• Initialize $\mu_j$, $\Sigma_j$ and the mixing coefficients $w_j$, and evaluate the initial log-likelihood.
• Expectation step: evaluate the responsibilities using the current parameter values:

$$\gamma_k(x) = \frac{w_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k)}{\sum_{l=1}^{K} w_l \, \mathcal{N}(x \mid \mu_l, \Sigma_l)}$$
Expectation Maximization (EM) Algorithm
• Maximization step: Reestimate the parameters using current
responsibilities:
$$\mu_k^{\text{new}} = \frac{1}{N_k}\sum_{n=1}^{N} \gamma(z_{nk})\, x_n, \qquad \text{where } N_k = \sum_{n=1}^{N} \gamma(z_{nk})$$
• The mean $\mu_k$ for the $k$th Gaussian component is obtained by taking a weighted mean of all of the points in the data set, in which the weighting factor for data point $x_n$ is given by the posterior probability $\gamma(z_{nk})$ that component $k$ was responsible for generating $x_n$.
Expectation Maximization (EM) Algorithm
• Setting the derivative of $\ln f(X \mid \mu, \Sigma, W)$ with respect to $\Sigma_k$ equal to 0 gives:

$$\Sigma_k^{\text{new}} = \frac{1}{N_k}\sum_{n=1}^{N} \gamma(z_{nk})\,(x_n - \mu_k^{\text{new}})(x_n - \mu_k^{\text{new}})^T$$

• Finally, maximize $\ln f(X \mid \mu, \Sigma, W)$ with respect to $w_k$ subject to the constraint

$$\sum_{k=1}^{K} w_k = 1$$

This can be achieved using a Lagrange multiplier and maximizing

$$\ln f(X \mid \mu, \Sigma, W) + \lambda\left(\sum_{k=1}^{K} w_k - 1\right),$$

resulting in

$$w_k^{\text{new}} = \frac{N_k}{N}, \qquad \text{where } N_k = \sum_{n=1}^{N} \gamma(z_{nk})$$

• Evaluate the log-likelihood

$$\ln f(X \mid \mu, \Sigma, W) = \sum_{i=1}^{N} \ln f(x_i) = \sum_{i=1}^{N} \ln \sum_{k=1}^{K} w_k \, \mathcal{N}(x_i \mid \mu_k, \Sigma_k)$$

and check for convergence.
• Iterate through E-step and M-step.
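Putting the E-step and M-step together, a compact sketch of the full EM loop is given below. It reuses the hypothetical responsibilities() and gmm_log_likelihood() helpers sketched earlier; the initialization scheme, the 1e-6·I regularization, and the stopping tolerance are arbitrary choices, not from the slides:

```python
# A minimal EM sketch for a Gaussian mixture (illustrative, not optimized;
# assumes the responsibilities() and gmm_log_likelihood() helpers above).
import numpy as np

def em_gmm(X, K, n_iter=100, tol=1e-6, seed=0):
    rng = np.random.default_rng(seed)
    N, p = X.shape
    # Simple initialization: random data points as means, shared covariance, equal weights
    means = X[rng.choice(N, K, replace=False)]
    covs = np.array([np.cov(X.T) + 1e-6 * np.eye(p) for _ in range(K)])
    weights = np.full(K, 1.0 / K)
    prev_ll = -np.inf
    for _ in range(n_iter):
        gamma = responsibilities(X, weights, means, covs)   # E-step, shape (N, K)
        Nk = gamma.sum(axis=0)                              # effective counts N_k
        means = (gamma.T @ X) / Nk[:, None]                 # mu_k^new
        for k in range(K):                                  # Sigma_k^new
            diff = X - means[k]
            covs[k] = (gamma[:, k, None] * diff).T @ diff / Nk[k] + 1e-6 * np.eye(p)
        weights = Nk / N                                    # w_k^new
        ll = gmm_log_likelihood(X, weights, means, covs)    # monitor convergence
        if abs(ll - prev_ll) < tol:
            break
        prev_ll = ll
    return weights, means, covs
```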
Expectation Maximization (EM)

[Figure: illustration of the EM algorithm for a Gaussian mixture. Source: Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.]
EM Algorithm
• Since K-means is faster, it is common to run the K-means algorithm to
find a suitable initialization for a Gaussian mixture model that is
subsequently adapted using EM.
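A brief sketch of this workflow with scikit-learn, assuming X is an (N, p) NumPy array; scikit-learn's GaussianMixture uses a K-means based initialization by default (init_params="kmeans"):

```python
# A brief sketch: fit a GMM whose EM run is initialized from K-means.
import numpy as np
from sklearn.mixture import GaussianMixture

X = np.random.default_rng(0).normal(size=(500, 2))   # placeholder data
gmm = GaussianMixture(n_components=3, init_params="kmeans", random_state=0).fit(X)

labels = gmm.predict(X)        # hard cluster assignments
resp = gmm.predict_proba(X)    # responsibilities gamma_k(x)
print(gmm.weights_, gmm.means_)
```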
