
Machine learning: lecture 14

Tommi S. Jaakkola
MIT CSAIL
[email protected]

Topics

• Gaussian mixtures and the EM-algorithm
  – complete, incomplete, and inferred data
  – EM for mixtures
  – demo
  – EM and convergence
  – regularized mixtures
  – selecting the number of mixture components
  – Gaussian mixtures for classification

Review: mixture densities

• A Gaussian mixture model with m components is defined as

      p(x|θ) = \sum_{j=1}^m p_j p(x|µ_j, Σ_j)

  where θ = {p_1, . . . , p_m, µ_1, . . . , µ_m, Σ_1, . . . , Σ_m} contains all the
  parameters of the mixture model.
• We have to estimate these models from incomplete data involving only the x
  samples; the assignment to components has to be inferred.

Types of data: complete

      p(x|θ) = \sum_{j=1}^m p_j p(x|µ_j, Σ_j)

• When the available data is complete, each sample contains the setting of all
  the variables in the model:

      x      y
      x_1    0 1 . . . 0
      x_2    0 0 . . . 1
      ...
      x_n    0 1 . . . 0

  The parameter estimation problem is in this case straightforward (each
  component Gaussian can be estimated separately).
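As a concrete illustration of the mixture density above, here is a minimal numerical sketch (not from the lecture; the two-component parameter values are made up) that evaluates p(x|θ) at a point.

```python
import numpy as np
from scipy.stats import multivariate_normal

def mixture_density(x, mix_props, means, covs):
    """Evaluate p(x|theta) = sum_j p_j N(x | mu_j, Sigma_j)."""
    return sum(p_j * multivariate_normal.pdf(x, mean=mu_j, cov=S_j)
               for p_j, mu_j, S_j in zip(mix_props, means, covs))

# Illustrative (made-up) two-component mixture in 2D.
mix_props = [0.3, 0.7]
means = [np.array([0.0, 0.0]), np.array([3.0, 3.0])]
covs = [np.eye(2), 2.0 * np.eye(2)]

print(mixture_density(np.array([1.0, 1.0]), mix_props, means, covs))
```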

Types of data: incomplete

      p(x|θ) = \sum_{j=1}^m p_j p(x|µ_j, Σ_j)

• Incomplete data for a mixture model typically contain only the x samples:

      x
      x_1
      x_2
      ...
      x_n

  To estimate the parameters we have to infer which component Gaussian was
  responsible for generating each sample x_i.

Types of data: inferred

      p(x|θ) = \sum_{j=1}^m p_j p(x|µ_j, Σ_j)

• We can infer the values for the missing data based on the current setting of
  the parameters:

      x      y
      x_1    P(y = 1|x_1, θ)   P(y = 2|x_1, θ)   . . .   P(y = m|x_1, θ)
      x_2    P(y = 1|x_2, θ)   P(y = 2|x_2, θ)   . . .   P(y = m|x_2, θ)
      ...
      x_n    P(y = 1|x_n, θ)   P(y = 2|x_n, θ)   . . .   P(y = m|x_n, θ)

  The parameter estimation problem is again easy if we treat the inferred data
  as complete data. The solution has to be iterative, however.
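The inferred table above is just Bayes' rule applied per sample; below is a minimal sketch (the helper name is my own, not from the lecture) that fills in P(y = j|x_i, θ) for every sample.

```python
import numpy as np
from scipy.stats import multivariate_normal

def posterior_assignments(X, mix_props, means, covs):
    """Return an (n, m) table with entries P(y = j | x_i, theta)."""
    # Joint weights p_j * p(x_i | mu_j, Sigma_j) for each sample and component.
    joint = np.column_stack([
        p_j * multivariate_normal.pdf(X, mean=mu_j, cov=S_j)
        for p_j, mu_j, S_j in zip(mix_props, means, covs)
    ])
    # Normalize each row so the responsibilities sum to one.
    return joint / joint.sum(axis=1, keepdims=True)
```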


The EM-algorithm

Step 0: specify the initial setting of the parameters θ = θ(0) of the mixture

      p(x|θ) = \sum_{j=1}^m p_j p(x|µ_j, Σ_j)

For example, we could
– set each µ_j to an x sampled at random from the training set
– set each Σ_j to be the sample covariance of the whole data
– set the mixing proportions p_j to be uniform, p_j = 1/m.
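A minimal sketch of the Step 0 initialization just described (random means from the training set, shared sample covariance, uniform mixing proportions); the function name and random-number handling are my own.

```python
import numpy as np

def initialize_parameters(X, m, rng=None):
    """Step 0: random means from the data, shared sample covariance, uniform p_j."""
    rng = np.random.default_rng() if rng is None else rng
    n, d = X.shape
    means = X[rng.choice(n, size=m, replace=False)]   # each mu_j = a random training point
    covs = [np.cov(X, rowvar=False)] * m              # each Sigma_j = sample covariance of all data
    mix_props = np.full(m, 1.0 / m)                   # p_j = 1/m
    return mix_props, means, covs
```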

The EM-algorithm (continued)

E-step: complete the incomplete data with the posterior probabilities

      P(y = j|x_i, θ(k)),   j = 1, . . . , m,   i = 1, . . . , n

M-step: find the new setting of the parameters θ(k+1) by maximizing the
log-likelihood of the completed (inferred) data,

      θ(k+1) = argmax_θ \sum_{i=1}^n \sum_{j=1}^m P(y = j|x_i, θ(k)) log [ p_j p(x_i|µ_j, Σ_j) ]

where p_j p(x_i|µ_j, Σ_j) = P(x_i, y = j|θ).

Demo

[Demo of EM on a Gaussian mixture in the original lecture.]
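Below is a compact sketch of one full EM iteration following the E-step and M-step above. The function name is mine, and the closed-form updates used in the M-step (weighted proportions, means, and covariances) are the standard maximizers of the completed-data log-likelihood rather than code taken from the slides.

```python
import numpy as np
from scipy.stats import multivariate_normal

def em_step(X, mix_props, means, covs):
    """One EM iteration: E-step responsibilities, then closed-form M-step updates."""
    n, d = X.shape
    m = len(mix_props)

    # E-step: q[i, j] = P(y = j | x_i, theta^(k)).
    joint = np.column_stack([
        p_j * multivariate_normal.pdf(X, mean=mu_j, cov=S_j)
        for p_j, mu_j, S_j in zip(mix_props, means, covs)
    ])
    q = joint / joint.sum(axis=1, keepdims=True)

    # M-step: weighted maximum-likelihood estimates on the completed data.
    n_hat = q.sum(axis=0)                          # effective counts n_hat_j
    new_props = n_hat / n                          # p_j
    new_means = (q.T @ X) / n_hat[:, None]         # mu_j
    new_covs = []
    for j in range(m):
        diff = X - new_means[j]
        new_covs.append((q[:, j, None] * diff).T @ diff / n_hat[j])  # Sigma_j
    return new_props, new_means, new_covs
```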

Topics (recap; next: EM and convergence)

EM-algorithm: convergence

      p(x|θ) = \sum_{j=1}^m p_j p(x|µ_j, Σ_j)

• The EM-algorithm monotonically increases the log-likelihood of the training
  data. In other words,

      l(θ(0)) < l(θ(1)) < l(θ(2)) < . . .   until convergence,

  where l(θ(k)) = \sum_{i=1}^n log p(x_i|θ(k)).

  [Figure: training-data log-likelihood (roughly −500 up to 200) plotted
  against the EM iteration (0 to 35); the curve increases monotonically and
  flattens out at convergence.]
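To observe the monotone increase numerically, one can track l(θ(k)) across iterations. A minimal sketch follows; it assumes the initialize_parameters and em_step helpers sketched earlier and some (n, d) data matrix X, so the usage part is left as comments.

```python
import numpy as np
from scipy.stats import multivariate_normal

def log_likelihood(X, mix_props, means, covs):
    """l(theta) = sum_i log p(x_i | theta) for a Gaussian mixture."""
    joint = np.column_stack([
        p_j * multivariate_normal.pdf(X, mean=mu_j, cov=S_j)
        for p_j, mu_j, S_j in zip(mix_props, means, covs)
    ])
    return np.log(joint.sum(axis=1)).sum()

# Assuming the earlier sketches and a data matrix X:
# params = initialize_parameters(X, m=3)
# history = [log_likelihood(X, *params)]
# for _ in range(30):
#     params = em_step(X, *params)
#     history.append(log_likelihood(X, *params))
# assert all(b >= a - 1e-9 for a, b in zip(history, history[1:]))
```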


EM-algorithm: auxiliary objective

• We first introduce possible posterior assignments {Q(j|i)} and the
  corresponding auxiliary likelihood objective:

      l(θ(k)) = \sum_{i=1}^n log p(x_i|θ(k))
              = \sum_{i=1}^n log \sum_{j=1}^m p_j(k) p(x_i|µ_j(k), Σ_j(k))
              = \sum_{i=1}^n log \sum_{j=1}^m Q(j|i) [ p_j(k) p(x_i|µ_j(k), Σ_j(k)) / Q(j|i) ]
              ≥ \sum_{i=1}^n \sum_{j=1}^m Q(j|i) log [ p_j(k) p(x_i|µ_j(k), Σ_j(k)) / Q(j|i) ]
              = l(Q; θ(k))

  where the inequality is Jensen's inequality applied to the concave log.

• The auxiliary objective

      l(Q; θ(k)) = \sum_{i=1}^n \sum_{j=1}^m Q(j|i) log [ p_j(k) p(x_i|µ_j(k), Σ_j(k)) / Q(j|i) ]  ≤  l(θ(k))

  recovers the log-likelihood of the data at the correct posterior assignments.
  In other words,

      max_Q l(Q; θ(k)) = l(Q(k); θ(k)) = l(θ(k))

  where Q(k)(j|i) = P(y = j|x_i, θ(k)) are the posterior assignments
  corresponding to the parameters θ(k).
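A quick numerical sanity check of the bound (my own sketch, with made-up data and parameters): for arbitrary assignments Q the auxiliary objective stays below l(θ), and at Q(j|i) = P(y = j|x_i, θ) the two coincide.

```python
import numpy as np
from scipy.stats import multivariate_normal

rng = np.random.default_rng(0)

# Made-up data and a made-up 2-component mixture in 2D.
X = rng.normal(size=(50, 2))
mix_props = [0.4, 0.6]
means = [np.zeros(2), np.ones(2)]
covs = [np.eye(2), 2.0 * np.eye(2)]

joint = np.column_stack([
    p_j * multivariate_normal.pdf(X, mean=mu_j, cov=S_j)
    for p_j, mu_j, S_j in zip(mix_props, means, covs)
])                                           # joint[i, j] = p_j p(x_i | mu_j, Sigma_j)

log_lik = np.log(joint.sum(axis=1)).sum()    # l(theta)

def auxiliary(Q):
    """l(Q; theta) = sum_ij Q(j|i) log( joint[i, j] / Q(j|i) )."""
    return (Q * np.log(joint / Q)).sum()

Q_random = rng.dirichlet(np.ones(2), size=50)            # arbitrary assignments
Q_posterior = joint / joint.sum(axis=1, keepdims=True)   # Q(j|i) = P(y = j|x_i, theta)

print(auxiliary(Q_random) <= log_lik + 1e-9)        # the bound holds
print(np.isclose(auxiliary(Q_posterior), log_lik))  # equality at the posterior
```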

EM-algorithm: max-max and monotonicity

• We can now rewrite the EM-algorithm in terms of two maximization steps
  involving the auxiliary objective:

      E-step:  Q(k) = argmax_Q l(Q; θ(k))
      M-step:  θ(k+1) = argmax_θ l(Q(k); θ)

  The monotonic increase of the log-likelihood now follows from the facts that
  1) neither step can decrease the auxiliary objective, and 2) the auxiliary
  objective equals the log-likelihood after each E-step:

      l(θ(k)) = l(Q(k); θ(k))
              ≤ l(Q(k); θ(k+1))
              ≤ l(Q(k+1); θ(k+1)) = l(θ(k+1))

Topics (recap; next: regularized mixtures)

Regularized EM

• Even a single covariance matrix in the Gaussian mixture model

      p(x|θ) = \sum_{j=1}^m p_j p(x|µ_j, Σ_j)

  involves a number of parameters and can easily lead to over-fitting.
• We can regularize the model by assigning a prior distribution over the
  parameters, especially the covariance matrices.

Regularized EM: prior

• A Wishart prior over each covariance matrix is given by

      P(Σ|S, n′) ∝ (1 / |Σ|^{n′/2}) exp( −(n′/2) Trace(Σ^{−1} S) )

  (written here in a bit non-standard way), where

      S  = “prior” covariance matrix
      n′ = equivalent sample size

  The equivalent sample size represents the number of training samples we
  would have to see in order for the prior and the data to have an equal
  effect on the solution.
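The prior enters the estimation only through its logarithm. Here is a small sketch (the helper name is mine) of the log-prior in the slide's parameterization, up to an additive constant.

```python
import numpy as np

def wishart_log_prior(Sigma, S, n_prime):
    """log P(Sigma | S, n') up to an additive constant, in the slide's form:
    -(n'/2) * log|Sigma| - (n'/2) * Trace(Sigma^{-1} S)."""
    sign, logdet = np.linalg.slogdet(Sigma)
    return -0.5 * n_prime * logdet - 0.5 * n_prime * np.trace(np.linalg.solve(Sigma, S))
```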


Regularized EM Regularized EM: demo
• The E-step is unaffected (though the resulting values for the
soft assignments will change)
• In the M-step we now maximize a penalized log-likelihood of
the weighted training set:
m " p̂(j|i)
n !
! #$ % ) * !m
P (y = j|xi, θ(k)) log pj p(xi|µj , Σj ) + log P (Σj |S, n!)
i=1 j=1 j=1

Formally the regularization penalty changes the resulting


covariance estimates only slightly:
+ n ,
(k+1) 1 !
Σj ← p̂(j|i) (xi − µ̂j )(xi − µ̂j ) + n S
T !
n̂j + n! i=1

Tommi Jaakkola, MIT CSAIL 19 Tommi Jaakkola, MIT CSAIL 20
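A minimal sketch (my own) of the regularized covariance update above, given the responsibilities p̂(j|i), the updated means µ̂_j, the prior covariance S, and the equivalent sample size n′.

```python
import numpy as np

def regularized_covariances(X, q, means, S, n_prime):
    """Sigma_j <- (sum_i q[i,j] (x_i - mu_j)(x_i - mu_j)^T + n' S) / (n_hat_j + n')."""
    n, d = X.shape
    m = q.shape[1]
    n_hat = q.sum(axis=0)                          # n_hat_j = sum_i q[i, j]
    covs = []
    for j in range(m):
        diff = X - means[j]
        scatter = (q[:, j, None] * diff).T @ diff  # weighted scatter matrix
        covs.append((scatter + n_prime * S) / (n_hat[j] + n_prime))
    return covs
```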

Topics (recap; next: selecting the number of mixture components)

Model selection and mixtures

• As a simple strategy for selecting the appropriate number of mixture
  components, we can find the m that minimizes the overall description length
  (cf. BIC):

      DL ≈ − log p(data|θ̂_m) + (d_m/2) log(n)

  where
  – n is the number of training points,
  – θ̂_m are the maximum likelihood parameters for the m-component mixture, and
  – d_m is the (effective) number of parameters in the m-component mixture.
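A sketch of scoring candidate values of m by description length; the helper name and the explicit parameter count for full-covariance Gaussian mixtures in d dimensions are my own additions, not taken from the slides.

```python
import numpy as np

def description_length(neg_log_lik, m, d, n):
    """DL ~ -log p(data | theta_hat_m) + (d_m / 2) * log(n).

    Assumes full-covariance Gaussian mixtures in d dimensions, so
    d_m = (m - 1) + m*d + m*d*(d + 1)/2  (mixing proportions + means + covariances).
    """
    d_m = (m - 1) + m * d + m * d * (d + 1) // 2
    return neg_log_lik + 0.5 * d_m * np.log(n)

# With d = 2 and n = 400 this parameter count reproduces the penalties reported
# on the example slides, e.g. 14.98 for m = 1:
print(description_length(2017.38, m=1, d=2, n=400))   # ~2032.36
```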

Model selection: example

• Typical cases:

      m=1, -logP(data)=2017.38, penalty=14.98, DL=2032.36
      m=2, -logP(data)=1712.69, penalty=32.95, DL=1745.65
      m=3, -logP(data)=1711.40, penalty=50.93, DL=1762.32
      m=4, -logP(data)=1682.06, penalty=68.90, DL=1750.97

• Best cases (out of several runs):

      m=1, -logP(data)=2017.38, penalty=14.98, DL=2032.36
      m=2, -logP(data)=1712.69, penalty=32.95, DL=1745.65
      m=3, -logP(data)=1678.56, penalty=50.93, DL=1729.49
      m=4, -logP(data)=1649.08, penalty=68.90, DL=1717.98

  [Figures: for each case, four scatter-plot panels, one per value of m
  (axes roughly −4 to 8 by −4 to 12).]


Topics Classification example
• Gaussian mixtures and the EM-algorithm • A digit recognition problem (8x8 binary digits)
– complete, incomplete, and inferred data Training set n = 100 (50 examples of each digit).
– EM for mixtures Test set n = 400 (200 examples of each digit).
– demo • We’d like to estimate class conditional mixture models (and
– EM and convergence prior class frequencies) to solve the classification problem
– regularized mixtures
– selecting the number of mixture components
– Gaussian mixtures for classification

Tommi Jaakkola, MIT CSAIL 25 Tommi Jaakkola, MIT CSAIL 26

• For example:

      Class 1: P(y = 1), p(x|θ_1)   (e.g., a 3-component mixture)
      Class 0: P(y = 0), p(x|θ_0)   (e.g., a 3-component mixture)

  A new test example x would be classified according to

      Class = 1  if  log [ P̂(y = 1) p(x|θ̂_1) / ( P̂(y = 0) p(x|θ̂_0) ) ] > 0

  and Class = 0 otherwise.
• Each class-conditional density is itself a mixture, e.g.,

      p(x|θ_0) = \sum_{j=1}^3 p_{j|0} p(x|µ_{j|0}, Σ_{j|0})

  (a hierarchical mixture model).

  [Figure: the class prior P(y) over the class labels y = 0 and y = 1, with
  each class expanding into its conditional mixture components j = 1, . . . , 3
  (mixing proportions p_{j|0}, component densities p(x|µ_{j|0}, Σ_{j|0})).]
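A sketch of the decision rule above (function names are mine); scikit-learn's GaussianMixture stands in here for the class-conditional mixture estimates, which is my assumption rather than the lecture's own implementation.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def fit_class_conditionals(X0, X1, n_components=3):
    """Fit one mixture per class plus the prior class frequency P_hat(y = 1)."""
    gm0 = GaussianMixture(n_components=n_components).fit(X0)
    gm1 = GaussianMixture(n_components=n_components).fit(X1)
    prior1 = len(X1) / (len(X0) + len(X1))
    return gm0, gm1, prior1

def classify(X, gm0, gm1, prior1):
    """Class = 1 if log [ P_hat(y=1) p(x|theta_1) / (P_hat(y=0) p(x|theta_0)) ] > 0."""
    log_odds = (np.log(prior1) + gm1.score_samples(X)
                - np.log(1.0 - prior1) - gm0.score_samples(X))
    return (log_odds > 0).astype(int)
```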

Classification example

• The figure gives the number of misclassified examples on the test set as a
  function of the number of mixture components in each class-conditional
  model.

  [Figure: test-set misclassification count (roughly 26 to 44) versus the
  number of mixture components per class (0 to 10).]
