EM Algorithm

This document uses the Expectation-Maximization (EM) algorithm to fit a mixture of two normal distributions to simulated data. In the first example, each point is drawn from N(1,1) with probability 0.25 and from N(7,1) otherwise; in the second, the components are N(1,1) and N(4,1). Each iteration estimates the latent class memberships (E-step) and then updates the mixing proportions and component means (M-step). In both examples the estimates settle close to the true parameters, after about 10 iterations in the first case and about 30 in the second, and the second fit is checked against normalmixEM.

EM Algorithm

YIK LUN, KEI

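set.seed(123)
tau_1_true <- 0.25        ## true probability of the "heads" component
x <- y <- rep(0, 1000)    ## x: observations, y: true component labels
for (i in 1:1000) {
  if (runif(1) < tau_1_true) {
    x[i] <- rnorm(1, mean = 1); y[i] <- "heads"   ## component 1: N(1, 1)
  } else {
    x[i] <- rnorm(1, mean = 7); y[i] <- "tails"   ## component 2: N(7, 1)
  }
}
library(lattice)
## density of x; the symbol colours are taken from the true labels
densityplot(~x, par.settings = list(plot.symbol = list(col = as.factor(y))))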

[Figure: density plot of x for the first simulation. Vertical axis: Density (0.00 to 0.25); horizontal axis: x.]
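For reference, the model implied by the simulation, and the updates that the loop below implements, can be written out as follows. This is a sketch in notation introduced here rather than taken from the original: \phi denotes the standard normal density (both standard deviations are held at 1), n is the number of observations, and P_{k,i} corresponds to the vectors P_1 and P_2 in the code.

```latex
% Mixture model for each observation x_i, with \tau_1 + \tau_2 = 1
p(x_i) = \tau_1 \, \phi(x_i - \mu_1) + \tau_2 \, \phi(x_i - \mu_2)

% E-step: posterior probability that x_i came from component k
P_{k,i} = \frac{\tau_k \, \phi(x_i - \mu_k)}
               {\tau_1 \, \phi(x_i - \mu_1) + \tau_2 \, \phi(x_i - \mu_2)},
\qquad k = 1, 2

% M-step: updated mixing proportions and component means
\tau_k = \frac{1}{n} \sum_{i=1}^{n} P_{k,i},
\qquad
\mu_k = \frac{\sum_{i=1}^{n} P_{k,i} \, x_i}{\sum_{i=1}^{n} P_{k,i}}
```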
## initial guesses for the component means (standard deviations stay at 1)
mu_1 <- 0
mu_2 <- 1
## initial guesses for the mixing proportions
tau_1 <- 0.5
tau_2 <- 0.5

for (i in 1:10) {
  ## E-step: given the observed data and the current parameters, compute each
  ## point's posterior probability of belonging to each component
  T_1 <- tau_1 * dnorm(x, mu_1)
  T_2 <- tau_2 * dnorm(x, mu_2)
  P_1 <- T_1 / (T_1 + T_2)
  P_2 <- T_2 / (T_1 + T_2)  ## note: P_2 = 1 - P_1
  ## M-step: given the observed data and the posterior probabilities, update
  ## the mixing proportions and the component means
  tau_1 <- mean(P_1)
  tau_2 <- mean(P_2)
  mu_1 <- sum(P_1 * x) / sum(P_1)
  mu_2 <- sum(P_2 * x) / sum(P_2)
  print(c(mu_1, mu_2, mean(P_1)))  ## columns: mu_1, mu_2, tau_1
}
## [1] 0.5045618 6.1011529 0.1002794
## [1] 0.8546336 6.9403680 0.2301181
## [1] 0.9732251 7.0006108 0.2423406
## [1] 0.9853947 7.0054109 0.2434347
## [1] 0.9864849 7.0058260 0.2435309
## [1] 0.9865811 7.0058624 0.2435394
## [1] 0.9865895 7.0058656 0.2435401
## [1] 0.9865903 7.0058659 0.2435402
## [1] 0.9865903 7.0058660 0.2435402
## [1] 0.9865904 7.0058660 0.2435402
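The fixed count of 10 iterations could instead be replaced by a stopping rule. The following is a minimal sketch rather than the author's code: `em_two_normals`, `tol`, and `max_iter` are hypothetical names, and the data are re-simulated in vectorized form, so the random draws (and therefore the estimates) will not match the run above exactly. The loop stops once the observed-data log-likelihood improves by less than `tol`.

```r
# Hypothetical helper (not from the original): EM for a two-component normal
# mixture with both standard deviations fixed at 1, as in the loop above.
em_two_normals <- function(x, mu = c(0, 1), tau = c(0.5, 0.5),
                           tol = 1e-8, max_iter = 500) {
  loglik_old <- -Inf
  for (iter in seq_len(max_iter)) {
    # E-step: posterior membership probabilities under the current parameters
    T_1 <- tau[1] * dnorm(x, mu[1])
    T_2 <- tau[2] * dnorm(x, mu[2])
    P_1 <- T_1 / (T_1 + T_2)
    P_2 <- 1 - P_1
    # Log-likelihood of the parameters that were used in this E-step
    loglik <- sum(log(T_1 + T_2))
    # M-step: update the mixing proportions and the means
    tau <- c(mean(P_1), mean(P_2))
    mu  <- c(sum(P_1 * x) / sum(P_1), sum(P_2 * x) / sum(P_2))
    # Stop when the log-likelihood has (almost) stopped increasing
    if (loglik - loglik_old < tol) break
    loglik_old <- loglik
  }
  list(mu = mu, tau = tau, loglik = loglik, iterations = iter)
}

# Assumed re-simulation (vectorized, so not the same random stream as above)
set.seed(123)
n <- 1000
is_heads <- runif(n) < 0.25
x_sim <- rnorm(n, mean = ifelse(is_heads, 1, 7))

fit <- em_two_normals(x_sim)
print(fit$mu)          # estimated means
print(fit$tau)         # estimated mixing proportions
print(fit$iterations)  # iterations used before the stopping rule fired
```

Monitoring the log-likelihood is one common choice; stopping on the change in the parameter estimates themselves would work similarly for this example.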

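set.seed(123)
tau_true <- 0.25          ## true probability of the "heads" component
x <- y <- rep(0, 1000)    ## x: observations, y: true component labels
for (i in 1:1000) {
  if (runif(1) < tau_true) {
    x[i] <- rnorm(1, mean = 1); y[i] <- "heads"   ## component 1: N(1, 1)
  } else {
    x[i] <- rnorm(1, mean = 4); y[i] <- "tails"   ## component 2: N(4, 1), closer to component 1 than before
  }
}
densityplot(~x, par.settings = list(plot.symbol = list(col = as.factor(y))))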

[Figure: density plot of x for the second simulation. Vertical axis: Density (0.00 to 0.25); horizontal axis: x.]
## same initial guesses as in the first example
mu_1 <- 0
mu_2 <- 1
tau_1 <- 0.5
tau_2 <- 0.5
for (i in 1:30) {
  ## E-step: given the observed data and the current parameters, compute each
  ## point's posterior probability of belonging to each component
  T_1 <- tau_1 * dnorm(x, mu_1)
  T_2 <- tau_2 * dnorm(x, mu_2)
  P_1 <- T_1 / (T_1 + T_2)
  P_2 <- T_2 / (T_1 + T_2)  ## note: P_2 = 1 - P_1
  ## M-step: given the observed data and the posterior probabilities, update
  ## the mixing proportions and the component means
  tau_1 <- mean(P_1)
  tau_2 <- mean(P_2)
  mu_1 <- sum(P_1 * x) / sum(P_1)
  mu_2 <- sum(P_2 * x) / sum(P_2)
  print(c(mu_1, mu_2, mean(P_1)))  ## columns: mu_1, mu_2, tau_1
}
## [1] 1.0835357 3.6048714 0.1320495
## [1] 0.6797230 3.8663167 0.1865272
## [1] 0.7320122 3.9306341 0.2059336
## [1] 0.7910984 3.9574819 0.2165093
## [1] 0.8298998 3.9730967 0.2230743
## [1] 0.8545108 3.9827182 0.2272189
## [1] 0.8701122 3.9887344 0.2298464
## [1] 0.8800221 3.9925240 0.2315159
## [1] 0.8863270 3.9949222 0.2325783
## [1] 0.8903429 3.9964445 0.2332551
## [1] 0.8929026 3.9974127 0.2336866
## [1] 0.8945350 3.9980293 0.2339618
## [1] 0.8955764 3.9984223 0.2341373
## [1] 0.8962408 3.9986729 0.2342493
## [1] 0.8966648 3.9988327 0.2343208
## [1] 0.8969354 3.9989347 0.2343664
## [1] 0.8971081 3.9989998 0.2343955
## [1] 0.8972184 3.9990414 0.2344141
## [1] 0.8972887 3.9990679 0.2344260
## [1] 0.8973336 3.9990848 0.2344335
## [1] 0.8973623 3.9990956 0.2344384
## [1] 0.8973806 3.9991025 0.2344414
## [1] 0.8973922 3.9991069 0.2344434
## [1] 0.8973997 3.9991097 0.2344447
## [1] 0.8974045 3.9991115 0.2344455
## [1] 0.8974075 3.9991126 0.2344460
## [1] 0.8974094 3.9991134 0.2344463
## [1] 0.8974107 3.9991138 0.2344465
## [1] 0.8974115 3.9991141 0.2344466
## [1] 0.8974120 3.9991143 0.2344467
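Because P_1 and P_2 are per-observation posterior probabilities, they can also be used to assign each point to a component and compare the result with the simulated labels. The sketch below makes several assumptions: the data are re-simulated in vectorized form (so the draws differ from those above), and the 0.5 cutoff and the names `labels`, `predicted`, and `accuracy` are illustrative choices that do not appear in the original.

```r
# Assumed re-simulation of the second example: 25% N(1, 1), 75% N(4, 1)
set.seed(123)
n <- 1000
labels <- ifelse(runif(n) < 0.25, "heads", "tails")
x_sim <- rnorm(n, mean = ifelse(labels == "heads", 1, 4))

# Same updates as the loop above, run for 30 iterations
mu_1 <- 0; mu_2 <- 1
tau_1 <- 0.5; tau_2 <- 0.5
for (i in 1:30) {
  # E-step: posterior membership probabilities
  T_1 <- tau_1 * dnorm(x_sim, mu_1)
  T_2 <- tau_2 * dnorm(x_sim, mu_2)
  P_1 <- T_1 / (T_1 + T_2)
  P_2 <- 1 - P_1
  # M-step: mixing proportions and means
  tau_1 <- mean(P_1); tau_2 <- mean(P_2)
  mu_1 <- sum(P_1 * x_sim) / sum(P_1)
  mu_2 <- sum(P_2 * x_sim) / sum(P_2)
}

# Hard assignment from the last E-step: component 1 is the lower-mean
# ("heads") component for these starting values
predicted <- ifelse(P_1 > 0.5, "heads", "tails")
accuracy <- mean(predicted == labels)

print(table(predicted, labels))  # confusion table against the true labels
print(accuracy)                  # share of points assigned to the right component
```

With means only three standard deviations apart, some overlap between the components is unavoidable, so perfect agreement with the true labels should not be expected.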

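library(mixtools)  ## provides normalmixEM
## same starting means as the hand-coded loop; sd.constr = c(1,1) holds both standard deviations at 1
myEM <- normalmixEM( x, mu = c(0,1), sigma=c(1,1), sd.constr=c(1,1) )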

## number of iterations= 21
myEM$mu ## the means of the two distributions

## [1] 0.8974058 3.9991120

myEM$lambda ## the mixing probabilities

## [1] 0.2344461 0.7655539
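As a quick arithmetic check, the final line of the hand-coded loop can be compared with the `normalmixEM` result. The snippet below only copies the numbers printed above; the names `hand_coded`, `from_normalmixEM`, and `abs_diff` are introduced here for illustration.

```r
# Values copied from the printed output above (not recomputed here)
hand_coded       <- c(mu_1 = 0.8974120, mu_2 = 3.9991143, tau_1 = 0.2344467)  # iteration 30
from_normalmixEM <- c(mu_1 = 0.8974058, mu_2 = 3.9991120, tau_1 = 0.2344461)  # myEM$mu, myEM$lambda[1]

# Absolute difference for each parameter
abs_diff <- abs(hand_coded - from_normalmixEM)
print(abs_diff)
print(all(abs_diff < 1e-5))
```

The two fits agree to about five decimal places, which is consistent with both procedures having essentially converged to the same solution.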
