
Expectation-maximisation algorithm

Maximum likelihood estimation (MLE) is a method for estimating the parameters of a
statistical model, given observations (see Section 6.5 for details). The method attempts
to find the parameter values that maximize the likelihood function, or equivalently the
log-likelihood function, given the observations.

The expectation-maximisation algorithm (often abbreviated as the EM algorithm) is used
to find maximum likelihood estimates of the parameters of a statistical model in cases
where the likelihood equations cannot be solved directly. These models generally involve
latent or unobserved variables in addition to unknown parameters and known data
observations. For example, a Gaussian mixture model can be described by assuming that
each observed data point has a corresponding unobserved data point, or latent variable,
specifying the mixture component to which it belongs.

In the case of Gaussian mixture problems, the log-likelihood contains the logarithm of a
sum over the mixture components, so finding a maximum likelihood estimate by taking the
derivatives of the log-likelihood function with respect to all the parameters and
simultaneously solving the resulting equations is not possible in closed form. We
therefore apply the EM algorithm to solve the problem.

As already indicated, the EM algorithm is a general procedure for estimating the
parameters in a statistical model.
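
As a concrete illustration, the following is a minimal NumPy sketch of EM for a
one-dimensional, two-component Gaussian mixture. The synthetic data, the initial
parameter values, and the fixed number of iterations are assumptions made purely for
this example, not part of the text above.

import numpy as np

def em_gmm_1d(x, n_iter=50):
    # EM for a two-component 1-D Gaussian mixture (illustrative sketch).
    pi = 0.5                               # assumed initial mixing weight of component 1
    mu = np.array([x.min(), x.max()])      # assumed initial component means
    var = np.array([x.var(), x.var()])     # assumed initial component variances

    for _ in range(n_iter):
        # E-step: responsibility (posterior probability) of each component for each point.
        p1 = pi * np.exp(-(x - mu[0]) ** 2 / (2 * var[0])) / np.sqrt(2 * np.pi * var[0])
        p2 = (1 - pi) * np.exp(-(x - mu[1]) ** 2 / (2 * var[1])) / np.sqrt(2 * np.pi * var[1])
        r1 = p1 / (p1 + p2)
        r2 = 1 - r1

        # M-step: re-estimate the parameters from the responsibilities.
        pi = r1.mean()
        mu = np.array([(r1 * x).sum() / r1.sum(), (r2 * x).sum() / r2.sum()])
        var = np.array([(r1 * (x - mu[0]) ** 2).sum() / r1.sum(),
                        (r2 * (x - mu[1]) ** 2).sum() / r2.sum()])
    return pi, mu, var

# Example usage on synthetic data drawn from two Gaussians.
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(-2.0, 1.0, 300), rng.normal(3.0, 0.5, 200)])
print(em_gmm_1d(x))

Each iteration alternates an E-step, which computes the expected value of the latent
component assignments given the current parameters, and an M-step, which maximizes the
resulting expected log-likelihood over the parameters.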
Dimensionality Reduction
Dimensionality reduction can be defined as a way of converting a higher-dimensional
dataset into a lower-dimensional one while ensuring that it conveys similar information.
These techniques are widely used in machine learning to obtain a better-fitting
predictive model when solving classification and regression problems.

It is commonly used in fields that deal with high-dimensional data, such as speech
recognition, signal processing, bioinformatics, etc. It can also be used for data
visualization, noise reduction, cluster analysis, etc.
Benefits of applying Dimensionality Reduction
• By reducing the dimensions of the features, the space required to store the dataset is
also reduced.
• Less computation and training time is required when the feature dimensions are reduced.
• Reduced feature dimensions make it easier to visualize the data quickly.
• It removes redundant features (if present) by taking care of multicollinearity.

Disadvantages of Dimensionality Reduction

• Some information may be lost due to dimensionality reduction.
• In the PCA dimensionality reduction technique, the number of principal components to
retain is sometimes not known in advance.
Principal Component Analysis

Principal Component Analysis is an unsupervised learning algorithm that is used for
dimensionality reduction in machine learning. It is a statistical process that converts
the observations of correlated features into a set of linearly uncorrelated features
with the help of an orthogonal transformation. These new transformed features are called
the Principal Components. It is one of the popular tools used for exploratory data
analysis and predictive modeling. It is a technique for extracting strong patterns from
a dataset by projecting the data onto the directions of highest variance.

The PCA algorithm is based on some mathematical concepts such as:

• Variance and Covariance
• Eigenvalues and Eigenvectors

Some common terms used in the PCA algorithm:

• Dimensionality: It is the number of features or variables present in the given dataset.
More simply, it is the number of columns present in the dataset.
• Correlation: It signifies how strongly two variables are related to each other, such
that if one changes, the other also changes. The correlation value ranges from -1 to +1:
-1 indicates a perfect inverse (negative) relationship between the variables, and +1
indicates a perfect direct (positive) relationship.
• Orthogonal: It means that the variables are not correlated with each other, and hence
the correlation between each pair of variables is zero.
• Eigenvectors: Given a square matrix M and a non-zero vector v, v is an eigenvector of M
if Mv is a scalar multiple of v.
• Covariance Matrix: A matrix containing the covariance between each pair of variables is
called the covariance matrix (the last two terms are illustrated in the sketch after this
list).
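
As a small numerical check of those last two terms, the sketch below builds the
covariance matrix of an assumed toy dataset and verifies the eigenvector definition with
NumPy; the data values are made up for illustration.

import numpy as np

# Assumed toy data: 5 observations (rows) of 3 features (columns).
X = np.array([[2.5, 2.4, 0.5],
              [0.5, 0.7, 1.9],
              [2.2, 2.9, 0.8],
              [1.9, 2.2, 1.1],
              [3.1, 3.0, 0.3]])

# Covariance matrix: covariance between every pair of features.
M = np.cov(X, rowvar=False)

# Eigenvalues and eigenvectors of the (symmetric) covariance matrix.
eigvals, eigvecs = np.linalg.eigh(M)

# Check the definition: M v equals the eigenvalue times v.
v = eigvecs[:, -1]                          # eigenvector with the largest eigenvalue
print(np.allclose(M @ v, eigvals[-1] * v))  # True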

Principal Components in PCA

As described above, the transformed new features, or the output of PCA, are the Principal
Components. The number of these PCs is less than or equal to the number of original
features present in the dataset. Some properties of these principal components are given
below:

• Each principal component must be a linear combination of the original features.
• These components are orthogonal, i.e., the correlation between any pair of components
is zero.
• The importance of each component decreases when going from 1 to n: the first PC has the
most importance (it captures the most variance), and the nth PC has the least importance.

Steps for PCA algorithm

1. Getting the dataset
Firstly, we need to take the input dataset and divide it into two subparts X and Y, where
X is the training set and Y is the validation set.
2. Representing data in a structure
Now we will represent our dataset in a structured form, such as a two-dimensional matrix
of the independent variable X. Here each row corresponds to a data item, and each column
corresponds to a feature. The number of columns gives the dimensionality of the dataset.
3. Standardizing the data
In this step, we will standardize our dataset. Without standardization, the features in
columns with high variance are treated as more important than the features with lower
variance. If the importance of features should be independent of their variance, we
divide each data item in a column by the standard deviation of that column. The resulting
matrix is named Z.
4. Calculating the covariance of Z
To calculate the covariance matrix of Z, we transpose the matrix Z and multiply the
transpose by Z. The output matrix is the covariance matrix of Z.
5. Calculating the Eigenvalues and Eigenvectors
Now we need to calculate the eigenvalues and eigenvectors of the resultant covariance
matrix of Z. The eigenvectors of the covariance matrix are the directions of the axes
with the most information, and the corresponding eigenvalues measure how much variance
(information) lies along each of those directions.
6. Sorting the Eigenvectors
In this step, we take all the eigenvalues and sort them in decreasing order, from largest
to smallest, and simultaneously sort the corresponding eigenvectors in the matrix P. The
resulting sorted matrix of eigenvectors is named P*.
7. Calculating the new features or Principal Components
Here we calculate the new features. To do this, we multiply the matrix Z by P*. In the
resultant matrix Z*, each observation is a linear combination of the original features,
and the columns of Z* are uncorrelated with each other.
8. Removing less important features from the new dataset
Once the new feature set has been obtained, we decide what to keep and what to remove:
only the relevant or important principal components are kept in the new dataset, and the
unimportant ones are removed. Each of these steps is illustrated in the sketch after this
list.
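
The steps above can be followed almost literally in NumPy. The sketch below is one
possible implementation; the toy input matrix and the choice to keep two components are
assumptions made purely for illustration.

import numpy as np

def pca(X, n_components=2):
    # Steps 2-3: represent the data as a matrix and standardize each column
    # (subtract the column mean and divide by the column standard deviation) to get Z.
    Z = (X - X.mean(axis=0)) / X.std(axis=0)

    # Step 4: covariance matrix of Z (Z transposed multiplied by Z, scaled).
    C = (Z.T @ Z) / (len(Z) - 1)

    # Step 5: eigenvalues and eigenvectors of the covariance matrix.
    eigvals, eigvecs = np.linalg.eigh(C)

    # Step 6: sort the eigenvectors by decreasing eigenvalue to obtain P*.
    order = np.argsort(eigvals)[::-1]
    P_star = eigvecs[:, order]

    # Steps 7-8: project Z onto the sorted eigenvectors and keep only the first
    # n_components columns (the most important principal components).
    Z_star = Z @ P_star
    return Z_star[:, :n_components]

# Example usage on an assumed 5 x 3 dataset.
X = np.array([[2.5, 2.4, 0.5],
              [0.5, 0.7, 1.9],
              [2.2, 2.9, 0.8],
              [1.9, 2.2, 1.1],
              [3.1, 3.0, 0.3]])
print(pca(X, n_components=2))

In practice, the sorted eigenvalues can also be used to decide how many components to
keep, for example by retaining enough components to explain a chosen fraction of the
total variance.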

Applications of Principal Component Analysis

• PCA is mainly used as a dimensionality reduction technique in various AI applications
such as computer vision, image compression, etc.
• It can also be used for finding hidden patterns when the data has high dimensions. Some
fields where PCA is used are finance, data mining, psychology, etc.
