Unit 5 - ML
Gaussian Mixture Models (GMMs) assume that there are a certain number of Gaussian
distributions, and each of these distributions represents a cluster. Hence, a Gaussian Mixture
Model tends to group the data points belonging to a single distribution together.
Let’s say we have three Gaussian distributions – GD1, GD2, and GD3. These have a certain
mean (μ1, μ2, μ3) and variance (σ1, σ2, σ3) value respectively. For a given set of data points,
our GMM would identify the probability of each data point belonging to each of these
distributions.
Gaussian Mixture Models are probabilistic models and use the soft clustering approach for
distributing the points in different clusters.
Suppose we have three clusters denoted by three colours – blue, green, and cyan – and
consider a data point, highlighted in red, that lies deep inside the blue cluster. The probability
of this point being a part of the blue cluster is 1, while the probability of it being a part of the
green or cyan clusters is 0.
Now, consider another point, somewhere in between the blue and cyan clusters. The
probability that this point is a part of the green cluster is 0, while the probabilities that it
belongs to the blue and cyan clusters are 0.2 and 0.8 respectively.
Gaussian Mixture Models use the soft clustering technique for assigning data points to
Gaussian distributions. I’m sure you’re wondering what these distributions are so let me
explain that in the next section.
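The soft-assignment behaviour described above can be sketched with scikit-learn's GaussianMixture (assuming scikit-learn is available; the three synthetic blobs are invented for illustration):

```python
# Sketch of soft clustering with a Gaussian Mixture Model.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Three well-separated 2-D blobs, one per Gaussian component
data = np.vstack([
    rng.normal(loc=[0, 0], scale=0.5, size=(100, 2)),
    rng.normal(loc=[5, 5], scale=0.5, size=(100, 2)),
    rng.normal(loc=[0, 5], scale=0.5, size=(100, 2)),
])

gmm = GaussianMixture(n_components=3, random_state=0).fit(data)

# Soft assignment: a probability for each point under each component
probs = gmm.predict_proba(data)
print(probs[0])          # one entry close to 1 for a point deep inside a blob
print(probs.sum(axis=1)) # each row sums to 1
```

Unlike k-means' hard labels, `predict_proba` returns the per-cluster probabilities discussed above (e.g. 0.2 and 0.8 for a point between two clusters).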
The Gaussian Distribution
The Gaussian distribution (or normal distribution) has a bell-shaped curve, with the data
points symmetrically distributed around the mean value. Gaussian distributions can differ in
their mean (μ) and variance (σ²); remember that the higher the σ value, the greater the
spread.
In a one-dimensional space, the probability density function of a Gaussian distribution is given
by:

f(x | μ, σ²) = (1 / √(2πσ²)) exp(−(x − μ)² / (2σ²))
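As a minimal sketch, the one-dimensional Gaussian density can be evaluated directly from its definition (the helper name is illustrative):

```python
# Evaluate the 1-D Gaussian density f(x | mu, sigma^2).
import math

def gaussian_pdf(x, mu, sigma):
    """Probability density of N(mu, sigma^2) at x."""
    coeff = 1.0 / math.sqrt(2 * math.pi * sigma ** 2)
    return coeff * math.exp(-((x - mu) ** 2) / (2 * sigma ** 2))

# The density peaks at the mean and is symmetric around it
print(gaussian_pdf(0.0, 0.0, 1.0))  # ≈ 0.3989, the standard normal peak
```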
In the case of two variables, instead of a 2D bell-shaped curve, we will have a 3D bell curve as
shown below:
The probability density function would be given by:

f(x | μ, Σ) = (1 / √((2π)² |Σ|)) exp(−(1/2) (x − μ)ᵀ Σ⁻¹ (x − μ))

where x is the input vector, μ is the 2D mean vector, and Σ is the 2×2 covariance matrix. The
covariance would now define the shape of this curve. We can generalize the same for d-
dimensions.
Thus, this multivariate Gaussian model would have x and μ as vectors of length d, and Σ would
be a d x d covariance matrix.
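A sketch of this d-dimensional density in NumPy (the function name and the 2-D test values are illustrative assumptions):

```python
# d-dimensional Gaussian density, with x and mu length-d vectors
# and cov a d x d covariance matrix.
import numpy as np

def multivariate_gaussian_pdf(x, mu, cov):
    d = len(mu)
    diff = x - mu
    norm = 1.0 / np.sqrt((2 * np.pi) ** d * np.linalg.det(cov))
    return norm * np.exp(-0.5 * diff @ np.linalg.inv(cov) @ diff)

x = np.array([0.0, 0.0])
mu = np.array([0.0, 0.0])
cov = np.eye(2)
print(multivariate_gaussian_pdf(x, mu, cov))  # 1/(2π) ≈ 0.1592 at the mean
```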
Hence, for a dataset with d features, we would have a mixture of k Gaussian distributions
(where k is the number of clusters), each with its own mean vector and covariance
matrix.
The mean and variance values are determined using a technique called Expectation-
Maximization (EM).
When fitting a mixture model, we never observe which distribution each point came from;
these unobserved cluster assignments are called latent variables. It is difficult to determine
the right model parameters directly because of these missing variables.
Since we do not have the values for the latent variables, Expectation-Maximization tries to
use the existing data to determine the optimum values for these variables and then finds
the model parameters. Based on these model parameters, we go back and update the values
for the latent variable, and so on.
• E-step: In this step, the available data is used to estimate (guess) the values of the
missing variables
• M-step: Based on the estimated values generated in the E-step, the complete data is
used to update the parameters
Expectation-Maximization is the base of many algorithms, including Gaussian Mixture
Models.
Let’s understand this using another example. I want you to visualize the idea in your mind as
you read along. This will help you better understand what we’re talking about.
Let’s say we want to fit k clusters. This means that there are k Gaussian distributions, with
mean and covariance values μ1, μ2, …, μk and Σ1, Σ2, …, Σk.
Additionally, each distribution has a parameter that defines the fraction of points it accounts
for; in other words, the density (mixing proportion) of distribution i is represented by πi.
Now, we need to find the values for these parameters to define the Gaussian distributions.
We already decided the number of clusters, and randomly assigned the values for the mean,
covariance, and density. Next, we’ll perform the E-step and the M-step!
E-step:
For each point xi, calculate the responsibility ric – the probability that it belongs to
cluster/distribution c = 1, 2, …, k:

ric = πc N(xi | μc, Σc) / ∑j=1..k πj N(xi | μj, Σj)

This value is high when the point is well explained by distribution c, and low otherwise.
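A hedged sketch of the E-step for a one-dimensional mixture (the parameter values are illustrative, not fitted):

```python
# Compute E-step responsibilities for a 1-D Gaussian mixture.
import math

def gaussian_pdf(x, mu, sigma):
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / math.sqrt(2 * math.pi * sigma ** 2)

def e_step(x, pi, mu, sigma):
    """Responsibility of each of the k components for point x."""
    weighted = [p * gaussian_pdf(x, m, s) for p, m, s in zip(pi, mu, sigma)]
    total = sum(weighted)
    return [w / total for w in weighted]

# A point at 0 is far more likely under the component centred at 0
r = e_step(0.0, pi=[0.5, 0.5], mu=[0.0, 5.0], sigma=[1.0, 1.0])
print(r)  # first responsibility close to 1; responsibilities sum to 1
```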
M-step:
Post the E-step, we go back and update the Π, μ and Σ values. These are updated in the
following manner:
1. The new density (mixing proportion) is the ratio of the effective number of points assigned
to the cluster to the total number of points:

πc = (∑i ric) / N

2. The mean and the covariance matrix are updated as responsibility-weighted averages, so a
data point that has a higher probability of being a part of that distribution contributes a
larger portion:

μc = (∑i ric xi) / (∑i ric)
Σc = (∑i ric (xi − μc)(xi − μc)ᵀ) / (∑i ric)
Based on the updated values generated from this step, we calculate the new probabilities for
each data point and update the values iteratively. This process is repeated in order to
maximize the log-likelihood function. Effectively, we can say that k-means considers only the
mean when updating a centroid, while GMM takes into account the mean as well as the
variance of the data!
REINFORCEMENT LEARNING
o Reinforcement Learning is a feedback-based machine learning technique in which an
agent learns to behave in an environment by performing actions and seeing the
results of those actions. For each good action, the agent gets positive feedback, and
for each bad action, the agent gets negative feedback or a penalty.
o Since there is no labeled data, the agent is bound to learn from its experience only.
o RL solves a specific type of problem where decision making is sequential, and the goal
is long-term, such as game-playing, robotics, etc.
o The agent interacts with the environment and explores it by itself. The primary goal of
an agent in reinforcement learning is to improve the performance by getting the
maximum positive rewards.
o The agent learns by a process of hit and trial, and based on this experience, it
learns to perform the task in a better way. Hence, we can say that "Reinforcement
learning is a type of machine learning method where an intelligent agent (computer
program) interacts with the environment and learns to act within that environment."
Terms used in Reinforcement Learning
o Agent(): An entity that can perceive/explore the environment and act upon it.
o Environment(): A situation in which an agent is present or surrounded by. In RL, we
assume the stochastic environment, which means it is random in nature.
o Action(): Actions are the moves taken by an agent within the environment.
o State(): State is a situation returned by the environment after each action taken by
the agent.
o Reward(): A feedback returned to the agent from the environment to evaluate the
action of the agent.
o Policy(): Policy is a strategy applied by the agent for the next action based on the
current state.
o Value(): The expected long-term return with the discount factor, as opposed to the
short-term reward.
o Q-value(): It is mostly similar to the value, but it takes one additional parameter as a
current action (a).
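As a rough sketch of these terms in action, here is tabular Q-learning on a toy five-state chain environment. The environment, the reward of 1 at the rightmost state, and all hyperparameters are invented for illustration (a fairly high exploration rate keeps the toy example fast):

```python
# Toy Q-learning: agent, state, action, reward, and Q-value on a 1-D chain.
import random

N_STATES = 5          # states 0..4; reaching state 4 ends the episode
ACTIONS = [0, 1]      # 0 = move left, 1 = move right
alpha, gamma, eps = 0.1, 0.9, 0.3

def step(state, action):
    """Environment dynamics: move along the chain, reward 1 at the goal."""
    nxt = max(0, state - 1) if action == 0 else min(N_STATES - 1, state + 1)
    reward = 1.0 if nxt == N_STATES - 1 else 0.0
    return nxt, reward, nxt == N_STATES - 1

Q = [[0.0, 0.0] for _ in range(N_STATES)]
random.seed(0)
for _ in range(500):                      # episodes of hit-and-trial learning
    s, done = 0, False
    while not done:
        # Policy: epsilon-greedy over the current Q-values
        if random.random() < eps:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[s][act])
        s2, r, done = step(s, a)
        # Q-update: move Q(s, a) toward reward plus discounted future value
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
        s = s2

policy = [max(ACTIONS, key=lambda act: Q[s][act]) for s in range(N_STATES - 1)]
print(policy)  # greedy action per non-terminal state; learning favours "right"
```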
1. Positive –
Positive reinforcement occurs when an event, occurring because of a particular
behavior, increases the strength and frequency of that behavior. In other
words, it has a positive effect on behavior.
Advantages:
• Maximizes performance
• Sustains change for a long period of time
A drawback is that too much reinforcement can lead to an overload of states,
which can diminish the results.
2. Negative –
Negative reinforcement is the strengthening of a behavior because a
negative condition is stopped or avoided.
Advantages:
• Increases behavior
• Provides defiance of a minimum standard of performance
A drawback is that it provides only enough to meet the minimum behavior.
Practical applications of reinforcement learning include sequential decision-making problems
such as game-playing and robotics.
BAYESIAN NETWORKS
• "A Bayesian network is a probabilistic graphical model which represents a set of variables
and their conditional dependencies using a directed acyclic graph."
• It is also called a Bayes network, belief network, decision network, or Bayesian model.
• Bayesian networks are probabilistic, because these networks are built from a probability
distribution, and also use probability theory for prediction and anomaly detection.
A Bayesian network can be used for building models from data and expert opinions, and it
consists of two parts:
o Causal component (the graph structure)
o Actual numbers (the conditional probabilities)
A Bayesian network graph is made up of nodes and arcs (directed links), where:
o Each node corresponds to a random variable, which can be continuous or discrete.
o Arcs (directed arrows) represent causal relationships or conditional probabilities
between random variables. These directed links connect pairs of nodes in the graph.
A link means that one node directly influences the other; if there is no directed link,
the nodes are independent of each other.
Each node in the Bayesian network has a conditional probability distribution P(Xi | Parent(Xi)),
which determines the effect of the parents on that node.
• Consider an alarm ‘A’ – a node installed in the house of a person ‘gfg’ – which
rings upon two events, burglary ‘B’ and fire ‘F’; these are the parent nodes of
the alarm node. The alarm node, in turn, is the parent of two person nodes,
‘P1’ and ‘P2’.
• Upon a burglary or a fire, ‘P1’ and ‘P2’ call the person ‘gfg’. But there are a few
caveats in this case: sometimes ‘P1’ may forget to call ‘gfg’, even after hearing
the alarm, as he has a tendency to forget things quickly. Similarly, ‘P2’
sometimes fails to call ‘gfg’, as he can only hear the alarm from a certain
distance.
Q) Find the probability that ‘P1’ is true (P1 has called ‘gfg’) and ‘P2’ is true (P2 has called ‘gfg’)
when the alarm ‘A’ rang, but neither a burglary ‘B’ nor a fire ‘F’ occurred.
=> P ( P1, P2, A, ~B, ~F) [ where- P1, P2 & A are ‘true’ events and ‘~B’ & ‘~F’ are ‘false’
events]
[ Note: The values mentioned below are neither calculated nor computed; they are given
(observed) values. ]
Burglary ‘B’ –
• P (B=T) = 0.001 (‘B’ is true i.e burglary has occurred)
• P (B=F) = 0.999 (‘B’ is false i.e burglary has not occurred)
Fire ‘F’ –
• P (F=T) = 0.002 (‘F’ is true i.e fire has occurred)
• P (F=F) = 0.998 (‘F’ is false i.e fire has not occurred)
Alarm ‘A’ –
B F P (A=T) P (A=F)
T T 0.95 0.05
T F 0.94 0.06
F T 0.29 0.71
F F 0.001 0.999
• The alarm ‘A’ node can be ‘true’ or ‘false’ ( i.e may have rung or may not have
rung). It has two parent nodes burglary ‘B’ and fire ‘F’ which can be ‘true’ or
‘false’ (i.e may have occurred or may not have occurred) depending upon
different conditions.
Person ‘P1’ –
A P (P1=T) P (P1=F)
T 0.95 0.05
F 0.05 0.95
• The person ‘P1’ node can be ‘true’ or ‘false’ (i.e may have called the person
‘gfg’ or not) . It has a parent node, the alarm ‘A’, which can be ‘true’ or ‘false’
(i.e may have rung or may not have rung ,upon burglary ‘B’ or fire ‘F’).
Person ‘P2’ –
A P (P2=T) P (P2=F)
T 0.80 0.20
F 0.01 0.99
• The person ‘P2’ node can be ‘true’ or false’ (i.e may have called the person ‘gfg’
or not). It has a parent node, the alarm ‘A’, which can be ‘true’ or ‘false’ (i.e
may have rung or may not have rung, upon burglary ‘B’ or fire ‘F’).
Solution: Using the probability tables above –
With respect to the question, P(P1, P2, A, ~B, ~F), we need the probability of ‘P1’, which we
find with regard to its parent node, the alarm ‘A’. To get the probability of ‘P2’, we likewise
find it with regard to its parent node, the alarm ‘A’.
We find the probability of the alarm ‘A’ node with regard to ‘~B’ and ‘~F’, since burglary ‘B’
and fire ‘F’ are the parent nodes of alarm ‘A’.
From these tables, we can deduce –
P(P1, P2, A, ~B, ~F)
= P(P1 | A) × P(P2 | A) × P(A | ~B, ~F) × P(~B) × P(~F)
= 0.95 × 0.80 × 0.001 × 0.999 × 0.998
≈ 0.00076
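The same calculation can be reproduced in code, with the CPTs stored as plain dictionaries (the values are taken directly from the tables in this example):

```python
# Joint probability P(P1, P2, A, ~B, ~F) from the Bayesian network's CPTs.
p_b = {True: 0.001, False: 0.999}                 # P(Burglary)
p_f = {True: 0.002, False: 0.998}                 # P(Fire)
p_a = {(True, True): 0.95, (True, False): 0.94,   # P(A=T | B, F)
       (False, True): 0.29, (False, False): 0.001}
p_p1 = {True: 0.95, False: 0.05}                  # P(P1=T | A)
p_p2 = {True: 0.80, False: 0.01}                  # P(P2=T | A)

# Chain rule over the network: each node conditioned on its parents
joint = p_p1[True] * p_p2[True] * p_a[(False, False)] * p_b[False] * p_f[False]
print(joint)  # ≈ 0.000758
```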