
Recap

03 July 2023 16:36



Some Examples
01 July 2023 07:47

Example 1 - Coin Toss

Example 2 - Drawing balls from bag

Example 3 - Normal Distribution
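As a minimal, self-contained sketch of Example 1, the likelihood of a coin's heads-probability p given a sequence of observed tosses can be evaluated and maximized numerically (the data and variable names here are illustrative, not from the original notes):

```python
import numpy as np

# Observed tosses: 1 = heads, 0 = tails (illustrative data)
tosses = np.array([1, 0, 1, 1, 0, 1, 1, 1])

def log_likelihood(p, data):
    """Bernoulli log-likelihood of heads-probability p given the data."""
    return np.sum(data * np.log(p) + (1 - data) * np.log(1 - p))

# Evaluate the log-likelihood on a grid of candidate values of p
grid = np.linspace(0.01, 0.99, 99)
p_hat = grid[np.argmax([log_likelihood(p, tosses) for p in grid])]

print(p_hat)          # matches the analytical MLE ...
print(tosses.mean())  # ... which is the sample proportion of heads, k/n
```

The grid search is only for illustration; for a coin the maximum can be found analytically and equals the fraction of heads observed.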

Probability vs. Likelihood
01 July 2023 09:37

Probability: This is a measure of the chance that a certain event will occur out
of all possible events. It's usually presented as a ratio or fraction, and it ranges
from 0 (meaning the event will not happen) to 1 (meaning the event is certain
to happen).

Likelihood: In a statistical context, likelihood is a function that measures the plausibility of a particular parameter value given some observed data. It quantifies how well a specific observed outcome supports specific parameter values.

More Definitions

A probability quantifies how often you observe a certain outcome of a test, given a certain understanding of the underlying data.

A likelihood quantifies how good one’s model is, given a set of data that’s been
observed.

Probabilities describe test outcomes, while likelihoods describe models.
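One way to see the distinction concretely: the same density function acts as a probability when the parameters are fixed and the outcome varies, and as a likelihood when the observed data are fixed and the parameters vary. A minimal sketch using a normal distribution (the numbers are chosen purely for illustration):

```python
from scipy.stats import norm

# Probability view: fix the model (mean=0, std=1), ask about outcomes.
# P(X <= 1.0) for a standard normal:
print(norm.cdf(1.0, loc=0, scale=1))        # ~0.841

# Likelihood view: fix the observed data, ask about parameter values.
x_observed = 1.0
print(norm.pdf(x_observed, loc=0, scale=1)) # likelihood of mean=0 (~0.242)
print(norm.pdf(x_observed, loc=1, scale=1)) # likelihood of mean=1 (~0.399, higher)
```

Here the data point 1.0 is more plausible under a model with mean 1 than under a model with mean 0, which is exactly what the likelihood function expresses.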



Maximum Likelihood Estimation
01 July 2023 10:08

Maximum Likelihood Estimation (MLE) is a method of estimating the parameters of a statistical model given some observed data: it chooses the parameter values under which the observed data are most probable.
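In symbols, for independent observations $x_1, \dots, x_n$ from a model with density $p(x \mid \theta)$, the MLE is the parameter value that maximizes the likelihood, or equivalently the log-likelihood:

$$\hat{\theta}_{\text{MLE}} = \arg\max_{\theta} L(\theta) = \arg\max_{\theta} \prod_{i=1}^{n} p(x_i \mid \theta) = \arg\max_{\theta} \sum_{i=1}^{n} \log p(x_i \mid \theta)$$

Taking the logarithm does not change the maximizer, but turns the product into a sum that is far easier to differentiate and optimize.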



MLE for Normal Distribution
01 July 2023 10:08
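For i.i.d. samples $x_1, \dots, x_n$ from $\mathcal{N}(\mu, \sigma^2)$, setting the derivatives of the log-likelihood to zero gives the standard closed-form estimates $\hat{\mu} = \frac{1}{n}\sum_i x_i$ and $\hat{\sigma}^2 = \frac{1}{n}\sum_i (x_i - \hat{\mu})^2$. A minimal numerical check of this result (the sample data is illustrative):

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
data = rng.normal(loc=5.0, scale=2.0, size=1000)  # illustrative sample

# Closed-form MLE for a normal distribution
mu_hat = data.mean()
sigma_hat = np.sqrt(np.mean((data - mu_hat) ** 2))  # divides by n, not n-1

# scipy's fit performs the same maximum likelihood estimation
mu_fit, sigma_fit = norm.fit(data)
print(mu_hat, sigma_hat)
print(mu_fit, sigma_fit)  # matches the closed-form values
```

Note that the MLE of the variance divides by n rather than n-1, so it is slightly biased; the familiar n-1 version is the unbiased sample variance, not the MLE.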



MLE in Machine Learning
01 July 2023 14:31
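A standard illustration of how MLE appears inside a machine learning model: if a regression model assumes $y_i = \mathbf{w}^\top \mathbf{x}_i + \varepsilon_i$ with Gaussian noise $\varepsilon_i \sim \mathcal{N}(0, \sigma^2)$, then the negative log-likelihood of the data is

$$-\log L(\mathbf{w}) = \frac{1}{2\sigma^2} \sum_{i=1}^{n} \left(y_i - \mathbf{w}^\top \mathbf{x}_i\right)^2 + \frac{n}{2}\log\left(2\pi\sigma^2\right),$$

where the second term does not depend on $\mathbf{w}$. Maximizing the likelihood over $\mathbf{w}$ is therefore exactly minimizing the familiar sum-of-squared-errors loss: ordinary least squares is MLE under a Gaussian noise assumption.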



MLE in Logistic Regression
01 July 2023 14:31
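In logistic regression the model outputs $\hat{p}_i = \sigma(\mathbf{w}^\top \mathbf{x}_i)$ as the probability that $y_i = 1$, and the negative log-likelihood of Bernoulli-distributed labels is exactly the familiar log loss (binary cross-entropy). A minimal sketch, with illustrative data and weights:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative data: features X, binary labels y, candidate weights w
X = np.array([[1.0, 2.0], [1.0, -1.0], [1.0, 0.5], [1.0, 3.0]])
y = np.array([1, 0, 0, 1])
w = np.array([-0.5, 1.0])

p = sigmoid(X @ w)  # predicted P(y=1) for each example

# Bernoulli log-likelihood of the labels under the model
log_likelihood = np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))

# Log loss (binary cross-entropy) is the negative mean log-likelihood,
# so maximizing the likelihood = minimizing the log loss.
log_loss = -log_likelihood / len(y)
print(log_likelihood, log_loss)
```

Training logistic regression by minimizing log loss is therefore maximum likelihood estimation for a Bernoulli model of the labels.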



Tasks
03 July 2023 15:50



Some Important Questions
01 July 2023 17:15

1. Is MLE a general concept applicable to all machine learning algorithms?

Maximum Likelihood Estimation (MLE) is a general statistical concept that can be applied to many machine learning algorithms, particularly those that are parametric (i.e., defined by a set of parameters), but it's not applicable to all machine learning algorithms.

MLE is commonly used in algorithms such as linear regression, logistic regression, and neural networks, among others. These algorithms use MLE to find the optimal values of the parameters that best fit the training data.

However, there are some machine learning algorithms that don't rely on
MLE. For example:

1. Non-parametric methods: Some machine learning methods, such as k-Nearest Neighbors (k-NN) and Decision Trees, are non-parametric and do not make strong assumptions about the underlying data distribution. These methods don't have a fixed set of parameters that can be optimized using MLE.

2. Unsupervised learning algorithms: Some unsupervised learning algorithms, like K-means clustering, use different objective functions, not necessarily tied to a probability distribution.

3. Reinforcement Learning: Reinforcement Learning methods generally don't use MLE, as they are more focused on learning from rewards and punishments over a sequence of actions rather than fitting to a specific data distribution.

2. How is MLE related to the concept of loss functions?

In machine learning, a loss function measures how well a model's predictions align with the actual values. The goal of training a machine learning model is often to find the model parameters that minimize the loss function.



Maximum Likelihood Estimation (MLE) is a method of estimating the
parameters of a statistical model to maximize the likelihood function, which
is conceptually similar to minimizing a loss function. In fact, for many
common models, minimizing the loss function is equivalent to maximizing
the likelihood function.

MLE and the concept of loss functions in machine learning are closely
related. Many common loss functions can be derived from the principle of
maximum likelihood estimation under certain assumptions about the data
or the model. By minimizing these loss functions, we're effectively
performing maximum likelihood estimation.
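The link in symbols: if the loss is defined as the negative log-likelihood, $\mathcal{L}(\theta) = -\log L(\theta)$, then

$$\arg\min_{\theta} \mathcal{L}(\theta) = \arg\min_{\theta} \bigl(-\log L(\theta)\bigr) = \arg\max_{\theta} L(\theta),$$

since the logarithm is monotonically increasing and negation flips minimization into maximization. Mean squared error (under a Gaussian noise model) and cross-entropy (under a Bernoulli or categorical model) are both negative log-likelihoods of this kind.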

3. Then why do loss functions exist? Why don't we just maximize the likelihood?

The confusion arises from the fact that we're using two different
perspectives to look at the same problem.

In many machine learning algorithms, the aim is to minimize the difference between the predicted and actual values, and this is typically represented by a loss function. When we talk about minimizing the loss function, it's essentially the same as saying we're trying to find the model parameters that give us the closest predictions to the actual values.

On the other hand, when we look at the problem from a statistical perspective, we talk in terms of maximizing the likelihood of seeing the observed data given the model parameters. This is represented by a likelihood function.

For many models, these two perspectives are equivalent: minimizing the loss function is the same as maximizing the likelihood function. In fact, many common loss functions can be derived from the principle of MLE under certain assumptions about the data.

So why do we often talk about minimizing the loss function instead of maximizing the likelihood? There are a few reasons:

1. Computational reasons: It's often easier and more computationally efficient to minimize a loss function than to maximize a likelihood function. This is particularly true when working with complex models like neural networks.

2. Generalization: The concept of a loss function is more general and can be applied to a wider range of problems. Not all machine learning problems can be framed in terms of maximizing a likelihood. For example, many non-parametric methods and unsupervised learning algorithms don't involve likelihoods.

3. Flexibility: Loss functions can be easily customized to the specific needs of a problem. For instance, we might want to give more weight to certain types of errors, or we might want to use a loss function that is robust to outliers.

In summary, while the concepts of loss function minimization and maximum likelihood estimation are closely related and often equivalent, the concept of a loss function is more flexible and computationally convenient, which is why it's more commonly used in the machine learning community.

4. Then why study maximum likelihood at all?

The study of Maximum Likelihood Estimation (MLE) is essential for several reasons, despite the prevalence of loss functions in machine learning:

1. Statistical Foundation: MLE provides a strong statistical foundation for understanding machine learning models. It gives a principled way of deriving the loss functions used in many common machine learning algorithms, and it helps us understand why these loss functions work and under what assumptions.

2. Interpretability: The MLE framework gives us a way to interpret our model parameters. The MLEs are the parameters that make the observed data most likely under our model, which can be a powerful way of understanding what our model has learned.

3. Model Comparison: MLE gives us a way to compare different models on the same dataset. This can be done using tools like the Akaike Information Criterion (AIC) or the Bayesian Information Criterion (BIC), which are based on the likelihood function and can help us choose the best model for our data (see the sketch after this list).

4. Generalization to Other Methods: MLE is a specific case of more general methods, like Expectation-Maximization and Bayesian inference, which are used in more complex statistical modelling. Understanding MLE can provide a stepping stone to these more advanced topics.

5. Deeper Understanding: Lastly, understanding MLE can give us a deeper understanding of our models, leading to better intuition, better model selection, and ultimately, better performance on our machine learning tasks.
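As a minimal sketch of point 3, AIC and BIC can be computed directly from a fitted model's maximized log-likelihood, using the standard formulas $\text{AIC} = 2k - 2\log \hat{L}$ and $\text{BIC} = k \log n - 2\log \hat{L}$, where $k$ is the number of parameters and $n$ the number of observations (the fitted normal model below is illustrative):

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)
data = rng.normal(loc=0.0, scale=1.0, size=200)  # illustrative data

# Fit a normal model by maximum likelihood and compute its log-likelihood
mu_hat, sigma_hat = norm.fit(data)
log_lik = np.sum(norm.logpdf(data, loc=mu_hat, scale=sigma_hat))

k, n = 2, len(data)                # 2 parameters: mean and std
aic = 2 * k - 2 * log_lik
bic = k * np.log(n) - 2 * log_lik  # lower AIC/BIC indicates a preferred model
print(aic, bic)
```

Computing the same quantities for several candidate models on the same dataset lets us trade off fit (the log-likelihood) against complexity (the parameter count).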

In short, while you can often get by with a practical understanding of loss
functions and optimization algorithms in applied machine learning,
understanding MLE can be extremely valuable for gaining a deeper
understanding of how and why these models work.
