
Chapter 3 (PR)

The document discusses parameter estimation using maximum-likelihood and Bayesian approaches. For maximum likelihood, parameters are viewed as fixed quantities estimated by maximizing the likelihood of the observed training examples. For Bayesian estimation, parameters are random variables whose prior distribution is transformed into a posterior distribution via Bayes' theorem. When the class-conditional probability density function (pdf) has a parametric form, maximum likelihood estimates the parameters by differentiating the log-likelihood function; for the Gaussian case with unknown mean and variance, the estimates are simply the sample mean and sample variance. Bayesian estimation proceeds in three phases: applying Bayes' formula to derive the posterior pdf of the parameters from their prior pdf and the training examples; integrating over this posterior to obtain the class-conditional pdf; and using the result for prediction.


Chapter 3

Maximum-Likelihood and
Bayesian Parameter Estimation

Pattern Recognition Soochow, Fall Semester 1


Bayes Theorem for Classification

To compute the posterior probability $P(\omega_i \mid \mathbf{x})$, we need to know:

 Prior probability: $P(\omega_i)$
 Likelihood: $p(\mathbf{x} \mid \omega_i)$

The collection of training examples is composed of $c$ data sets $\mathcal{D}_1, \dots, \mathcal{D}_c$
 Each example in $\mathcal{D}_i$ is drawn according to the class-conditional pdf, i.e. $\mathbf{x} \sim p(\mathbf{x} \mid \omega_i)$
 Examples in $\mathcal{D}_i$ are i.i.d. random variables, i.e. independent and identically distributed
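The quantities above can be put together numerically. The sketch below computes the posteriors $P(\omega_i \mid x)$ for a hypothetical two-class univariate problem; the priors and Gaussian class-conditional parameters are made-up illustration values, not anything from the lecture.

```python
import math

def gaussian_pdf(x, mu, sigma):
    """Univariate Gaussian density N(mu, sigma^2) evaluated at x."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def posteriors(x, priors, params):
    """P(w_i | x) = p(x | w_i) P(w_i) / p(x), with the evidence p(x) as normalizer."""
    joint = [gaussian_pdf(x, mu, s) * p for p, (mu, s) in zip(priors, params)]
    evidence = sum(joint)
    return [j / evidence for j in joint]

# Hypothetical two-class problem: priors and (mean, std) per class.
post = posteriors(1.0, priors=[0.6, 0.4], params=[(0.0, 1.0), (2.0, 1.0)])
```

At $x = 1$ both class-conditional densities are equal here, so the posterior ratio reduces to the prior ratio.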



Bayes Theorem for Classification (Cont.)
For prior probability: no difficulty — estimate $P(\omega_i) = |\mathcal{D}_i| / n$ (here $|\cdot|$ returns the cardinality, i.e. number of elements, of a set)

For class-conditional pdf:
Ch. 3  Case I: $p(\mathbf{x} \mid \omega_i)$ has a certain parametric form
e.g. Gaussian: $p(\mathbf{x} \mid \omega_i) \sim N(\boldsymbol{\mu}_i, \Sigma_i)$ (parameters: $\boldsymbol{\theta}_i = (\boldsymbol{\mu}_i, \Sigma_i)$)

To show the dependence of $p(\mathbf{x} \mid \omega_i)$ on $\boldsymbol{\theta}_i$ explicitly, write $p(\mathbf{x} \mid \omega_i, \boldsymbol{\theta}_i)$

Ch. 4  Case II: $p(\mathbf{x} \mid \omega_i)$ doesn't have a parametric form
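The counting estimate of the priors is a one-liner. The class labels below are purely illustrative:

```python
# ML estimate of the priors: P(w_i) = |D_i| / n, i.e. the fraction of
# training examples falling in each class (|.| is set cardinality).
labels = ["w1", "w2", "w1", "w1", "w2"]  # hypothetical class labels
n = len(labels)
priors = {c: labels.count(c) / n for c in sorted(set(labels))}
```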
Estimation Under Parametric Form
Parametric class-conditional pdf: $p(\mathbf{x} \mid \omega_i, \boldsymbol{\theta}_i)$

 Assumption I: Maximum-Likelihood (ML) estimation
View parameters as quantities whose values are fixed but unknown; estimate the parameter values by maximizing the likelihood (probability) of observing the actual training examples

 Assumption II: Bayesian estimation
View parameters as random variables having some known prior distribution; observation of the actual training examples transforms the parameters' prior distribution into a posterior distribution (via Bayes theorem)



Maximum-Likelihood Estimation
Settings
 Likelihood function for each category is governed by a set of fixed but unknown parameters, i.e. $p(\mathbf{x} \mid \omega_i, \boldsymbol{\theta}_i)$
 Task: estimate $\boldsymbol{\theta}_1, \dots, \boldsymbol{\theta}_c$ from $\mathcal{D}_1, \dots, \mathcal{D}_c$
A simplified treatment
 Examples in $\mathcal{D}_i$ give no information about $\boldsymbol{\theta}_j$ ($j \neq i$)
 Work with each category separately, and simplify the notation by dropping the subscripts w.r.t. categories without loss of generality: estimate $\boldsymbol{\theta}$ from $\mathcal{D} = \{\mathbf{x}_1, \dots, \mathbf{x}_n\}$


Maximum-Likelihood Estimation (Cont.)
 Parameters to be estimated: $\boldsymbol{\theta}$
 A set of i.i.d. examples: $\mathcal{D} = \{\mathbf{x}_1, \dots, \mathbf{x}_n\}$
 The objective function: $p(\mathcal{D} \mid \boldsymbol{\theta}) = \prod_{k=1}^{n} p(\mathbf{x}_k \mid \boldsymbol{\theta})$, the likelihood of $\boldsymbol{\theta}$ w.r.t. the set of observed examples
 The maximum-likelihood estimate: $\hat{\boldsymbol{\theta}} = \arg\max_{\boldsymbol{\theta}}\; p(\mathcal{D} \mid \boldsymbol{\theta})$

Intuitively, $\hat{\boldsymbol{\theta}}$ best agrees with the actually observed examples
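To see the "best agreement" idea concretely, the sketch below evaluates the (log-)likelihood of a univariate Gaussian over a grid of candidate means and picks the maximizer; the toy data and the assumption of a known unit variance are illustrative only.

```python
import math

samples = [2.1, 1.9, 2.4, 1.6, 2.0]  # toy data; sigma assumed known = 1

def log_likelihood(mu, data, sigma=1.0):
    """l(theta) = sum_k ln p(x_k | theta) for a univariate Gaussian."""
    return sum(-0.5 * ((x - mu) / sigma) ** 2
               - math.log(sigma * math.sqrt(2 * math.pi)) for x in data)

# Coarse search over candidate means: the maximizer sits at the sample mean.
grid = [i / 1000 for i in range(0, 4001)]  # candidates 0.000 .. 4.000
mu_hat = max(grid, key=lambda m: log_likelihood(m, samples))
```

The grid search is purely didactic; the closed-form answer (the sample mean) is derived on the following slides.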



Maximum-Likelihood Estimation (Cont.)
Gradient Operator (梯度算子)
 Let $\boldsymbol{\theta} = (\theta_1, \dots, \theta_p)^t$ be a $p$-dimensional vector
 Let $l(\boldsymbol{\theta}) = \ln p(\mathcal{D} \mid \boldsymbol{\theta})$ be a $p$-variate real-valued function over $\boldsymbol{\theta}$; $l(\boldsymbol{\theta})$ is named the log-likelihood function
 The gradient operator: $\nabla_{\boldsymbol{\theta}} = \left(\frac{\partial}{\partial \theta_1}, \dots, \frac{\partial}{\partial \theta_p}\right)^t$



Maximum-Likelihood Estimation (Cont.)

$\nabla_{\boldsymbol{\theta}}\, l(\boldsymbol{\theta}) = \sum_{k=1}^{n} \nabla_{\boldsymbol{\theta}} \ln p(\mathbf{x}_k \mid \boldsymbol{\theta})$: a $p$-dimensional vector with each component being a function over $\boldsymbol{\theta}$ (not over $\mathbf{x}_k$)

Necessary condition for the ML estimate:
$\nabla_{\boldsymbol{\theta}}\, l(\boldsymbol{\theta}) = \mathbf{0}$ (a set of $p$ equations)
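The gradient-equals-zero condition can be checked numerically. For a univariate Gaussian with known variance, each component of the gradient w.r.t. $\mu$ is $(x_k - \mu)/\sigma^2$; at the closed-form ML solution the sum vanishes. The data below is a toy example.

```python
# For a Gaussian with known sigma, d l / d mu = sum_k (x_k - mu) / sigma^2.
# Setting it to zero yields the sample mean; verify on toy data.
samples = [0.5, 1.5, 2.5, 3.5]
sigma = 1.0
mu_ml = sum(samples) / len(samples)              # closed-form ML solution
grad = sum((x - mu_ml) / sigma**2 for x in samples)  # should be ~0 at mu_ml
```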



The Gaussian Case: Unknown $\boldsymbol{\mu}$

Suppose $p(\mathbf{x} \mid \boldsymbol{\mu}) \sim N(\boldsymbol{\mu}, \Sigma)$ where $\Sigma$ is known; then
$\ln p(\mathbf{x}_k \mid \boldsymbol{\mu}) = -\tfrac{1}{2} \ln\!\left[(2\pi)^p |\Sigma|\right] - \tfrac{1}{2} (\mathbf{x}_k - \boldsymbol{\mu})^t \Sigma^{-1} (\mathbf{x}_k - \boldsymbol{\mu})$
$\nabla_{\boldsymbol{\mu}} \ln p(\mathbf{x}_k \mid \boldsymbol{\mu}) = \Sigma^{-1} (\mathbf{x}_k - \boldsymbol{\mu})$



The Gaussian Case: Unknown $\boldsymbol{\mu}$ (Cont.)

Necessary condition for the ML estimate $\hat{\boldsymbol{\mu}}$:
$\sum_{k=1}^{n} \Sigma^{-1} (\mathbf{x}_k - \hat{\boldsymbol{\mu}}) = \mathbf{0}$
Multiplying by $\Sigma$ on both sides gives
$\hat{\boldsymbol{\mu}} = \frac{1}{n} \sum_{k=1}^{n} \mathbf{x}_k$

Intuitive result: the ML estimate for the unknown $\boldsymbol{\mu}$ is just the arithmetic average of the training samples — the sample mean.



The Gaussian Case: Unknown $\mu$ and $\sigma^2$

Consider the univariate case: $p(x \mid \boldsymbol{\theta}) \sim N(\mu, \sigma^2)$ with $\boldsymbol{\theta} = (\theta_1, \theta_2)^t = (\mu, \sigma^2)^t$ both unknown



The Gaussian Case: Unknown $\mu$ and $\sigma^2$ (Cont.)

$\ln p(x_k \mid \boldsymbol{\theta}) = -\tfrac{1}{2} \ln(2\pi\theta_2) - \tfrac{1}{2\theta_2} (x_k - \theta_1)^2$

Necessary conditions for the ML estimates $\hat{\theta}_1$ and $\hat{\theta}_2$:
$\sum_{k=1}^{n} \frac{1}{\hat{\theta}_2} (x_k - \hat{\theta}_1) = 0, \qquad -\sum_{k=1}^{n} \frac{1}{2\hat{\theta}_2} + \sum_{k=1}^{n} \frac{(x_k - \hat{\theta}_1)^2}{2\hat{\theta}_2^2} = 0$



The Gaussian Case: Unknown $\mu$ and $\sigma^2$ (Cont.)

ML estimates in the univariate case:
$\hat{\mu} = \frac{1}{n} \sum_{k=1}^{n} x_k, \qquad \hat{\sigma}^2 = \frac{1}{n} \sum_{k=1}^{n} (x_k - \hat{\mu})^2$


The Gaussian Case: Unknown $\boldsymbol{\mu}$ and $\Sigma$ (Cont.)

ML estimates in the multivariate case — intuitive results as well:
$\hat{\boldsymbol{\mu}} = \frac{1}{n} \sum_{k=1}^{n} \mathbf{x}_k$ (arithmetic average of $n$ vectors)
$\hat{\Sigma} = \frac{1}{n} \sum_{k=1}^{n} (\mathbf{x}_k - \hat{\boldsymbol{\mu}})(\mathbf{x}_k - \hat{\boldsymbol{\mu}})^t$ (arithmetic average of $n$ matrices)
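The multivariate averages can be sketched with NumPy; the 2-D data below is illustrative, and dividing by $n$ keeps the (biased) ML form of the covariance.

```python
import numpy as np

# ML estimates in the multivariate case: the sample mean of the n vectors
# and the average of the n outer-product matrices (division by n, so the
# covariance estimate is biased). Toy 2-D data.
X = np.array([[1.0, 2.0], [3.0, 0.0], [2.0, 4.0], [2.0, 2.0]])
n = len(X)
mu_hat = X.mean(axis=0)
Sigma_hat = sum(np.outer(x - mu_hat, x - mu_hat) for x in X) / n
```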



Bayesian Estimation
Settings
 The parametric form of the likelihood function $p(\mathbf{x} \mid \omega_i, \boldsymbol{\theta}_i)$ for each category is known
 However, $\boldsymbol{\theta}_i$ is considered to be a random variable instead of a fixed (but unknown) value

In this case, we can no longer make a single ML estimate $\hat{\boldsymbol{\theta}}_i$ and then infer $P(\omega_i \mid \mathbf{x})$ based on $p(\mathbf{x} \mid \omega_i, \hat{\boldsymbol{\theta}}_i)$ and $P(\omega_i)$

How can we proceed in this situation? Fully exploit the training examples!



Bayesian Estimation (Cont.)

Eq.22 [pp.91]
Two assumptions

Eq.23 [pp.91]



Bayesian Estimation (Cont.)
Key problem: determine $p(\mathbf{x} \mid \mathcal{D})$

Treat each class independently, and simplify the class-conditional pdf notation $p(\mathbf{x} \mid \omega_i, \mathcal{D}_i)$ as $p(\mathbf{x} \mid \mathcal{D})$:

$p(\mathbf{x} \mid \mathcal{D}) = \int p(\mathbf{x}, \boldsymbol{\theta} \mid \mathcal{D})\, d\boldsymbol{\theta} = \int p(\mathbf{x} \mid \boldsymbol{\theta})\, p(\boldsymbol{\theta} \mid \mathcal{D})\, d\boldsymbol{\theta}$

($\boldsymbol{\theta}$ is the random variable w.r.t. the parametric form; $\mathbf{x}$ is independent of $\mathcal{D}$ given $\boldsymbol{\theta}$)


Bayesian Estimation: The General Procedure

Phase I: prior pdf → posterior pdf (for $\boldsymbol{\theta}$)

Combining the parametric form $p(\mathbf{x} \mid \boldsymbol{\theta})$, the training set $\mathcal{D}$, and the prior pdf $p(\boldsymbol{\theta})$ via the Bayes formula yields the posterior pdf:
$p(\boldsymbol{\theta} \mid \mathcal{D}) = \dfrac{p(\mathcal{D} \mid \boldsymbol{\theta})\, p(\boldsymbol{\theta})}{\int p(\mathcal{D} \mid \boldsymbol{\theta})\, p(\boldsymbol{\theta})\, d\boldsymbol{\theta}}$



Bayesian Estimation: The General Procedure (Cont.)

Phase II: posterior pdf (for $\boldsymbol{\theta}$) → class-conditional pdf (for $\mathbf{x}$)

Combining the parametric form $p(\mathbf{x} \mid \boldsymbol{\theta})$ with the posterior pdf via the law of total probability yields the class-conditional pdf:
$p(\mathbf{x} \mid \mathcal{D}) = \int p(\mathbf{x} \mid \boldsymbol{\theta})\, p(\boldsymbol{\theta} \mid \mathcal{D})\, d\boldsymbol{\theta}$

Phase III: prediction with $P(\omega_i \mid \mathbf{x}, \mathcal{D})$ via the Bayes theorem


The Gaussian Case: Unknown $\mu$

Consider the univariate case: $p(x \mid \mu) \sim N(\mu, \sigma^2)$ ($\sigma^2$ is known)

Phase I: prior pdf → posterior pdf (for $\mu$)

 Gaussian parametric form: $p(x \mid \mu) \sim N(\mu, \sigma^2)$
 The prior pdf is assumed to take Gaussian form as well: $p(\mu) \sim N(\mu_0, \sigma_0^2)$ (other forms of prior pdf could be assumed too)

How would $p(\mu \mid \mathcal{D})$ look in this case?



The Gaussian Case: Unknown $\mu$ (Cont.)

$p(\mu \mid \mathcal{D}) = \dfrac{p(\mathcal{D} \mid \mu)\, p(\mu)}{\int p(\mathcal{D} \mid \mu)\, p(\mu)\, d\mu} = \alpha\, p(\mu) \prod_{k=1}^{n} p(x_k \mid \mu)$

($\alpha = 1 \big/ \int p(\mathcal{D} \mid \mu)\, p(\mu)\, d\mu$ is a constant not related to $\mu$; examples in $\mathcal{D}$ are i.i.d.)


The Gaussian Case: Unknown $\mu$ (Cont.)

$p(\mu \mid \mathcal{D})$ is an exponential function of a quadratic function of $\mu$, so $p(\mu \mid \mathcal{D})$ is a normal pdf as well: $p(\mu \mid \mathcal{D}) \sim N(\mu_n, \sigma_n^2)$



The Gaussian Case: Unknown $\mu$ (Cont.)

Equating the coefficients in both forms gives:
$\mu_n = \left(\dfrac{n\sigma_0^2}{n\sigma_0^2 + \sigma^2}\right)\hat{m}_n + \dfrac{\sigma^2}{n\sigma_0^2 + \sigma^2}\,\mu_0, \qquad \sigma_n^2 = \dfrac{\sigma_0^2\, \sigma^2}{n\sigma_0^2 + \sigma^2}$
where $\hat{m}_n = \frac{1}{n} \sum_{k=1}^{n} x_k$ is the sample mean.
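The posterior parameters $\mu_n$ and $\sigma_n^2$ obtained by equating coefficients can be computed directly. The data, prior mean/variance, and known variance below are illustrative values only.

```python
# Conjugate Gaussian update: with a N(mu0, sigma0^2) prior on mu and a
# known data variance sigma^2, the posterior p(mu | D) is N(mu_n, sigma_n^2).
samples = [1.2, 0.8, 1.0, 1.4, 0.6]        # toy observations
n = len(samples)
m_n = sum(samples) / n                     # sample mean
mu0, sigma0_sq = 0.0, 1.0                  # hypothetical prior mean, variance
sigma_sq = 1.0                             # known likelihood variance

denom = n * sigma0_sq + sigma_sq
mu_n = (n * sigma0_sq / denom) * m_n + (sigma_sq / denom) * mu0
sigma_n_sq = sigma0_sq * sigma_sq / denom
```

Note how $\mu_n$ is a convex combination of the sample mean and the prior mean, with the data term dominating as $n$ grows.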



The Gaussian Case: Unknown $\mu$ (Cont.)

Phase II: posterior pdf (for $\mu$) → class-conditional pdf (for $x$)

$p(x \mid \mathcal{D}) = \int p(x \mid \mu)\, p(\mu \mid \mathcal{D})\, d\mu$

How would $p(x \mid \mathcal{D})$ look in this case?



The Gaussian Case: Unknown $\mu$ (Cont.)

$p(x \mid \mathcal{D})$ is an exponential function of a quadratic function of $x$, so it is a normal pdf as well (Eq.36 [pp.95]):
$p(x \mid \mathcal{D}) \sim N(\mu_n, \sigma^2 + \sigma_n^2)$

Then Phase III follows naturally for prediction (Eq.25 [pp.92])
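A sketch of the resulting predictive density: the posterior uncertainty $\sigma_n^2$ simply widens the Gaussian. The posterior parameters below are illustrative numbers, not derived from any particular data set.

```python
import math

# Predictive density after integrating out mu: p(x | D) = N(mu_n, sigma^2
# + sigma_n^2). The extra sigma_n^2 reflects remaining uncertainty about mu.
mu_n, sigma_n_sq = 0.9, 0.2     # hypothetical posterior parameters
sigma_sq = 1.0                  # known likelihood variance

def predictive_pdf(x):
    var = sigma_sq + sigma_n_sq
    return math.exp(-0.5 * (x - mu_n) ** 2 / var) / math.sqrt(2 * math.pi * var)
```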



The Gaussian Case: Unknown $\boldsymbol{\mu}$ (Multivariate)

$p(\mathbf{x} \mid \boldsymbol{\mu}) \sim N(\boldsymbol{\mu}, \Sigma)$ and $p(\boldsymbol{\mu}) \sim N(\boldsymbol{\mu}_0, \Sigma_0)$ ($\Sigma$ is known)



Summary
 Key issue for PR
 Estimate prior and class‐conditional pdf from 
training set
 Basic assumption on training examples: i.i.d.
 Two strategies for the key issue
 Parametric form for class‐conditional pdf
 Maximum likelihood (ML) estimation

 Bayesian estimation

 No parametric form for class‐conditional pdf
Summary (Cont.)
 Maximum likelihood estimation
 Settings: parameters as fixed but unknown values

 The objective function: Log‐likelihood function

 Necessary conditions for ML estimation: gradient 
for the objective function should be zero vector

 The Gaussian case
 Unknown $\mu$
 Unknown $\mu$ and $\Sigma$



Summary (Cont.)
 Bayesian estimation
 Settings: parameters as random variables

 The general procedure
 Phase I: prior pdf → posterior pdf (for $\boldsymbol{\theta}$)

 Phase II: posterior pdf (for $\boldsymbol{\theta}$) → class-conditional pdf (for $\mathbf{x}$)

 Phase III: prediction (Eq.22 [pp.91])

 The Gaussian case
 Unknown $\mu$: univariate and multivariate

