
Dr. Arslan Shaukat
Bayesian Estimation
In MLE, θ was assumed to have a fixed value
In BE, θ is a random variable
Training data allow us to convert a distribution on this variable into a posterior probability density
The computation of the posterior probabilities P(ωi|x) lies at the heart of Bayesian classification
$$P(\omega_i \mid x) = \frac{P(x \mid \omega_i)\,P(\omega_i)}{\sum_{j=1}^{c} P(x \mid \omega_j)\,P(\omega_j)}$$

Given the training sample set D, Bayes' formula can be written as
$$P(\omega_i \mid x, D) = \frac{P(x \mid \omega_i, D)\,P(\omega_i \mid D)}{\sum_{j=1}^{c} P(x \mid \omega_j, D)\,P(\omega_j \mid D)}$$
The training samples D can be used to determine the
class-conditional densities and prior probabilities
Assume that the true values of the a priori
probabilities are known or obtainable from a trivial
calculation; thus we substitute P(ωi) = P(ωi|D)
We can separate the training samples by class into c
subsets D1, ...,Dc, with the samples in Di belonging to ωi
The previous expression can be written as:
$$P(\omega_i \mid x, D) = \frac{P(x \mid \omega_i, D_i)\,P(\omega_i)}{\sum_{j=1}^{c} P(x \mid \omega_j, D_j)\,P(\omega_j)}$$

Like MLE, each class is treated independently, so we can dispense with needless class distinctions and simplify the notation from P(x|ωi, Di) to P(x|D)
P(x) is unknown but has a known parametric form; equivalently, the function p(x|θ) is completely known and only the value of θ is unknown.
Any information we might have about θ prior to
observing the samples is assumed to be contained
in a known prior density p(θ).
Observation of the samples converts this to a
posterior density p(θ|D)
Goal is to compute p(x|D)
Do this by integrating the joint density p(x, θ|D)
over θ. That is,

$$p(x \mid D) = \int p(x, \theta \mid D)\, d\theta = \int p(x \mid \theta, D)\, p(\theta \mid D)\, d\theta = \int p(x \mid \theta)\, p(\theta \mid D)\, d\theta$$


In general, if we are less certain about the exact value of θ,
this equation directs us to average p(x|θ) over the possible
values of θ.
Thus, when the unknown densities have a known parametric
form, the samples exert their influence on p(x|D) through the
posterior density p(θ|D).
The basic problem is: “Compute the posterior density P(θ | D)”, then “Derive P(x | D)”
Using Bayes formula, we have:

$$P(\theta \mid D) = \frac{P(D \mid \theta)\,P(\theta)}{\int P(D \mid \theta)\,P(\theta)\, d\theta}$$
and the independence assumption leads to
$$P(D \mid \theta) = \prod_{k=1}^{n} P(x_k \mid \theta)$$
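A small numerical sketch of this two-step recipe, posterior first and predictive density second (my own illustration, assuming a one-dimensional θ, a Gaussian likelihood with known σ, and toy data; grid sums stand in for the integrals above):

```python
import numpy as np
from scipy.stats import norm

theta = np.linspace(-5.0, 5.0, 2001)          # grid over the unknown parameter
d_theta = theta[1] - theta[0]
D = np.array([0.8, 1.1, 1.4, 0.9])            # toy training samples
sigma = 1.0                                   # known likelihood std dev

# Step 1: posterior p(theta | D) proportional to prod_k p(x_k | theta) * p(theta)
prior = norm.pdf(theta, loc=0.0, scale=2.0)   # known prior p(theta)
log_lik = norm.logpdf(D[:, None], loc=theta[None, :], scale=sigma).sum(axis=0)
unnorm = np.exp(log_lik) * prior
posterior = unnorm / (unnorm.sum() * d_theta)  # normalize so it integrates to 1

# Step 2: predictive p(x | D) = integral of p(x | theta) p(theta | D) d(theta)
x = 1.0                                        # query point
p_x_given_D = np.sum(norm.pdf(x, loc=theta, scale=sigma) * posterior) * d_theta
print(p_x_given_D)
```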
Bayesian Parameter Estimation: General Theory
The P(x|D) computation can be applied to any situation in which the unknown density can be parametrized; the basic assumptions are:
The form of P(x|θ) is assumed known, but the value of θ is not known exactly
Our knowledge about θ is assumed to be contained in a known prior density P(θ)
The rest of our knowledge about θ is contained in a set D of n samples x1, x2, …, xn drawn independently according to the unknown probability density P(x)
$$p(x \mid D) = \int p(x \mid \theta)\, p(\theta \mid D)\, d\theta$$
$$P(\theta \mid D) = \frac{P(D \mid \theta)\,P(\theta)}{\int P(D \mid \theta)\,P(\theta)\, d\theta}, \qquad P(D \mid \theta) = \prod_{k=1}^{n} P(x_k \mid \theta)$$
Bayesian Parameter Estimation: Gaussian Case
Goal: estimate μ using the a posteriori density P(μ|D)
The univariate case: P(μ|D)
μ is the only unknown parameter
P(x|μ) ~ N(μ, σ²)
P(μ) ~ N(μ0, σ0²)
We assume that whatever prior knowledge we might have about μ can be expressed by a known prior density p(μ)
μ0 and σ0² are known
Roughly speaking, μ0 represents our best a priori guess for μ, and σ0² measures our uncertainty about this guess
By Bayes' formula,
$$P(\mu \mid D) = \frac{P(D \mid \mu)\,P(\mu)}{\int P(D \mid \mu)\,P(\mu)\, d\mu} = \alpha \prod_{k=1}^{n} P(x_k \mid \mu)\,P(\mu) \qquad (1)$$
where α is a normalization factor that depends on D but not on μ.
We assume that
$$P(x_k \mid \mu) \sim N(\mu, \sigma^2), \qquad P(\mu) \sim N(\mu_0, \sigma_0^2)$$
Substituting these Gaussian forms into (1), we have
$$P(\mu \mid D) = \alpha \prod_{k=1}^{n} \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left[-\frac{1}{2}\left(\frac{x_k - \mu}{\sigma}\right)^2\right] \frac{1}{\sqrt{2\pi}\,\sigma_0} \exp\left[-\frac{1}{2}\left(\frac{\mu - \mu_0}{\sigma_0}\right)^2\right]$$
If we write P(μ|D) ~ N(μn, σn²), then μn and σn² can be found by equating coefficients in the previous equation with the corresponding coefficients in the Gaussian form:
$$P(\mu \mid D) = \frac{1}{\sqrt{2\pi}\,\sigma_n} \exp\left[-\frac{1}{2}\left(\frac{\mu - \mu_n}{\sigma_n}\right)^2\right]$$
This gives
$$\mu_n = \left(\frac{n\sigma_0^2}{n\sigma_0^2 + \sigma^2}\right)\hat{\mu}_n + \frac{\sigma^2}{n\sigma_0^2 + \sigma^2}\,\mu_0
\qquad\text{and}\qquad
\sigma_n^2 = \frac{\sigma_0^2\,\sigma^2}{n\sigma_0^2 + \sigma^2}$$
where $\hat{\mu}_n$ is the sample mean of the n observations.
μn represents our best guess for μ after observing n samples, and σn² measures our uncertainty about this guess.
Since σn² decreases monotonically with n, approaching σ²/n as n approaches infinity, each additional observation decreases our uncertainty about the true value of μ.
As n increases, p(μ|D) becomes more and more sharply peaked.
This behavior is commonly known as Bayesian learning.
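The closed-form update is easy to check numerically. The following sketch (my own example, assuming a true mean of 2 with σ = σ0 = 1 and μ0 = 0) computes μn and σn² for growing n and shows the posterior variance shrinking roughly like σ²/n:

```python
import numpy as np

def gaussian_mean_posterior(samples, sigma, mu0, sigma0):
    """Closed-form posterior N(mu_n, sigma_n^2) for an unknown Gaussian mean
    with known variance sigma^2 and prior N(mu0, sigma0^2)."""
    samples = np.asarray(samples, dtype=float)
    n = len(samples)
    mu_hat = samples.mean()                        # sample mean
    denom = n * sigma0**2 + sigma**2
    mu_n = (n * sigma0**2 / denom) * mu_hat + (sigma**2 / denom) * mu0
    sigma_n2 = (sigma0**2 * sigma**2) / denom
    return mu_n, sigma_n2

rng = np.random.default_rng(0)
data = rng.normal(loc=2.0, scale=1.0, size=100)    # true mu = 2, sigma = 1
for n in (1, 10, 100):                             # posterior sharpens with n
    mu_n, s_n2 = gaussian_mean_posterior(data[:n], sigma=1.0, mu0=0.0, sigma0=1.0)
    print(n, round(mu_n, 3), round(s_n2, 4))
```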
The Univariate Case P(x |D)
 P( | D) computed
 P(x | D) remains to be computed

P( x | D)   P( x |  ).P(  | D)d is Gaussian

where

4
It provides:
P( x | D) ~ N (  n ,  2   n2 )

(Desired class-conditional density P(x | Dj, j))


Therefore, P(x | Dj, ωj) together with P(ωj), using Bayes' formula, gives the Bayesian classification rule:
$$\max_j \left[ P(\omega_j \mid x, D) \right] \;\Longleftrightarrow\; \max_j \left[ P(x \mid \omega_j, D_j)\,P(\omega_j) \right]$$

Multivariate Case
Assume: p(x | μ) ~ N(μ, Σ) and p(μ) ~ N(μ0, Σ0)
We get: P(μ|D) ~ N(μn, Σn)
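The slides do not spell out μn and Σn; the sketch below uses the standard conjugate-Gaussian expressions that parallel the univariate case, μn = Σ0(Σ0 + Σ/n)⁻¹ μ̂n + (Σ/n)(Σ0 + Σ/n)⁻¹ μ0 and Σn = Σ0(Σ0 + Σ/n)⁻¹ (Σ/n), with toy 2-D data of my own:

```python
import numpy as np

def mvn_mean_posterior(X, Sigma, mu0, Sigma0):
    """Posterior N(mu_n, Sigma_n) for an unknown multivariate Gaussian mean
    with known covariance Sigma and prior N(mu0, Sigma0)."""
    X = np.atleast_2d(X)
    n = X.shape[0]
    mu_hat = X.mean(axis=0)                   # sample mean vector
    A = np.linalg.inv(Sigma0 + Sigma / n)     # shared inverse term
    mu_n = Sigma0 @ A @ mu_hat + (Sigma / n) @ A @ mu0
    Sigma_n = Sigma0 @ A @ (Sigma / n)
    return mu_n, Sigma_n

rng = np.random.default_rng(2)
X = rng.multivariate_normal([1.0, -1.0], np.eye(2), size=50)   # toy 2-D data
mu_n, Sigma_n = mvn_mean_posterior(X, Sigma=np.eye(2),
                                   mu0=np.zeros(2), Sigma0=np.eye(2))
print(mu_n)        # close to the true mean [1, -1]
print(Sigma_n)     # small: uncertainty shrinks with n
```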


Recursive Bayes Learning

Using Bayes' formula with Dⁿ = {x1, . . . , xn} and the factorization p(Dⁿ|θ) = p(xn|θ) p(Dⁿ⁻¹|θ), we obtain the recursion
$$p(\theta \mid D^n) = \frac{p(x_n \mid \theta)\,p(\theta \mid D^{n-1})}{\int p(x_n \mid \theta)\,p(\theta \mid D^{n-1})\, d\theta}$$
This is an incremental or on-line learning method, where learning goes on as the data is collected.
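A minimal sketch of the on-line version for the univariate Gaussian mean (my own illustration; each step treats the previous posterior as the prior for the next sample, which by conjugacy reproduces the batch result):

```python
import numpy as np

def recursive_update(x_k, mu_prev, s2_prev, sigma2):
    """One on-line Bayesian update of the Gaussian-mean posterior:
    the previous posterior N(mu_prev, s2_prev) acts as the prior for x_k."""
    denom = s2_prev + sigma2
    mu_new = (s2_prev * x_k + sigma2 * mu_prev) / denom
    s2_new = (s2_prev * sigma2) / denom
    return mu_new, s2_new

rng = np.random.default_rng(3)
data = rng.normal(2.0, 1.0, 100)
mu, s2 = 0.0, 1.0                     # start from the prior N(0, 1)
for x_k in data:                      # learning goes on as data is collected
    mu, s2 = recursive_update(x_k, mu, s2, sigma2=1.0)
print(round(mu, 3), round(s2, 4))     # matches the batch result on the same data
```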
Difference between the two methods
Computational complexity: maximum likelihood is simpler.
Our confidence in the prior information.
The maximum-likelihood solution must be of the assumed parametric form; the Bayesian solution need not be.
Bayesian methods use more of the available information than maximum likelihood and can therefore give better results.
Bayesian methods exploit the asymmetric information contained in the distribution over θ, while maximum likelihood does not.
Classification Error
To apply these results to multiple classes, separate the training samples into c subsets D1, . . . , Dc, with the samples in Di belonging to class ωi, and then estimate each density p(x|ωi, Di) separately
Different sources of error
Bayes error: due to overlapping class-conditional densities
(related to the features used)
Model error: due to incorrect model
Estimation error: due to estimation of parameters from a
finite sample (can be reduced by increasing the amount of
training data)
Conclusion
The maximum likelihood approach estimates a point in θ space, whereas the Bayesian approach estimates a full distribution.
The Bayesian method has strong theoretical and methodological arguments supporting it, though in practice maximum likelihood is simpler.
When used to design classifiers, the two approaches mostly give the same results.
