Bayesian Learning Note
BAYESIAN LEARNING
Two Roles for Bayesian Methods
Provides practical learning algorithms:
– Naive Bayes learning
– Bayesian belief network learning
– Combine prior knowledge (prior probabilities) with observed data
– Requires prior probabilities
Bayes Theorem
P(h|D) = P(D|h) P(h) / P(D)
– P(h) = prior probability of hypothesis h
– P(D) = prior probability of training data D
– P(h|D) = probability of h given D (posterior)
– P(D|h) = probability of D given h (likelihood)
Choosing Hypotheses
Generally we want the most probable hypothesis given the training data, the maximum a posteriori hypothesis hMAP:
hMAP = argmax over h in H of P(h|D) = argmax over h in H of P(D|h) P(h) / P(D) = argmax over h in H of P(D|h) P(h)
If every hypothesis in H is equally probable a priori, we need only consider P(D|h), giving the maximum likelihood hypothesis:
hML = argmax over h in H of P(D|h)
Bayes Theorem
Does patient have cancer or not?
A patient takes a lab test and the result comes back positive.
The test returns a correct positive result in only 98% of the
cases in which the disease is actually present, and a correct
negative result in only 97% of the cases in which the disease
is not present. Furthermore, .008 of the entire population
have this cancer.
P(cancer) = .008        P(¬cancer) = .992
P(+|cancer) = .98       P(−|cancer) = .02
P(+|¬cancer) = .03      P(−|¬cancer) = .97
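A quick worked check of the MAP decision for a positive test result, written as a minimal Python sketch (variable names are illustrative):

# MAP decision for the cancer example above, using Bayes theorem.
p_cancer = 0.008                  # prior P(cancer)
p_not_cancer = 1 - p_cancer       # prior P(¬cancer) = 0.992
p_pos_given_cancer = 0.98         # P(+|cancer)
p_pos_given_not = 0.03            # P(+|¬cancer)

# Unnormalized posteriors: P(h|+) is proportional to P(+|h) P(h)
score_cancer = p_pos_given_cancer * p_cancer      # = .0078
score_not = p_pos_given_not * p_not_cancer        # = .0298

# Normalize to get P(cancer|+)
p_cancer_given_pos = score_cancer / (score_cancer + score_not)
print(p_cancer_given_pos)         # ≈ 0.21, so hMAP = ¬cancer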
Basic Formulas for Probabilities
Product Rule: probability P(A ∧ B) of a conjunction of
two events A and B:
P(A ∧ B) = P(A | B) P(B) = P(B | A) P(A)
Sum Rule: probability of a disjunction of two events A
and B:
P(A ∨ B) = P(A) + P(B) − P(A ∧ B)
Theorem of total probability: if events A1,…, An are
mutually exclusive with Σ from i=1 to n of P(Ai) = 1, then
P(B) = Σ from i=1 to n of P(B|Ai) P(Ai)
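A tiny numeric sanity check of these three rules, using a made-up joint distribution over two events (the numbers are purely illustrative):

# Check the product, sum, and total-probability rules on a small
# made-up joint distribution P(A=a, B=b).
joint = {(True, True): 0.12, (True, False): 0.28,
         (False, True): 0.18, (False, False): 0.42}

p_a = sum(v for (a, b), v in joint.items() if a)            # P(A)
p_b = sum(v for (a, b), v in joint.items() if b)            # P(B)
p_a_and_b = joint[(True, True)]                             # P(A ∧ B)
p_a_or_b = sum(v for (a, b), v in joint.items() if a or b)  # P(A ∨ B)

assert abs(p_a_and_b - (p_a_and_b / p_b) * p_b) < 1e-9      # product rule
assert abs(p_a_or_b - (p_a + p_b - p_a_and_b)) < 1e-9       # sum rule
p_b_total = joint[(True, True)] / p_a * p_a + joint[(False, True)] / (1 - p_a) * (1 - p_a)
assert abs(p_b - p_b_total) < 1e-9                          # total probability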
BRUTE FORCE MAP HYPOTHESIS LEARNER
1. For each hypothesis h in H, calculate the posterior probability
P(h|D) = P(D|h) P(h) / P(D)
2. Output the hypothesis hMAP with the highest posterior probability
hMAP = argmax over h in H of P(h|D)
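A minimal sketch of this learner, assuming the hypothesis space, prior, and likelihood are supplied as ordinary Python callables (the function names are illustrative):

# Brute-force MAP learner: score every hypothesis by P(D|h) P(h),
# normalize by P(D), and return the hypothesis with the largest posterior.
def brute_force_map(hypotheses, prior, likelihood, data):
    # prior(h) returns P(h); likelihood(data, h) returns P(D|h)
    scores = {h: likelihood(data, h) * prior(h) for h in hypotheses}
    p_data = sum(scores.values())                  # P(D), by total probability
    posteriors = {h: s / p_data for h, s in scores.items()}
    h_map = max(posteriors, key=posteriors.get)
    return h_map, posteriors

Enumerating H explicitly like this is only practical when the hypothesis space is small.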
Naive Bayes Classifier
Along with decision trees, neural networks, and nearest
neighbor, one of the most practical learning methods.
When to use:
– Moderate or large training set available
– Attributes that describe instances are conditionally
independent given classification
Successful applications:
– Diagnosis
– Classifying text documents
Naive Bayes Classifier
Assume the target function is f: X → V, where each instance x is described by attributes a1, a2, …, an.
The most probable target value is:
vMAP = argmax over vj in V of P(vj | a1, a2, …, an) = argmax over vj in V of P(a1, a2, …, an | vj) P(vj)
Naive Bayes assumption (attributes are conditionally independent given the classification):
P(a1, a2, …, an | vj) = Π over i of P(ai | vj)
which gives the Naive Bayes classifier:
vNB = argmax over vj in V of P(vj) Π over i of P(ai | vj)
Naïve Bayes Classifier Example
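As a worked illustration of the rule above, here is a minimal Python sketch on a small made-up weather-style dataset (the data, attribute names, and helper functions are illustrative; no smoothing is applied, to keep the counts easy to follow):

# Naive Bayes on a tiny made-up categorical dataset.
# Each training example is (attribute dict, class label).
from collections import Counter, defaultdict

train = [
    ({"outlook": "sunny",    "wind": "weak"},   "no"),
    ({"outlook": "sunny",    "wind": "strong"}, "no"),
    ({"outlook": "rain",     "wind": "weak"},   "yes"),
    ({"outlook": "overcast", "wind": "weak"},   "yes"),
    ({"outlook": "rain",     "wind": "strong"}, "no"),
    ({"outlook": "overcast", "wind": "strong"}, "yes"),
]

def naive_bayes_learn(examples):
    class_counts = Counter(label for _, label in examples)      # counts of each vj
    cond_counts = defaultdict(Counter)                          # (vj, attr) -> value counts
    for attrs, label in examples:
        for attr, value in attrs.items():
            cond_counts[(label, attr)][value] += 1
    priors = {c: n / len(examples) for c, n in class_counts.items()}   # P(vj)
    return priors, cond_counts, class_counts

def naive_bayes_classify(x, priors, cond_counts, class_counts):
    # vNB = argmax over vj of P(vj) Π over i of P(ai|vj), estimated from counts
    def score(c):
        s = priors[c]
        for attr, value in x.items():
            s *= cond_counts[(c, attr)][value] / class_counts[c]
        return s
    return max(priors, key=score)

priors, cond_counts, class_counts = naive_bayes_learn(train)
print(naive_bayes_classify({"outlook": "sunny", "wind": "weak"},
                           priors, cond_counts, class_counts))   # -> "no"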
Learning to Classify Text (1/4)
Why?
– Learn which news articles are of interest
– Learn to classify web pages by topic
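A minimal sketch of how naive Bayes is commonly applied to text: represent each document as a bag of words and estimate P(word | class) from word counts with add-one smoothing (the toy corpus and names are illustrative):

# Bag-of-words naive Bayes text classifier (illustrative toy corpus).
import math
from collections import Counter, defaultdict

docs = [("buy cheap meds now", "spam"),
        ("meeting agenda for monday", "ham"),
        ("cheap tickets buy now", "spam"),
        ("project meeting notes", "ham")]

def train_text_nb(docs):
    class_counts = Counter(label for _, label in docs)
    word_counts = defaultdict(Counter)          # class -> word -> count
    vocab = set()
    for text, label in docs:
        for w in text.split():
            word_counts[label][w] += 1
            vocab.add(w)
    priors = {c: n / len(docs) for c, n in class_counts.items()}
    return priors, word_counts, vocab

def classify_text_nb(text, priors, word_counts, vocab):
    # Work in log space to avoid underflow; add-one smoothing handles unseen words.
    def log_score(c):
        total = sum(word_counts[c].values())
        s = math.log(priors[c])
        for w in text.split():
            s += math.log((word_counts[c][w] + 1) / (total + len(vocab)))
        return s
    return max(priors, key=log_score)

priors, word_counts, vocab = train_text_nb(docs)
print(classify_text_nb("cheap meds for monday", priors, word_counts, vocab))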
Bayes Optimal Classifier
Bayes optimal classification:
argmax over vj in V of Σ over hi in H of P(vj|hi) P(hi|D)
Example:
P(h1|D) = .4, P(−|h1) = 0, P(+|h1) = 1
P(h2|D) = .3, P(−|h2) = 1, P(+|h2) = 0
P(h3|D) = .3, P(−|h3) = 1, P(+|h3) = 0
therefore
Σ over hi in H of P(+|hi) P(hi|D) = .4
Σ over hi in H of P(−|hi) P(hi|D) = .6
and
argmax over vj in V of Σ over hi in H of P(vj|hi) P(hi|D) = −
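The same computation, as a short Python sketch:

# Bayes optimal classification for the three-hypothesis example above.
posteriors = {"h1": 0.4, "h2": 0.3, "h3": 0.3}      # P(hi|D)
predictions = {"h1": "+", "h2": "-", "h3": "-"}      # each hi predicts + or - deterministically

def bayes_optimal_score(value):
    # Σ over hi of P(value|hi) P(hi|D); here each P(value|hi) is 1 or 0
    return sum(p for h, p in posteriors.items() if predictions[h] == value)

print(bayes_optimal_score("+"), bayes_optimal_score("-"))    # 0.4 and 0.6
print(max(["+", "-"], key=bayes_optimal_score))              # "-" is the Bayes optimal class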
Gibbs Classifier
Bayes optimal classifier provides the best result, but can be
expensive if there are many hypotheses.
Gibbs algorithm:
1. Choose one hypothesis at random, according to P(h|D)
2. Use this to classify new instance
Surprising fact: Assume target concepts are drawn at
random from H according to priors on H. Then:
E[errorGibbs] ≤ 2 E[errorBayesOptimal]
Suppose correct, uniform prior distribution over H, then
– Pick any hypothesis from VS, with uniform probability
– Its expected error no worse than twice Bayes optimal
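A minimal sketch of the Gibbs procedure, assuming the posteriors P(h|D) and each hypothesis's predictions are available (names are illustrative):

# Gibbs classifier: sample a single hypothesis according to P(h|D),
# then use only that hypothesis to classify the new instance.
import random

def gibbs_classify(x, posteriors, predict):
    # posteriors: dict mapping hypothesis -> P(h|D); predict(h, x) -> label
    hypotheses = list(posteriors)
    weights = [posteriors[h] for h in hypotheses]
    h = random.choices(hypotheses, weights=weights, k=1)[0]
    return predict(h, x)

# Reusing the three-hypothesis example: over many calls this returns "+"
# about 40% of the time and "-" about 60% of the time.
print(gibbs_classify(None, {"h1": 0.4, "h2": 0.3, "h3": 0.3},
                     lambda h, x: {"h1": "+", "h2": "-", "h3": "-"}[h]))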
Expectation Maximization (EM)
When to use:
– Data is only partially observable
– Unsupervised clustering (target value unobservable)
– Supervised learning (some instance attributes unobservable)
Some uses:
– Train Bayesian Belief Networks
– Unsupervised clustering (AUTOCLASS)
– Learning Hidden Markov Models
EM Algorithm
Converges to a local maximum likelihood hypothesis h
and provides estimates of the hidden variables zij
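To make the E and M steps concrete, here is a minimal sketch of EM for estimating the means of a mixture of two equal-variance Gaussians, where zij is the expected probability that point xi was generated by component j (the data, variance, and initialization are made up for illustration):

# EM for the means of a mixture of two Gaussians with known, equal variance.
import math
import random

def em_two_gaussians(xs, sigma=1.0, iters=50):
    mu = [min(xs), max(xs)]                     # crude initial guesses for the two means
    for _ in range(iters):
        # E step: expected membership z[i][j] of each point xi in component j
        z = []
        for x in xs:
            w = [math.exp(-(x - m) ** 2 / (2 * sigma ** 2)) for m in mu]
            z.append([wj / sum(w) for wj in w])
        # M step: re-estimate each mean as the z-weighted average of the data
        for j in range(2):
            mu[j] = (sum(z[i][j] * xs[i] for i in range(len(xs)))
                     / sum(z[i][j] for i in range(len(xs))))
    return mu

random.seed(0)
xs = [random.gauss(0, 1) for _ in range(100)] + [random.gauss(5, 1) for _ in range(100)]
print(em_two_gaussians(xs))                     # approximately [0, 5]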
General EM Problem
Given:
– Observed data X = {x1,…, xm}
– Unobserved data Z = {z1,…, zm}
– Parameterized probability distribution P(Y|h), where
Y = {y1,…, ym} is the full data, with yi = xi ∪ zi
h are the parameters
Determine: h that (locally) maximizes E[ln P(Y|h)]
Many uses:
– Train Bayesian belief networks
– Unsupervised clustering (e.g., k-means)
– Hidden Markov Models
GENERAL EM METHOD
Define a likelihood function Q(h'|h) which calculates
Y = X ∪ Z using observed X and current parameters h to
estimate Z:
Q(h'|h) ← E[ln P(Y|h') | h, X]
EM Algorithm:
Estimation (E) step: Calculate Q(h'|h) using the current hypothesis h and the
observed data X to estimate the probability distribution over Y.
Q(h'|h) ← E[ln P(Y|h') | h, X]
Maximization (M) step: Replace hypothesis h by the hypothesis h' that maximizes
this Q function.
h ← argmax over h' of Q(h'|h)