Bayesian Classification
• Digit Recognition (5 or 6)
Classifier
• X1,…,Xn ∈ {0,1} (Black vs. White pixels)
• Y ∈ {5,6} (predict whether a digit is a 5 or a 6)
The Bayes Classifier
• We want the posterior probability of the class given its pixels, P(Y | X1,…,Xn); Bayes' rule (written out below) expresses it in terms of a likelihood and a prior
• (for example: what is the probability that the image represents a 5 given its pixels?)
• Why did this help? Well, we think that we might be able to specify how features
are “generated” by the class
[Diagram: each class label (Y1, Y2, Y3) generating points in the feature space X]
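For reference, the rule behind this classifier is just Bayes' theorem applied to the class label (standard result, stated here in LaTeX notation with the slides' X1,…,Xn and Y):

    P(Y \mid X_1, \dots, X_n) = \frac{P(X_1, \dots, X_n \mid Y)\, P(Y)}{P(X_1, \dots, X_n)}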
The Bayes Classifier
• Let’s expand this for our digit recognition task:
• To classify, we’ll simply compute these two probabilities and predict based on which one is
greater
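Concretely, this is the standard Bayes-classifier decision rule (the evidence term P(X1,…,Xn) is shared by both classes, so it can be dropped when comparing):

    \hat{y} = \arg\max_{y \in \{5, 6\}} P(Y = y \mid X_1, \dots, X_n) = \arg\max_{y \in \{5, 6\}} P(X_1, \dots, X_n \mid Y = y)\, P(Y = y)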
Model Parameters
• For the Bayes classifier, we need to “learn” two functions, the likelihood and the
prior
• How many parameters are required to specify the likelihood for our digit
recognition example?
Model Parameters
• How many parameters are required to specify the likelihood?
• (Supposing that each image is 30x30 pixels)
• 2^900 − 1 parameters for each class value: one probability per possible configuration of the 900 binary pixels, minus one because they must sum to 1
Model Parameters
• The problem with explicitly modeling P(X1,…,Xn|Y) is that there are usually way
too many parameters:
• We’ll run out of space
• We’ll run out of time
• And we’ll need tons of training data (which is usually not available)
The Naïve Bayes Model
• The Naïve Bayes Assumption: Assume that all features are independent given the
class label Y
• Equationally speaking: P(X1,…,Xn | Y) = P(X1 | Y) · P(X2 | Y) · … · P(Xn | Y)
• Parameters to learn:
▪ K·n likelihoods (2 × 900 = 1800 for the digit example)
▪ K priors
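To make the saving explicit, here is the parameter count for the 30x30 binary-pixel example, comparing the full joint likelihood with the Naïve Bayes factorization (my arithmetic, based on the counts on the slides):

    K\,(2^{n} - 1) = 2\,(2^{900} - 1) \quad \text{vs.} \quad K \cdot n = 2 \times 900 = 1800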
Naïve Bayes Training
• Now that we’ve decided to use a Naïve Bayes classifier, we need to train it with some data.
Assume BW images:
• Count-based (maximum-likelihood) estimate: P(Xi = 1 | Y = y) = (# class-y training images with pixel i black) / (# class-y training images)
• Smoothed (m-estimate) version: add m·p to the numerator and m to the denominator
• m = number of values the attribute may take
• p = probability of the i-th attribute value (1/m if equiprobable)
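A minimal training sketch in Python under these assumptions (the function name train_nb and the array shapes are mine): images is an (N, 900) array of 0/1 pixels, labels is a length-N array of class labels (5 or 6), and the m-estimate above is applied with m = 2, p = 1/2.

import numpy as np

def train_nb(images, labels, m=2, p=0.5):
    """Estimate the prior and the per-pixel Bernoulli likelihoods for each class."""
    params = {}
    for c in np.unique(labels):
        X_c = images[labels == c]                          # training images of class c
        n_c = len(X_c)
        prior = n_c / len(images)                          # P(Y = c)
        # m-estimate: (count of pixel i being black in class c + m*p) / (n_c + m)
        p_black = (X_c.sum(axis=0) + m * p) / (n_c + m)    # vector of P(X_i = 1 | Y = c)
        params[c] = (prior, p_black)
    return params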
Smoothing
• A pixel value that never occurs with some class in the training data would otherwise get probability 0, and a single zero wipes out the entire product of likelihoods
• The m·p "virtual counts" above keep every estimated likelihood strictly positive (with p = 1/m and m = the number of attribute values this is Laplace smoothing)
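A small worked instance of the smoothed estimate (the counts here are made up for illustration): suppose pixel i is never black in any of 40 training images of a 5; with m = 2 and p = 1/2,

    P(X_i = 1 \mid Y = 5) = \frac{0 + 2 \cdot \tfrac{1}{2}}{40 + 2} = \frac{1}{42} \approx 0.024 \quad \text{instead of } 0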
Color Images NB Training
• For binary digits each pixel takes only two values; how many values can a color (R, G, B) pixel take?
• Training for color images amounts to either
• estimating the probability of each R, G, B value of each pixel, for each class, or
• estimating a normal distribution (mean and standard deviation) for each of the R, G, B values, for each class
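A sketch of the second (normal distribution) option in Python, under assumed conventions (the name fit_gaussian_params and the array shapes are mine): images is an (N, H, W, 3) array of R, G, B values, labels is a length-N array of class labels.

import numpy as np

def fit_gaussian_params(images, labels):
    """Per class, fit a mean and standard deviation to every pixel's R, G, B values."""
    params = {}
    for c in np.unique(labels):
        X_c = images[labels == c].astype(float)             # class-c images, shape (n_c, H, W, 3)
        mu = X_c.mean(axis=0)                               # per-pixel, per-channel mean
        sigma = X_c.std(axis=0, ddof=1) + 1e-6              # sample std dev, small floor to avoid zeros
        params[c] = (len(X_c) / len(images), mu, sigma)     # (prior, means, std devs)
    return params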
Naïve Bayes Classification
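A minimal classification sketch in Python, matching the black/white training sketch above (the name classify_nb is mine); it works in log space because multiplying 900 small probabilities would underflow:

import numpy as np

def classify_nb(image, params):
    """Return the class with the largest posterior (the shared evidence term is omitted)."""
    x = image.ravel()                                       # 0/1 pixel vector of length 900
    best_class, best_score = None, -np.inf
    for c, (prior, p_black) in params.items():
        # log P(Y = c) + sum_i log P(X_i = x_i | Y = c) for a Bernoulli pixel model
        log_lik = np.sum(x * np.log(p_black) + (1 - x) * np.log(1 - p_black))
        score = np.log(prior) + log_lik
        if score > best_score:
            best_class, best_score = c, score
    return best_class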
Another Example of the Naïve Bayes Classifier
The weather data, with counts and probabilities
outlook             temperature               humidity                  windy              play
          yes  no             yes    no                yes    no                yes  no    yes   no
sunny      2    3              83    85                 86    85        false    6    2     9     5
overcast   4    0              70    80                 96    90        true     3    3
rainy      3    2              68    65                 80    70
                               64    72                 65    95
                               69    71                 70    91
                               75                       80
                               75                       70
                               72                       90
                               81                       75

sunny     2/9  3/5   mean      73    74.6     mean    79.1   86.2       false  6/9  2/5   9/14  5/14
overcast  4/9  0/5   std dev  6.2     7.9     std dev 10.2    9.7       true   3/9  3/5
rainy     3/9  2/5

A new day

outlook   temperature   humidity   windy   play
sunny     cool          high       true    ?

• Likelihood of yes = P(sunny | yes) × P(cool | yes) × P(high | yes) × P(true | yes) × P(yes)
• Likelihood of no = P(sunny | no) × P(cool | no) × P(high | no) × P(true | no) × P(no)
TWO WAYS TO HANDLE CONTINUOUS VALUED ATTRIBUTES
• Treat the values as discrete (discretize them) and estimate a probability for each value, per class
• Assume a probability distribution (typically the normal distribution) and estimate its parameters, mean μ and standard deviation σ, per class
Deriving Normal Distribution
• Let x1, x2, …, xn be the values of a numerical attribute in the training data set
• Mean: μ = (x1 + x2 + … + xn) / n
• Standard deviation: σ = √( Σi (xi − μ)² / (n − 1) )
• Density used as the likelihood of a value x: f(x) = (1 / (σ √(2π))) · exp( −(x − μ)² / (2σ²) )
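As a check against the weather table above, plugging the nine "yes" temperature values (83, 70, 68, 64, 69, 75, 75, 72, 81) into these formulas gives:

    \mu = \frac{83 + 70 + 68 + 64 + 69 + 75 + 75 + 72 + 81}{9} = 73, \qquad \sigma = \sqrt{\frac{(83 - 73)^2 + \dots + (81 - 73)^2}{8}} \approx 6.2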
Given a new case: outlook = sunny, temperature = 66, humidity = 80, windy = true. Posterior probabilities?
• For example, using the probabilities from the table and the normal density f from the previous slide:
• Likelihood of Yes = P(sunny | yes) × f(temperature = 66 | yes) × f(humidity = 80 | yes) × P(windy = true | yes) × P(yes)
• Likelihood of No = P(sunny | no) × f(temperature = 66 | no) × f(humidity = 80 | no) × P(windy = true | no) × P(no)
• What’s nice about Naïve Bayes (and generative models in general) is that it
returns probabilities
• These probabilities can tell us how confident the algorithm is
• Such a confidence level is not immediately available from a decision tree (DT)
Comparison of DT and NB
DT:
1. Greedy heuristic
2. Discriminative model; cannot generate data
3. Automatic feature prioritization
4. Overfitting: needs pruning / stopped growth
5. Support at the leaves
6. No issue with disappearance of values
7. No assumption of independence between features
8. Discretization of continuous values needed
9. Good with lots of data

NB:
1. Statistical
2. Generative model (estimates a probability distribution and can generate data); needs Bayes' theorem to calculate the posteriors
3. Manual feature selection
4. No need for pruning or post-training tuning
5. Probabilities show a confidence level
6. Can suffer from vanishing likelihood probabilities; needs smoothing
7. The NB independence assumption is required
8. A probability distribution can handle real-valued attributes
9. Good with small amounts of data