
Naïve Bayes Classifier

Ke Chen

http://intranet.cs.man.ac.uk/mlo/comp20411/

Modified and extended by Longin Jan Latecki


[email protected]
Outline

• Background
• Probability Basics
• Probabilistic Classification
• Naïve Bayes
• Example: Play Tennis
• Relevant Issues
• Conclusions

2
Background
• There are three methods to establish a classifier
a) Model a classification rule directly
Examples: k-NN, decision trees, perceptron, SVM
b) Model the probability of class memberships given input data
Example: multi-layered perceptron with the cross-entropy cost
c) Make a probabilistic model of data within each class
Examples: naïve Bayes, model-based classifiers
• a) and b) are examples of discriminative classification
• c) is an example of generative classification
• b) and c) are both examples of probabilistic classification

3
Probability Basics
• Prior, conditional and joint probability
  – Prior probability: P(X)
  – Conditional probability: P(X1 | X2), P(X2 | X1)
  – Joint probability: X = (X1, X2), P(X) = P(X1, X2)
  – Relationship: P(X1, X2) = P(X2 | X1) P(X1) = P(X1 | X2) P(X2)
  – Independence: P(X2 | X1) = P(X2), P(X1 | X2) = P(X1), P(X1, X2) = P(X1) P(X2)
• Bayesian Rule

  P(C | X) = P(X | C) P(C) / P(X),   i.e.   Posterior = (Likelihood × Prior) / Evidence

4
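As a quick numeric illustration of the rule above, the sketch below (plain Python, with made-up numbers that are not from the slides) computes the posterior for a two-class problem from a prior and the class-conditional likelihoods; the evidence P(X) is obtained by summing the numerator over the classes.

# Hypothetical prior and likelihood values, purely for illustration.
prior = {"c1": 0.3, "c2": 0.7}          # P(C)
likelihood = {"c1": 0.8, "c2": 0.1}     # P(X = x | C) for one observed x

# Evidence: P(x) = sum over classes of P(x | c) * P(c)
evidence = sum(likelihood[c] * prior[c] for c in prior)

# Posterior: P(c | x) = P(x | c) * P(c) / P(x)
posterior = {c: likelihood[c] * prior[c] / evidence for c in prior}
print(posterior)                         # {'c1': 0.774..., 'c2': 0.225...}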
[Slides 5-7: example by Dieter Fox (figures not reproduced in this text version)]
Probabilistic Classification
• Establishing a probabilistic model for classification
  – Discriminative model: P(C | X), with C = c1, …, cL and X = (X1, …, Xn)
  – Generative model: P(X | C), with C = c1, …, cL and X = (X1, …, Xn)

• MAP classification rule
  – MAP: Maximum A Posteriori
  – Assign x to c* if P(C = c* | X = x) > P(C = c | X = x) for all c ≠ c*, c = c1, …, cL

• Generative classification with the MAP rule
  – Apply the Bayesian rule to convert: P(C | X) = P(X | C) P(C) / P(X) ∝ P(X | C) P(C)

8
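A minimal sketch of the generative MAP rule above, assuming nothing beyond the slide (the helper names are mine): given class priors P(C) and a class-conditional model P(x | C), assign x to the class that maximizes P(x | C) P(C); the evidence P(x) can be dropped because it is the same for every class.

def map_classify(x, priors, class_conditional):
    """Return the class c maximizing P(x | C=c) * P(C=c) (the MAP rule).

    priors: dict mapping class -> P(C=c)
    class_conditional: function (x, c) -> P(x | C=c)
    """
    return max(priors, key=lambda c: class_conditional(x, c) * priors[c])

# Toy usage with a hand-made class-conditional model (illustrative only).
priors = {"c1": 0.5, "c2": 0.5}
cond = lambda x, c: 0.9 if (c == "c1") == (x > 0) else 0.1
print(map_classify(1.2, priors, cond))   # -> 'c1'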
Feature Histograms

P(x)
C1
C2

Slide by Stephen Marsland


x
Posterior Probability
[Figure: posterior probabilities P(C|x) as a function of x. Slide by Stephen Marsland]

Naïve Bayes
• Bayes classification
  P(C | X) ∝ P(X | C) P(C) = P(X1, …, Xn | C) P(C)

  Difficulty: learning the joint probability P(X1, …, Xn | C) directly is impractical, because the number of parameters to estimate grows exponentially with the number of attributes.

• Naïve Bayes classification
  – Assumption: all input attributes are conditionally independent given the class
    P(X1, X2, …, Xn | C) = P(X1 | X2, …, Xn; C) P(X2, …, Xn | C)
                         = P(X1 | C) P(X2, …, Xn | C)
                         = P(X1 | C) P(X2 | C) … P(Xn | C)
  – MAP classification rule: assign x = (x1, …, xn) to c* if
    [P(x1 | c*) … P(xn | c*)] P(c*) > [P(x1 | c) … P(xn | c)] P(c) for all c ≠ c*, c = c1, …, cL

11
Naïve Bayes
• Naïve Bayes Algorithm (for discrete input attributes)
– Learning Phase: Given a training set S,
    For each target value ci (ci = c1, …, cL)
      P̂(C = ci) ← estimate P(C = ci) from the examples in S;
    For every attribute value ajk of each attribute Xj (j = 1, …, n; k = 1, …, Nj)
      P̂(Xj = ajk | C = ci) ← estimate P(Xj = ajk | C = ci) from the examples in S;
    Output: conditional probability tables; for each Xj, Nj × L elements

– Test Phase: Given an unknown instance X' = (a'1, …, a'n),
    look up the tables to assign the label c* to X' if
    [P̂(a'1 | c*) … P̂(a'n | c*)] P̂(c*) > [P̂(a'1 | c) … P̂(a'n | c)] P̂(c) for all c ≠ c*, c = c1, …, cL

  (A short code sketch of both phases follows after this slide.)

12
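A compact sketch of the two phases above for discrete attributes (plain Python; the function and variable names are mine, not from the slides): the learning phase estimates the prior and the conditional probability tables by relative-frequency counting over S, and the test phase multiplies the looked-up probabilities with the class prior and returns the maximizing class.

from collections import Counter, defaultdict

def learn(examples):
    """examples: list of (attribute_tuple, class_label) pairs. Returns (priors, cond)."""
    class_counts = Counter(c for _, c in examples)
    total = len(examples)
    priors = {c: class_counts[c] / total for c in class_counts}      # P^(C = ci)
    joint_counts = defaultdict(int)                                  # counts of (j, ajk, ci)
    for x, c in examples:
        for j, a in enumerate(x):
            joint_counts[(j, a, c)] += 1
    # P^(Xj = ajk | C = ci) estimated as a relative frequency within class ci
    cond = {key: cnt / class_counts[key[2]] for key, cnt in joint_counts.items()}
    return priors, cond

def classify(x, priors, cond):
    """MAP rule: return argmax over c of P^(a1|c) * ... * P^(an|c) * P^(c)."""
    def score(c):
        s = priors[c]
        for j, a in enumerate(x):
            s *= cond.get((j, a, c), 0.0)   # unseen attribute value -> 0 (see slide 16)
        return s
    return max(priors, key=score)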
Example
• Example: Play Tennis (a training set of 14 examples described by Outlook, Temperature, Humidity and Wind; the data table is not reproduced in this text version)

13
Learning Phase
P(Outlook=o | Play=b)
  Outlook     Play=Yes  Play=No
  Sunny       2/9       3/5
  Overcast    4/9       0/5
  Rain        3/9       2/5

P(Temperature=t | Play=b)
  Temperature  Play=Yes  Play=No
  Hot          2/9       2/5
  Mild         4/9       2/5
  Cool         3/9       1/5

P(Humidity=h | Play=b)
  Humidity    Play=Yes  Play=No
  High        3/9       4/5
  Normal      6/9       1/5

P(Wind=w | Play=b)
  Wind        Play=Yes  Play=No
  Strong      3/9       3/5
  Weak        6/9       2/5

P(Play=Yes) = 9/14    P(Play=No) = 5/14


14
Example
• Test Phase
– Given a new instance,
x’=(Outlook=Sunny, Temperature=Cool, Humidity=High, Wind=Strong)
– Look up tables
P(Outlook=Sunny|Play=Yes) = 2/9 P(Outlook=Sunny|Play=No) = 3/5
P(Temperature=Cool|Play=Yes) = 3/9 P(Temperature=Cool|Play=No) = 1/5
P(Humidity=High|Play=Yes) = 3/9 P(Humidity=High|Play=No) = 4/5
P(Wind=Strong|Play=Yes) = 3/9 P(Wind=Strong|Play=No) = 3/5
P(Play=Yes) = 9/14 P(Play=No) = 5/14

– MAP rule
P(Yes|x’): [P(Sunny|Yes)P(Cool|Yes)P(High|Yes)P(Strong|Yes)]P(Play=Yes) = 0.0053
P(No|x’): [P(Sunny|No) P(Cool|No)P(High|No)P(Strong|No)]P(Play=No) = 0.0206

Since P(Yes|x’) < P(No|x’), we label x’ as “No”.

15
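The arithmetic above is easy to verify; the short sketch below (plain Python) multiplies the probabilities looked up from slide 14 with the class priors and reproduces the two unnormalized scores.

# x' = (Outlook=Sunny, Temperature=Cool, Humidity=High, Wind=Strong)
yes = (2/9) * (3/9) * (3/9) * (3/9) * (9/14)   # P(Sunny|Yes)P(Cool|Yes)P(High|Yes)P(Strong|Yes)P(Play=Yes)
no  = (3/5) * (1/5) * (4/5) * (3/5) * (5/14)   # P(Sunny|No)P(Cool|No)P(High|No)P(Strong|No)P(Play=No)
print(round(yes, 4), round(no, 4))             # 0.0053 0.0206  -> predict "No"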
Relevant Issues
• Violation of the Independence Assumption
  – For many real-world tasks, P(X1, …, Xn | C) ≠ P(X1 | C) … P(Xn | C)
  – Nevertheless, naïve Bayes works surprisingly well anyway!
• Zero conditional probability problem
  – If no training example contains the attribute value Xj = ajk, then P̂(Xj = ajk | C = ci) = 0
  – In that case the whole product P̂(x1 | ci) × … × P̂(ajk | ci) × … × P̂(xn | ci) = 0 during the test phase
  – As a remedy, the conditional probabilities are estimated with Laplace smoothing (the m-estimate; a short code sketch follows after this slide):

      P̂(Xj = ajk | C = ci) = (nc + m·p) / (n + m)

      nc : number of training examples for which Xj = ajk and C = ci
      n  : number of training examples for which C = ci
      p  : prior estimate (usually p = 1/t for t possible values of Xj)
      m  : weight given to the prior (number of "virtual" examples, m ≥ 1)
16
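A one-function sketch of the m-estimate above (the function name is mine, not from the slides): nc, n, p and m are exactly the quantities defined on slide 16, and setting m = 0 recovers the plain relative-frequency estimate.

def m_estimate(n_c, n, p, m):
    """Smoothed estimate P^(Xj = ajk | C = ci) = (nc + m*p) / (n + m)."""
    return (n_c + m * p) / (n + m)

# Example: Outlook=Overcast never occurs among the 5 Play=No examples (slide 14),
# and Outlook has t = 3 possible values, so p = 1/3.
print(m_estimate(0, 5, 1/3, 0))   # 0.0       -> the zero-probability problem
print(m_estimate(0, 5, 1/3, 1))   # 0.0555... -> smoothed away with m = 1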
Homework
• Redo the test on Slide 15 using the formula on Slide 16
with m=1.
• Compute P(Play=Yes|x’) and P(Play=No|x’) with m=0
and with m=1 for
x’=(Outlook=Overcast, Temperature=Cool, Humidity=High, Wind=Strong)
Does the result change?

17
Relevant Issues
• Continuous-valued Input Attributes
  – An attribute may take on infinitely many values, so its conditional probability cannot be tabulated
  – Instead, the conditional probability is modeled with the normal (Gaussian) distribution:

      P̂(Xj | C = ci) = 1 / (√(2π) σji) · exp( −(Xj − μji)² / (2 σji²) )

      μji : mean (average) of the attribute values Xj of the examples for which C = ci
      σji : standard deviation of the attribute values Xj of the examples for which C = ci

  – Learning Phase: for X = (X1, …, Xn), C = c1, …, cL
    Output: n × L normal distributions and P(C = ci), i = 1, …, L
  – Test Phase: for X' = (X'1, …, X'n)
    • Calculate the conditional probabilities with all the fitted normal distributions
    • Apply the MAP rule to make a decision

18
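A minimal sketch of the Gaussian model above (plain Python; the function name and the numbers are mine, not from the slides): the learning phase reduces to estimating a mean and a standard deviation per attribute and class, and the test phase evaluates the fitted density at the new attribute value.

import math
from statistics import mean, stdev

def gaussian_pdf(x, mu, sigma):
    """Normal density used as P^(Xj = x | C = ci) with parameters mu_ji, sigma_ji."""
    return math.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

# Learning phase for one attribute Xj and one class ci: fit mu_ji and sigma_ji
# from that class's attribute values (hypothetical numbers, purely illustrative).
values_ci = [20.1, 22.4, 19.8, 21.0]
mu_ji, sigma_ji = mean(values_ci), stdev(values_ci)

# Test phase: evaluate the class-conditional density at a new value, then apply the MAP rule.
print(gaussian_pdf(21.5, mu_ji, sigma_ji))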
Conclusions
• Naïve Bayes is based on the conditional independence assumption
  – Training is very easy and fast: it only requires estimating each attribute's probabilities within each class separately
  – Testing is straightforward: just look up the tables, or compute the conditional probabilities with the fitted normal distributions
• A popular generative model
  – Its performance is competitive with many state-of-the-art classifiers, even when the independence assumption is violated
  – Many successful applications, e.g. spam mail filtering
  – Apart from classification, naïve Bayes can do more…

19
