Lecture 5 Bayesian Classification

Bayesian classification is a statistical classification method that uses Bayes' theorem. It can be used to predict class membership probabilities. The naive Bayesian classifier assumes conditional independence between attributes. It performs comparably to decision trees and neural networks. Bayesian belief networks are graphical models that can represent dependencies between attributes using a directed acyclic graph and conditional probability tables. They allow modeling of conditional independence relationships.


Bayesian Classification

1
Bayesian Classification
 A statistical classifier
 Probabilistic prediction: predicts class membership probabilities
 Based on Bayes’ Theorem
 Naive Bayesian classifier
 Comparable performance with decision tree and selected neural network classifiers
 Accuracy and speed are good when applied to large databases
 Incremental: probability estimates can be updated as new training data arrives

2
Bayesian Classification

 Naïve Bayesian Classifier
 Class conditional independence: the effect of an attribute value on a given class is independent of the values of the other attributes
 Simplifies computations
 Bayesian Belief Networks
 Graphical models
 Represent dependencies among subsets of attributes

3
Bayes’ Theorem: Basics
 Let X be a data sample whose class label is unknown
 Let H be the hypothesis that X belongs to class C
 Classification is to determine P(H|X), the probability that the hypothesis holds given the observed data sample X
 P(H|X) is the posterior probability
 P(H): the prior probability, i.e., the initial probability of H
 P(X): the probability that the sample data is observed
 P(X|H): the probability of observing sample X, given that the hypothesis holds
 Example: X is a round, red fruit; H is the hypothesis that X is an apple

4
Bayes’ Theorem

 Given training data X, the posterior probability of a hypothesis H, P(H|X), follows Bayes’ theorem:

        P(H|X) = P(X|H) P(H) / P(X)
 Predicts X belongs to Ci iff the probability P(Ci|X) is the highest
among all the P(Ck|X) for all the k classes
 Practical difficulty: requires initial knowledge of many probabilities and incurs significant computational cost
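
As a minimal numeric illustration of Bayes’ theorem (a sketch only; the three probabilities below are hypothetical values for the round-red-fruit/apple example, not taken from the lecture):

# Minimal sketch of Bayes' theorem: P(H|X) = P(X|H) * P(H) / P(X).
# The numbers are hypothetical, chosen only to illustrate the formula.

def posterior(p_x_given_h, p_h, p_x):
    """Return P(H|X) from the likelihood P(X|H), prior P(H), and evidence P(X)."""
    return p_x_given_h * p_h / p_x

# H: "the fruit is an apple", X: "the fruit is round and red" (assumed values)
p_h = 0.20          # prior: assume 20% of fruits are apples
p_x_given_h = 0.90  # assume 90% of apples are round and red
p_x = 0.30          # assume 30% of all fruits are round and red

print(posterior(p_x_given_h, p_h, p_x))  # 0.6 = P(apple | round and red)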

5
Naïve Bayesian Classifier
 Let D be a training set of tuples and their associated class
labels, and each tuple is represented by an n-D attribute
vector X = (x1, x2, …, xn)
 Suppose there are m classes C1, C2, …, Cm.
 Classification is to derive the maximum a posteriori probability, i.e., the maximal P(Ci|X)
 This can be derived from Bayes’ theorem

        P(Ci|X) = P(X|Ci) P(Ci) / P(X)

6
Naïve Bayesian Classifier
 Since P(X) is constant for all classes, only

        P(X|Ci) P(Ci)

needs to be maximized
 If all classes are assumed equally likely, it suffices to maximize P(X|Ci)
 A simplifying assumption: attributes are conditionally independent (i.e., no dependence relations between attributes):

        P(X|Ci) = ∏_{k=1}^{n} P(xk|Ci) = P(x1|Ci) × P(x2|Ci) × ... × P(xn|Ci)
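
For the data sample used in the later example, X = (age <= 30, income = medium, student = yes, credit_rating = fair), this product expands to:

        P(X|Ci) = P(age <= 30 | Ci) × P(income = medium | Ci) × P(student = yes | Ci) × P(credit_rating = fair | Ci)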

7
Derivation of Naïve Bayes Classifier

 This greatly reduces the computation cost: only the class distribution needs to be counted
 If Ak is categorical, P(xk|Ci) = sik / si, where sik is the number of tuples in Ci having value xk for Ak and si is the number of training tuples belonging to Ci
 If Ak is continuous-valued, P(xk|Ci) is usually computed based on a Gaussian distribution with mean μ and standard deviation σ:

        P(xk|Ci) = g(xk, μCi, σCi), where

        g(x, μ, σ) = (1 / (√(2π) σ)) · e^(−(x − μ)² / (2σ²))
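
A minimal sketch of the Gaussian density above as it would be used for a continuous attribute; the attribute name and the class statistics are hypothetical, not taken from the lecture’s dataset:

import math

def gaussian(x, mu, sigma):
    """Gaussian density g(x, mu, sigma), used to estimate P(xk|Ci) for a continuous attribute."""
    return (1.0 / (math.sqrt(2.0 * math.pi) * sigma)) * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

# Hypothetical example: within class Ci, attribute "age" has mean 38 and std dev 12
# (in practice these statistics are estimated from the training tuples of Ci).
print(gaussian(30.0, 38.0, 12.0))  # density value used as P(age = 30 | Ci)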

8
Example

Class:
C1: buys_computer = ‘yes’
C2: buys_computer = ‘no’

Data sample:
X = (age <= 30, income = medium, student = yes, credit_rating = fair)

Training data:

age      income   student   credit_rating   buys_computer
<=30     high     no        fair            no
<=30     high     no        excellent       no
31…40    high     no        fair            yes
>40      medium   no        fair            yes
>40      low      yes       fair            yes
>40      low      yes       excellent       no
31…40    low      yes       excellent       yes
<=30     medium   no        fair            no
<=30     low      yes       fair            yes
>40      medium   yes       fair            yes
<=30     medium   yes       excellent       yes
31…40    medium   no        excellent       yes
31…40    high     yes       fair            yes
>40      medium   no        excellent       no

9
Example
 P(Ci): P(buys_computer = “yes”) = 9/14 = 0.643
P(buys_computer = “no”) = 5/14= 0.357
 Compute P(X|Ci) for each class
P(age = “<=30” | buys_computer = “yes”) = 2/9 = 0.222
P(age = “<= 30” | buys_computer = “no”) = 3/5 = 0.6
P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
P(income = “medium” | buys_computer = “no”) = 2/5 = 0.4
P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
P(student = “yes” | buys_computer = “no”) = 1/5 = 0.2
P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.4
 X = (age <= 30 , income = medium, student = yes, credit_rating = fair)
P(X|Ci) : P(X|buys_computer = “yes”) = 0.222 x 0.444 x 0.667 x 0.667 = 0.044
P(X|buys_computer = “no”) = 0.6 x 0.4 x 0.2 x 0.4 = 0.019
P(X|Ci)*P(Ci) : P(X|buys_computer = “yes”) * P(buys_computer = “yes”) = 0.028
P(X|buys_computer = “no”) * P(buys_computer = “no”) = 0.007
Therefore, X belongs to class (“buys_computer = yes”)
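
A minimal sketch that reproduces the hand computation above by counting directly over the 14 training tuples; the tuple encoding is my own, but the resulting values match the slide (0.028 for “yes” vs. 0.007 for “no”):

# Each tuple: (age, income, student, credit_rating, buys_computer)
data = [
    ("<=30", "high", "no", "fair", "no"),          ("<=30", "high", "no", "excellent", "no"),
    ("31...40", "high", "no", "fair", "yes"),      (">40", "medium", "no", "fair", "yes"),
    (">40", "low", "yes", "fair", "yes"),          (">40", "low", "yes", "excellent", "no"),
    ("31...40", "low", "yes", "excellent", "yes"), ("<=30", "medium", "no", "fair", "no"),
    ("<=30", "low", "yes", "fair", "yes"),         (">40", "medium", "yes", "fair", "yes"),
    ("<=30", "medium", "yes", "excellent", "yes"), ("31...40", "medium", "no", "excellent", "yes"),
    ("31...40", "high", "yes", "fair", "yes"),     (">40", "medium", "no", "excellent", "no"),
]

x = ("<=30", "medium", "yes", "fair")  # the sample X to classify

for c in ("yes", "no"):
    tuples_c = [t for t in data if t[4] == c]
    prior = len(tuples_c) / len(data)            # P(Ci)
    likelihood = 1.0
    for k, value in enumerate(x):                # P(X|Ci) = product of P(xk|Ci)
        likelihood *= sum(1 for t in tuples_c if t[k] == value) / len(tuples_c)
    print(c, round(likelihood * prior, 3))       # yes 0.028, no 0.007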

10
Avoiding the 0-Probability Problem
 Naïve Bayesian prediction requires each conditional prob. be
non-zero. Otherwise, the predicted prob. will be zero
        P(X|Ci) = ∏_{k=1}^{n} P(xk|Ci)
 Ex. Suppose a dataset with 1000 tuples, in which income = low (0), income = medium (990), and income = high (10)
 Use the Laplacian correction (or Laplacian estimator)
 Adding 1 to each case:
Prob(income = low) = 1/1003
Prob(income = medium) = 991/1003
Prob(income = high) = 11/1003
 The “corrected” prob. estimates are close to their
“uncorrected” counterparts
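
A minimal sketch of the Laplacian correction on the income counts above: add 1 to each value’s count, then renormalize over the corrected total of 1000 + 3 tuples:

# Raw counts of income values among the 1000 tuples of the class in question.
counts = {"low": 0, "medium": 990, "high": 10}

# Laplacian correction: add 1 to each count, then divide by the corrected total.
total = sum(counts.values()) + len(counts)                    # 1000 + 3 = 1003
smoothed = {value: (c + 1) / total for value, c in counts.items()}

print(smoothed)  # low: 1/1003 ~ 0.001, medium: 991/1003 ~ 0.988, high: 11/1003 ~ 0.011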

11
Naïve Bayesian Classifier
 Advantages
 Easy to implement
 Good results obtained in most of the cases
 Disadvantages
 Assumes class conditional independence, which can cause a loss of accuracy
 In practice, dependencies exist among variables
 E.g., in hospital patient data: profile attributes (age, family history, etc.), symptoms (fever, cough, etc.), and diseases (lung cancer, diabetes, etc.)
 Dependencies among these cannot be modeled by the Naïve Bayesian Classifier

12
Bayesian Belief Networks
 Models dependencies between variables
 Defined by two components
 Directed Acyclic Graph
 Conditional Probability Table (CPT) for each variable
 Bayesian belief network allows a subset of the
variables to be conditionally independent

13
Bayesian Belief Networks
 A graphical model of causal relationships
 Represents dependency among the variables
 Gives a specification of joint probability distribution

 Nodes: random variables
 Links: dependency
 Example (figure): nodes X, Y, Z, P, where X and Y are the parents of Z, and Y is the parent of P
 No dependency between Z and P
 Has no loops or cycles
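
Given that example structure (assuming X and Y themselves have no parents), the joint distribution factorizes over each node’s parents:

        P(X, Y, Z, P) = P(X) · P(Y) · P(Z | X, Y) · P(P | Y)

This is a special case of the general product formula given on the next slide.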

14
Bayesian Belief Network: An Example
 A belief network over the variables FamilyHistory, Smoker, LungCancer, Emphysema, PositiveXRay, and Dyspnea (figure); FamilyHistory and Smoker are the parents of LungCancer
 The conditional probability table (CPT) for the variable LungCancer:

        (FH, S)   (FH, ~S)   (~FH, S)   (~FH, ~S)
LC       0.8       0.5        0.7        0.1
~LC      0.2       0.5        0.3        0.9

 The CPT shows the conditional probability for each possible combination of values of its parents
 Derivation of the probability of a particular combination of values of X from the CPT:

        P(x1, ..., xn) = ∏_{i=1}^{n} P(xi | Parents(Yi))
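
A minimal sketch of using the CPT above: it stores P(LungCancer | FamilyHistory, Smoker) exactly as given on the slide and looks up one factor of the joint-probability product. The CPTs of the other variables are not given in the lecture, so only the LungCancer factor is evaluated here:

# CPT for LungCancer, indexed by the parent combination (FamilyHistory, Smoker).
cpt_lung_cancer = {
    (True, True): 0.8,    # (FH, S)
    (True, False): 0.5,   # (FH, ~S)
    (False, True): 0.7,   # (~FH, S)
    (False, False): 0.1,  # (~FH, ~S)
}

def p_lung_cancer(lc, fh, s):
    """P(LungCancer = lc | FamilyHistory = fh, Smoker = s): one factor of the product formula."""
    p_true = cpt_lung_cancer[(fh, s)]
    return p_true if lc else 1.0 - p_true

print(p_lung_cancer(True, True, True))    # 0.8 = P(LC | FH, S)
print(p_lung_cancer(False, False, True))  # 0.3 = P(~LC | ~FH, S)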

15
Training Bayesian Networks
 Several scenarios:
 Given both the network structure and all variables observable:
learn only the CPTs
 Network structure known, some hidden variables: gradient
descent (greedy hill-climbing) method, analogous to neural
network learning
 Network structure unknown, all variables observable: search
through the model space to reconstruct network topology
 Unknown structure, all hidden variables: No good algorithms
known for this purpose

16
