The document provides an overview of Bayesian Classification, detailing its principles, including Bayes Theorem and the Naïve Bayesian Classifier, which simplifies computations through the assumption of class conditional independence. It also discusses the k-Nearest Neighbor algorithm and Case-Based Reasoning as alternative classification methods, while addressing classifier accuracy and techniques for handling class-imbalanced datasets. The document emphasizes the importance of probabilistic learning and the practical applications of Bayesian methods in data classification.

Classification-Bayesian

Classification
Dr. Manish Kumar
Associate Professor
Chair: Data Analytics Lab & M.Tech (Data Engg.)
Department of Information Technology
Indian Institute of Information Technology-Allahabad, Prayagraj
Bayesian Classification
What are Bayesian Classifiers?
▪ Statistical Classifiers
▪ Predict class membership probabilities
▪ Based on Bayes Theorem
▪ Naïve Bayesian Classifier
▪ Computationally Simple
▪ Comparable performance with decision tree and neural network classifiers
Bayesian Classification
▪ Probabilistic learning: calculates explicit probabilities for a hypothesis; among the most practical approaches to certain types of learning problems
▪ Incremental: each training example can incrementally increase or decrease the probability that a hypothesis is correct. Prior knowledge can be combined with observed data.
Bayes Theorem
▪ Let X be a data sample whose class label is unknown
▪ Let H be some hypothesis that X belongs to a class C
▪ For classification, determine P(H|X)
▪ P(H|X) is the probability that H holds given the observed data sample X
▪ P(H|X) is the posterior probability of H conditioned on X
Bayes Theorem
Example: Sample space: all fruits, described by their color and shape
X is “round” and “red”
H = hypothesis that X is an apple
P(H|X) is our confidence that X is an apple given that X is “round” and “red”
▪ P(H) is the prior probability of H, i.e., the probability that any given data sample is an apple, regardless of how it looks
▪ P(H|X) is based on more information
▪ Note that P(H) is independent of X
Bayes Theorem
Example: Sample space: all fruits
▪ P(X|H)?
▪ It is the probability that X is round and red, given that we know it is true that X is an apple
▪ Here P(X) is the prior probability = P(a data sample from our set of fruits is red and round)
Estimating Probabilities
▪ P(X), P(H), and P(X|H) may be estimated from the given data
▪ Bayes Theorem:
P(H|X) = P(X|H) P(H) / P(X)
▪ This is the theorem used by the Naïve Bayesian Classifier!
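To make the theorem concrete before turning to the Naïve Bayesian Classifier, the short Python sketch below plugs numbers into the fruit example. The three input probabilities are hypothetical values assumed only for illustration; they are not given in the slides.

p_h = 0.20          # P(H): prior probability that a random fruit is an apple (assumed)
p_x_given_h = 0.90  # P(X|H): probability a fruit is round and red, given it is an apple (assumed)
p_x = 0.30          # P(X): prior probability that a random fruit is round and red (assumed)

# Bayes Theorem: P(H|X) = P(X|H) * P(H) / P(X)
p_h_given_x = p_x_given_h * p_h / p_x
print(f"P(H|X) = {p_h_given_x:.3f}")  # 0.600: confidence that X is an apple, given it is round and red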
Naïve Bayesian Classification
▪ Also called the Simple BC
▪ Class conditional independence:
the effect of an attribute value on a given class is independent of the values of the other attributes
▪ This assumption simplifies computations
Naïve Bayesian Classification
Steps Involved
1. Each data sample is of the type X = (x1, x2, …, xn), where xi is the value of X for attribute Ai, i = 1, …, n
2. Suppose there are m classes Ci, i = 1, …, m. Then X ∈ Ci iff
P(Ci|X) > P(Cj|X) for 1 ≤ j ≤ m, j ≠ i
i.e. the BC assigns X to the class Ci having the highest posterior probability conditioned on X
Naïve Bayesian Classification
The class Ci for which P(Ci|X) is maximized is called the maximum posteriori (MAP) hypothesis.
From Bayes Theorem,
P(Ci|X) = P(X|Ci) P(Ci) / P(X)
3. P(X) is constant for all classes, so only P(X|Ci) P(Ci) need be maximized.
▪ If the class prior probabilities are not known, assume all classes to be equally likely, i.e. P(C1) = P(C2) = … = P(Cm), and therefore maximize P(X|Ci)
▪ Otherwise maximize P(X|Ci) P(Ci), where P(Ci) is estimated as si/s (si = number of training samples in class Ci, s = total number of training samples)
Problem: computing P(X|Ci) directly is computationally expensive (may be infeasible)
Naïve Bayesian Classification
4. Naïve assumption: attribute independence (class conditional independence)
P(X|Ci) = P(x1, …, xn|Ci) = Π P(xk|Ci), the product taken over k = 1 to n
5. In order to classify an unknown sample X, evaluate P(X|Ci) P(Ci) for each class Ci. Sample X is assigned to the class Ci iff
P(X|Ci) P(Ci) > P(X|Cj) P(Cj) for 1 ≤ j ≤ m, j ≠ i
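Steps 1-5 can be condensed into a short Python sketch. This is a minimal illustration for categorical attributes, using simple frequency counts as the probability estimates (no smoothing); the function and variable names are placeholders, not taken from the slides.

from collections import Counter

def naive_bayes_classify(train, labels, x):
    # train: list of attribute tuples, labels: class label of each tuple, x: unseen sample
    n = len(labels)
    best_class, best_score = None, -1.0
    for c, count in Counter(labels).items():
        rows = [r for r, lab in zip(train, labels) if lab == c]
        score = count / n                      # P(Ci), estimated from the training data
        for k, value in enumerate(x):          # P(X|Ci) = product over k of P(xk|Ci)
            score *= sum(1 for r in rows if r[k] == value) / count
        if score > best_score:                 # keep the class with the highest P(X|Ci) P(Ci)
            best_class, best_score = c, score
    return best_class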
Naïve Bayesian Classification
Example
Age      Income   Student  Credit_rating  Class: Buys_comp
<=30     HIGH     N        FAIR           N
<=30     HIGH     N        EXCELLENT      N
31..40   HIGH     N        FAIR           Y
>40      MEDIUM   N        FAIR           Y
>40      LOW      Y        FAIR           Y
>40      LOW      Y        EXCELLENT      N
31..40   LOW      Y        EXCELLENT      Y
<=30     MEDIUM   N        FAIR           N
<=30     LOW      Y        FAIR           Y
>40      MEDIUM   Y        FAIR           Y
<=30     MEDIUM   Y        EXCELLENT      Y
31..40   MEDIUM   N        EXCELLENT      Y
31..40   HIGH     Y        FAIR           Y
>40      MEDIUM   N        EXCELLENT      N
Naïve Bayesian Classification
Example
X = (age <= 30, income = MEDIUM, student = Y, credit_rating = FAIR); class = ?
We need to maximize
P(X|Ci) P(Ci) for i = 1, 2.
P(Ci) is computed from the training samples:
P(buys_comp=Y) = 9/14 = 0.643
P(buys_comp=N) = 5/14 = 0.357
How do we calculate P(X|Ci) P(Ci) for i = 1, 2?
P(X|Ci) = P(x1, x2, x3, x4|Ci) = Π P(xk|Ci)
Naïve Bayesian Classification
Example
P(age<=30 | buys_comp=Y)=2/9=0.222
P(age<=30 | buys_comp=N)=3/5=0.600
P(income=medium | buys_comp=Y)=4/9=0.444
P(income=medium | buys_comp=N)=2/5=0.400
P(student=Y | buys_comp=Y)=6/9=0.667
P(student=Y | buys_comp=N)=1/5=0.200
P(credit_rating=FAIR | buys_comp=Y)=6/9=0.667
P(credit_rating=FAIR | buys_comp=N)=2/5=0.400
Naïve Bayesian Classification
Example
P(X | buys_comp=Y) = 0.222 * 0.444 * 0.667 * 0.667 = 0.044
P(X | buys_comp=N) = 0.600 * 0.400 * 0.200 * 0.400 = 0.019
P(X | buys_comp=Y) P(buys_comp=Y) = 0.044 * 0.643 = 0.028
P(X | buys_comp=N) P(buys_comp=N) = 0.019 * 0.357 = 0.007
CONCLUSION: the Bayesian classifier predicts buys_comp=Y for sample X, i.e. X buys a computer.
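The arithmetic above can be checked with a few lines of Python; the sketch below recomputes P(X|Ci) P(Ci) for both classes directly from the training table (variable names are illustrative).

# Attribute order: (age, income, student, credit_rating, buys_comp)
data = [
    ("<=30", "HIGH", "N", "FAIR", "N"),       ("<=30", "HIGH", "N", "EXCELLENT", "N"),
    ("31..40", "HIGH", "N", "FAIR", "Y"),     (">40", "MEDIUM", "N", "FAIR", "Y"),
    (">40", "LOW", "Y", "FAIR", "Y"),         (">40", "LOW", "Y", "EXCELLENT", "N"),
    ("31..40", "LOW", "Y", "EXCELLENT", "Y"), ("<=30", "MEDIUM", "N", "FAIR", "N"),
    ("<=30", "LOW", "Y", "FAIR", "Y"),        (">40", "MEDIUM", "Y", "FAIR", "Y"),
    ("<=30", "MEDIUM", "Y", "EXCELLENT", "Y"),("31..40", "MEDIUM", "N", "EXCELLENT", "Y"),
    ("31..40", "HIGH", "Y", "FAIR", "Y"),     (">40", "MEDIUM", "N", "EXCELLENT", "N"),
]
x = ("<=30", "MEDIUM", "Y", "FAIR")  # the unseen sample X

for c in ("Y", "N"):
    rows = [r for r in data if r[4] == c]
    prior = len(rows) / len(data)                      # P(Ci): 9/14 = 0.643 or 5/14 = 0.357
    likelihood = 1.0
    for k, value in enumerate(x):                      # P(X|Ci) = product of P(xk|Ci)
        likelihood *= sum(1 for r in rows if r[k] == value) / len(rows)
    print(f"buys_comp={c}: {likelihood * prior:.3f}")  # Y -> 0.028, N -> 0.007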
Bayesian Belief Networks
▪ Naïve BC assumes class conditional independence
▪ This assumption simplifies computations
▪ When this assumption holds true, the Naïve BC is the most accurate in comparison with all other classifiers
▪ In real problems, however, dependencies do exist between variables
▪ Two methods to overcome this limitation of the NBC:
▪ Bayesian networks, which combine Bayesian reasoning with causal relationships between attributes
▪ Decision trees, which reason on one attribute at a time, considering the most important attributes first
Bayesian Belief Networks
Also known as
▪ Belief Networks
▪ Bayesian Networks
▪ Probabilistic Networks
A belief network has two components:
▪ Directed Acyclic Graph (DAG)
▪ Conditional Probability Table (CPT)
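To make the two components concrete, the sketch below encodes a tiny hypothetical network (Smoker -> LungCancer) in Python. The structure and every probability in it are assumptions made only for illustration; they do not come from the slides.

# 1. Directed Acyclic Graph: each node maps to its children
dag = {"Smoker": ["LungCancer"], "LungCancer": []}

# 2. Conditional Probability Table: each node's distribution, given its parents' values
cpt = {
    "Smoker": {(): {"yes": 0.30, "no": 0.70}},               # no parents, so this is a prior
    "LungCancer": {("yes",): {"yes": 0.10, "no": 0.90},      # P(LungCancer | Smoker = yes)
                   ("no",):  {"yes": 0.01, "no": 0.99}},     # P(LungCancer | Smoker = no)
}

# The joint probability factorises along the DAG:
# P(Smoker = yes, LungCancer = yes) = P(Smoker = yes) * P(LungCancer = yes | Smoker = yes)
p = cpt["Smoker"][()]["yes"] * cpt["LungCancer"][("yes",)]["yes"]
print(p)  # 0.03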
The k-Nearest Neighbor Algorithm
▪ All instances correspond to points in the n-D space.
▪ The nearest neighbors are defined in terms of Euclidean distance.
▪ The Euclidean distance between two points X = (x1, x2, …, xn) and Y = (y1, y2, …, yn) is d(X, Y) = √( Σ i=1..n (xi − yi)² )
▪ The target function could be discrete- or real-valued.
▪ For a discrete-valued target, k-NN returns the most common value among the k training examples nearest to xq.
▪ The k-NN algorithm for continuous-valued target functions
▪ calculates the mean value of the k nearest neighbors
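A minimal Python sketch of the algorithm just described, assuming numeric attributes; the function and parameter names are illustrative placeholders.

import math
from collections import Counter

def knn_predict(train, targets, xq, k=3, discrete=True):
    # Euclidean distance between two points
    dist = lambda a, b: math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))
    # indices of the k training points nearest to the query point xq
    nearest = sorted(range(len(train)), key=lambda i: dist(train[i], xq))[:k]
    if discrete:
        # discrete-valued target: return the most common value among the k neighbors
        return Counter(targets[i] for i in nearest).most_common(1)[0][0]
    # continuous-valued target: return the mean value of the k nearest neighbors
    return sum(targets[i] for i in nearest) / k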
The k-Nearest Neighbor Algorithm
▪ Distance-weighted nearest neighbor algorithm
▪ Weight the contribution of each of the k neighbors according to their distance from the query point xq (giving greater weight to closer neighbors)
▪ Nearest neighbor classifiers are lazy learners: they store all of the training samples and do not build a classifier until a new sample needs to be classified
▪ Robust to noisy data by averaging over the k nearest neighbors
▪ Curse of dimensionality: the distance between neighbors can be dominated by irrelevant attributes
▪ To overcome it, eliminate the least relevant attributes.
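A small variant of the sketch above showing the distance-weighted vote; the 1/d² weight is one common choice, assumed here for illustration.

import math
from collections import defaultdict

def weighted_knn_predict(train, targets, xq, k=3):
    dist = lambda a, b: math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))
    nearest = sorted(range(len(train)), key=lambda i: dist(train[i], xq))[:k]
    votes = defaultdict(float)
    for i in nearest:
        d = dist(train[i], xq)
        if d == 0:
            return targets[i]          # an exact match decides immediately
        votes[targets[i]] += 1.0 / d ** 2  # closer neighbors get a larger vote
    return max(votes, key=votes.get)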
Case-Based Reasoning
▪ Also uses: lazy evaluation + analysis of similar instances
▪ Difference: Instances are not “points in a Euclidean space”
▪ Methodology
▪ Instances represented by rich symbolic descriptions
(e.g., function graphs)
▪ Multiple retrieved cases may be combined
▪ Tight coupling between case retrieval, knowledge-based
reasoning, and problem solving
▪ Research issues
▪ Indexing based on a syntactic similarity measure and, on failure, backtracking and adapting to additional cases
Classifier Accuracy
▪ How can it be measured?
▪ Holdout Method (Random Subsampling)
▪ k-fold Cross Validation (a short sketch follows this slide)
▪ Bootstrapping
▪ How can we improve classifier accuracy?
▪ Bagging
▪ Boosting
▪ Is accuracy enough to judge a classifier?
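Of the measurement techniques listed above, k-fold cross validation is the most widely used. The sketch below is a minimal illustration; train_fn and predict_fn are placeholders for any classifier's training and prediction routines, assumed here for generality.

import random

def k_fold_accuracy(samples, labels, train_fn, predict_fn, k=10, seed=0):
    # shuffle the indices once, then slice them into k roughly equal folds
    idx = list(range(len(samples)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    accuracies = []
    for f, test_idx in enumerate(folds):
        # train on the other k-1 folds, test on the held-out fold
        train_idx = [i for g, fold in enumerate(folds) if g != f for i in fold]
        model = train_fn([samples[i] for i in train_idx], [labels[i] for i in train_idx])
        correct = sum(predict_fn(model, samples[i]) == labels[i] for i in test_idx)
        accuracies.append(correct / len(test_idx))
    # the k fold accuracies are averaged into a single estimate
    return sum(accuracies) / k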
Classification of Class-Imbalanced Data Sets
• Class-imbalance problem: Rare positive example but numerous negative
ones, e.g., medical diagnosis, fraud, oil-spill, fault, etc.
• Traditional methods assume a balanced distribution of classes and
equal error costs: not suitable for class-imbalanced data
• Typical methods for imbalanced data in 2-class classification:
– Oversampling: re-sample data from the positive (rare) class (a short sketch follows below)
– Under-sampling: randomly eliminate tuples from the negative class
– Threshold-moving: move the decision threshold, t, so that rare-class tuples are easier to classify and hence there is less chance of false negative errors
– Ensemble techniques: combine multiple classifiers, such as those introduced above
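As a concrete illustration of the first re-sampling technique, the sketch below randomly duplicates rare-class tuples until the two classes are balanced; the function and label names are assumptions for illustration only.

import random

def random_oversample(samples, labels, positive_label, seed=0):
    rng = random.Random(seed)
    pos = [(x, y) for x, y in zip(samples, labels) if y == positive_label]
    neg = [(x, y) for x, y in zip(samples, labels) if y != positive_label]
    # re-sample (with replacement) from the rare positive class until it matches the negative class
    while len(pos) < len(neg):
        pos.append(rng.choice(pos))
    balanced = pos + neg
    rng.shuffle(balanced)
    xs, ys = zip(*balanced)
    return list(xs), list(ys)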
