Statistical Inference INF312 - Is - Lecture 03 - Part 3

The document discusses Bayesian classification, emphasizing its probabilistic prediction capabilities based on Bayes' Theorem. It explains the naive Bayes classifier, which simplifies computations by assuming attribute independence, and provides examples of how to calculate probabilities for classifying data. Additionally, it includes a solved example demonstrating the application of Bayes' Theorem in predicting bone fractures using bone mineral density measurements.

Bayesian Classification: Why?

◼ A statistical classifier: performs probabilistic prediction, i.e., predicts class membership probabilities
◼ Foundation: Based on Bayes’ Theorem.
◼ Performance: A simple Bayesian classifier, the naïve Bayesian
classifier, has performance comparable to decision tree and
selected neural network classifiers
◼ Incremental: Each training example can incrementally
increase/decrease the probability that a hypothesis is correct —
prior knowledge can be combined with observed data
◼ Standard: Even when Bayesian methods are computationally
intractable, they can provide a standard of optimal decision
making against which other methods can be measured
1
Bayes’ Theorem: Basics
◼ Total Probability Theorem: P(B) = Σ_{i=1}^{M} P(B|A_i) P(A_i)

◼ Bayes’ Theorem: P(H|X) = P(X|H) P(H) / P(X)
◼ Let X be a data sample (“evidence”): class label is unknown
◼ Let H be a hypothesis that X belongs to class C
◼ Classification is to determine P(H|X) (i.e., the posterior probability): the
probability that the hypothesis holds given the observed data sample X
◼ P(H) (prior probability): the initial probability
◼ E.g., X will buy computer, regardless of age, income, …

◼ P(X): probability that sample data is observed


◼ P(X|H) (likelihood): the probability of observing the sample X, given that
the hypothesis holds
◼ E.g., given that X will buy a computer, the probability that X is 31..40 with medium income
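
As a quick illustration (not part of the original slides), the minimal Python sketch below plugs numbers into Bayes’ Theorem; the prior, likelihood, and evidence values are purely hypothetical.

```python
# Minimal sketch of Bayes' Theorem: P(H|X) = P(X|H) * P(H) / P(X)
# All numbers below are hypothetical, chosen only to illustrate the formula.

def posterior(likelihood, prior, evidence):
    """Return P(H|X) given P(X|H), P(H), and P(X)."""
    return likelihood * prior / evidence

p_h = 0.4          # P(H): prior probability that a customer buys a computer
p_x_given_h = 0.3  # P(X|H): probability of this attribute profile among buyers
p_x = 0.2          # P(X): overall probability of observing this attribute profile

print(posterior(p_x_given_h, p_h, p_x))  # 0.3 * 0.4 / 0.2 = 0.6
```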
2
Prediction Based on Bayes’ Theorem
◼ Given training data X, posteriori probability of a hypothesis H,
P(H|X), follows the Bayes’ theorem

P(H|X) = P(X|H) P(H) / P(X)
◼ Informally, this can be viewed as
posterior = likelihood × prior / evidence
◼ Predicts X belongs to Ci iff the probability P(Ci|X) is the highest
among all the P(Ck|X) for all the k classes
◼ Practical difficulty: It requires initial knowledge of many
probabilities, involving significant computational cost

3
Classification Is to Derive the Maximum Posteriori
◼ Let D be a training set of tuples and their associated class
labels, and each tuple is represented by an n-D attribute vector
X = (x1, x2, …, xn)
◼ Suppose there are m classes C1, C2, …, Cm.
◼ Classification is to derive the maximum a posteriori probability, i.e., the
maximal P(Ci|X)
◼ This can be derived from Bayes’ theorem
P(Ci|X) = P(X|Ci) P(Ci) / P(X)
◼ Since P(X) is constant for all classes, only
P(X|Ci) P(Ci)
needs to be maximized
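
The point that P(X) can be ignored is easy to check numerically. The sketch below (with hypothetical class scores) shows that dividing every score by the same evidence term P(X) does not change which class attains the maximum.

```python
# Hypothetical unnormalized scores P(X|Ci) * P(Ci) for three classes.
scores = {"C1": 0.028, "C2": 0.007, "C3": 0.012}
p_x = sum(scores.values())  # P(X) by the total probability theorem

# Normalized posteriors P(Ci|X) = P(X|Ci) * P(Ci) / P(X)
posteriors = {c: s / p_x for c, s in scores.items()}

# The argmax is identical with or without dividing by P(X).
print(max(scores, key=scores.get))       # C1
print(max(posteriors, key=posteriors.get))  # C1
```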

4
Naïve Bayes Classifier
◼ A simplified assumption: attributes are conditionally
independent (i.e., no dependence relation between
attributes):
P(X|Ci) = ∏_{k=1}^{n} P(x_k|Ci) = P(x_1|Ci) × P(x_2|Ci) × ... × P(x_n|Ci)
◼ This greatly reduces the computation cost: only the class distribution
needs to be counted
◼ If Ak is categorical, P(xk|Ci) is the # of tuples in Ci having value xk
for Ak divided by |Ci, D| (# of tuples of Ci in D)
◼ If Ak is continuous-valued, P(xk|Ci) is usually computed based on a
Gaussian distribution with mean μ and standard deviation σ:
g(x, μ, σ) = (1 / (√(2π) σ)) · e^(−(x−μ)² / (2σ²))
and P(xk|Ci) = g(xk, μ_Ci, σ_Ci)
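
For continuous attributes, the Gaussian density above can be coded directly. A minimal sketch follows; the mean and standard deviation used here are made-up class statistics, purely for illustration.

```python
import math

def gaussian(x, mu, sigma):
    """Gaussian density g(x, mu, sigma) used for continuous attributes."""
    return (1.0 / (math.sqrt(2 * math.pi) * sigma)) * \
           math.exp(-((x - mu) ** 2) / (2 * sigma ** 2))

# Hypothetical class statistics for a continuous attribute such as age:
mu_ci, sigma_ci = 38.0, 7.5  # estimated from the tuples of class Ci
print(gaussian(35.0, mu_ci, sigma_ci))  # P(age = 35 | Ci) under the Gaussian assumption
```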
5
Naïve Bayes Classifier: Training Dataset
Example:
age income student credit_rating buys_computer
<=30 high no fair no
<=30 high no excellent no
31…40 high no fair yes
>40 medium no fair yes
>40 low yes fair yes
>40 low yes excellent no
31…40 low yes excellent yes
<=30 medium no fair no
<=30 low yes fair yes
>40 medium yes fair yes
<=30 medium yes excellent yes
31…40 medium no excellent yes
31…40 high yes fair yes
>40 medium no excellent no

Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’


Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
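
To make the counts on the following slides reproducible, the sketch below encodes the 14 training tuples from the table as a Python list (variable names are illustrative, not from the original slides).

```python
# The 14 training tuples: (age, income, student, credit_rating, buys_computer)
dataset = [
    ("<=30",   "high",   "no",  "fair",      "no"),
    ("<=30",   "high",   "no",  "excellent", "no"),
    ("31..40", "high",   "no",  "fair",      "yes"),
    (">40",    "medium", "no",  "fair",      "yes"),
    (">40",    "low",    "yes", "fair",      "yes"),
    (">40",    "low",    "yes", "excellent", "no"),
    ("31..40", "low",    "yes", "excellent", "yes"),
    ("<=30",   "medium", "no",  "fair",      "no"),
    ("<=30",   "low",    "yes", "fair",      "yes"),
    (">40",    "medium", "yes", "fair",      "yes"),
    ("<=30",   "medium", "yes", "excellent", "yes"),
    ("31..40", "medium", "no",  "excellent", "yes"),
    ("31..40", "high",   "yes", "fair",      "yes"),
    (">40",    "medium", "no",  "excellent", "no"),
]

# The tuple to classify:
X = ("<=30", "medium", "yes", "fair")
```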
6
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’

◼ Compute P(Ci) for each class:


◼ P(C1) = P(buys_computer = “yes”) = 9/14 = 0.643

◼ P(C2) = P(buys_computer = “no”) = 5/14 = 0.357
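
A short sketch verifying the two priors from the class-label counts (9 “yes” and 5 “no” out of 14 tuples):

```python
from collections import Counter

# Class labels of the 14 training tuples, in table order (9 "yes", 5 "no").
labels = ["no", "no", "yes", "yes", "yes", "no", "yes",
          "no", "yes", "yes", "yes", "yes", "yes", "no"]

counts = Counter(labels)
priors = {c: n / len(labels) for c, n in counts.items()}
print(priors["yes"])  # 9/14 ≈ 0.643
print(priors["no"])   # 5/14 ≈ 0.357
```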

7
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’

◼ Compute P(X|Ci) for each class:

P(X|C1) = P(x1|C1) × P(x2|C1) × P(x3|C1) × … × P(xn|C1)

P(X|C2) = P(x1|C2) × P(x2|C2) × P(x3|C2) × … × P(xn|C2)

8
Naïve Bayes Classifier: Training Dataset
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
Age Buys Computer Count Total Conditional Probability (fraction, decimal)
<= 30 Yes 2 9 (2/9) 0.222222222
<= 30 No 3 5 (3/5) 0.6
31-40 Yes 4 9 (4/9) 0.444444444
31-40 No 0 5 (0/5) 0
> 40 Yes 3 9 (3/9) 0.333333333
> 40 No 2 5 (2/5) 0.4

P(Age <= 30| Buys Computer = Yes) 0.222222222


P(Age <= 30| Buys Computer = No) 0.6
P(Age Between 31 and 40| Buys Computer = Yes) 0.444444444
P(Age Between 31 and 40| Buys Computer = No) 0
P(Age > 40| Buys Computer = Yes) 0.333333333
P(Age > 40| Buys Computer = No) 0.4
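
The same counting can be done in a few lines of Python. The sketch below reproduces the conditional probabilities for the age attribute; the same function applies unchanged to the income, student, and credit_rating tables on the next slides.

```python
from collections import Counter

# (age, buys_computer) pairs for the 14 training tuples, in table order.
pairs = [("<=30", "no"), ("<=30", "no"), ("31..40", "yes"), (">40", "yes"),
         (">40", "yes"), (">40", "no"), ("31..40", "yes"), ("<=30", "no"),
         ("<=30", "yes"), (">40", "yes"), ("<=30", "yes"), ("31..40", "yes"),
         ("31..40", "yes"), (">40", "no")]

def conditional_table(pairs):
    """P(value | class) for every (attribute value, class) combination seen in pairs."""
    joint = Counter(pairs)                       # counts of (value, class)
    class_totals = Counter(c for _, c in pairs)  # counts of each class
    return {(v, c): n / class_totals[c] for (v, c), n in joint.items()}

table = conditional_table(pairs)
print(table[("<=30", "yes")])    # 2/9 ≈ 0.222
print(table[("<=30", "no")])     # 3/5 = 0.6
print(table[("31..40", "yes")])  # 4/9 ≈ 0.444
```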

9
Naïve Bayes Classifier: Training Dataset
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
Income Buys Computer Count Total Conditional Probability (fraction, decimal)
High Yes 2 9 (2/9) 0.222222222
High No 2 5 (2/5) 0.4
Medium Yes 4 9 (4/9) 0.444444444
Medium No 2 5 (2/5) 0.4
Low Yes 3 9 (3/9) 0.333333333
Low No 1 5 (1/5) 0.2

P(Income = High| Buys Computer = Yes) 0.222222222


P(Income = High| Buys Computer = No) 0.4
P(Income = Medium| Buys Computer = Yes) 0.444444444
P(Income = Medium| Buys Computer = No) 0.4
P(Income = Low| Buys Computer = Yes) 0.333333333
P(Income = Low| Buys Computer = No) 0.2

10
Naïve Bayes Classifier: Training Dataset
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)

Student Buys Computer Count Total Conditional Probability (fraction, decimal)


Yes Yes 6 9 (6/9) 0.666666667
Yes No 1 5 (1/5) 0.2
No Yes 3 9 (3/9) 0.333333333
No No 4 5 (4/5) 0.8

P(Student = Yes| Buys Computer = Yes) 0.666666667


P(Student = Yes| Buys Computer = No) 0.2
P(Student = No| Buys Computer = Yes) 0.333333333
P(Student = No| Buys Computer = No) 0.8

11
Naïve Bayes Classifier: Training Dataset
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)

Credit Rating Buys Computer Count Total Conditional Probability (fraction, decimal)
Fair Yes 6 9 (6/9) 0.666666667
Fair No 2 5 (2/5) 0.4
Excellent Yes 3 9 (3/9) 0.333333333
Excellent No 3 5 (3/5) 0.6

P(Credit Rating = Fair| Buys Computer = Yes) 0.666666667


P(Credit Rating = Fair| Buys Computer = No) 0.4
P(Credit Rating = Excellent| Buys Computer = Yes) 0.333333333
P(Credit Rating = Excellent| Buys Computer = No) 0.6

12
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’

◼ Compute P(X|Ci) for each class

P(X|C1) = P(X|buys_computer = “yes”)

= 0.222 x 0.444 x 0.667 x 0.667 = 0.044

P(X|C2) = P(X|buys_computer = “no”)

= 0.6 x 0.4 x 0.2 x 0.4 = 0.019

13
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’

◼ Compute P(X|Ci) * P(Ci) for each class

P(X|C1) * P(C1) = 0.044 * 0.643 = 0.028

P(X|C2) * P(C2) = 0.019 * 0.357 = 0.007

◼ Decision

P(X|C1) * P(C1) > P(X|C2) * P(C2)

X belongs to (C1)
Therefore, X belongs to class (“buys_computer = yes”)
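
Putting the pieces together, the sketch below recomputes the two scores from the conditional probabilities and priors derived on the previous slides and confirms the decision (a verification sketch, not part of the original slides).

```python
import math

# Conditional probabilities for X = (age <=30, income = medium, student = yes,
# credit_rating = fair), taken from the tables on the previous slides.
cond_yes = [2/9, 4/9, 6/9, 6/9]   # P(age|yes), P(income|yes), P(student|yes), P(credit|yes)
cond_no  = [3/5, 2/5, 1/5, 2/5]   # the same four conditionals for the "no" class

prior_yes, prior_no = 9/14, 5/14  # class priors from slide 7

score_yes = math.prod(cond_yes) * prior_yes  # ≈ 0.044 * 0.643 ≈ 0.028
score_no  = math.prod(cond_no)  * prior_no   # ≈ 0.019 * 0.357 ≈ 0.007

print("buys_computer = yes" if score_yes > score_no else "buys_computer = no")
```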
14
Solved Example on Bayes Theorem
◼ Researchers investigated the effectiveness of using the
Hologic Sahara Sonometer, a portable device that
measures bone mineral density (BMD) in the ankle, in
predicting a fracture. They used a Hologic estimated
bone mineral density value of 0.57 as a cutoff. The
results of the investigation yielded the following data
(counts of subjects by test result and confirmed fracture status):

Test Result      Confirmed Fracture (+D)   No Confirmed Fracture (−D)
Positive (+T)    214                       670
Negative (−T)    73                        330
Total            287                       1000

15
Solved Example on Bayes Theorem
a) Calculate the sensitivity of using a BMD value of 0.57
as a cutoff value for predicting fracture.
b) Calculate the specificity of using a BMD value of 0.57
as a cutoff value for predicting fracture.
c) If it is estimated that 10 percent of the U.S.
population have a confirmed bone fracture, what is
the predictive value positive of using a BMD value of
0.57 as a cutoff value for predicting fracture? That is,
we wish to estimate the probability that a subject
with a positive test at the 0.57 BMD cutoff has a
confirmed bone fracture.

16
Solved Example on Bayes Theorem

a) Sensitivity = P(+T | +D) = 214/287 = 0.7456 = 74.56%

b) Specificity = P(−T | −D) = 330/1000 = 0.33 = 33%

c) Predictive Value Positive:
P(+D | +T) = P(+T | +D) · P(+D) / P(+T)

17
Solved Example on Bayes Theorem

c) Predictive Value Positive


P(+T) = P(+T|+D)P(+D) + P(+T|−D)P(−D)
= (214/287)(0.1) + (670/1000)(0.9) = 0.6776
◼ P(+D|+T) = P(+T|+D) · P(+D) / P(+T) = (0.7456 × 0.1) / 0.6776 = 0.11
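
The same calculation in a few lines of Python, using the sensitivity, specificity, and 10% prevalence from the example:

```python
sensitivity = 214 / 287   # P(+T | +D) ≈ 0.7456
specificity = 330 / 1000  # P(-T | -D) = 0.33
prevalence  = 0.10        # P(+D): assumed prevalence of confirmed fracture

# Total probability: P(+T) = P(+T|+D)P(+D) + P(+T|-D)P(-D)
p_test_pos = sensitivity * prevalence + (1 - specificity) * (1 - prevalence)

# Bayes' Theorem: P(+D|+T) = P(+T|+D)P(+D) / P(+T)
ppv = sensitivity * prevalence / p_test_pos
print(round(p_test_pos, 4))  # 0.6776
print(round(ppv, 2))         # 0.11
```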

18
