
TOOLS & TECHNIQUES FOR DATA SCIENCE
LECTURE 5
Bayesian Classification, Naïve Bayes Classifier
Bayesian Classification: Why?
 A statistical classifier: performs probabilistic prediction,
i.e., predicts class membership probabilities
 Foundation: Based on Bayes’ Theorem.
 Performance: A simple Bayesian classifier, the naïve Bayesian
classifier, has performance comparable to decision tree
and selected neural network classifiers
 Incremental: Each training example can incrementally
increase/decrease the probability that a hypothesis is
correct — prior knowledge can be combined with observed
data
 Standard: Even when Bayesian methods are
computationally intractable, they can provide a standard
of optimal decision making against which other methods
can be measured
Bayes’ Theorem: Basics
 Total probability theorem: P(B) = Σ_{i=1}^{M} P(B | A_i) P(A_i)
 Bayes' theorem: P(H | X) = P(X | H) P(H) / P(X)
 Let X be a data sample (“evidence”): class label is unknown
 Let H be a hypothesis that X belongs to class C
 Classification is to determine P(H|X) (i.e., the posterior probability): the
probability that the hypothesis holds given the observed data
sample X
 P(H) (prior probability): the initial probability of the hypothesis, before X is observed
 E.g., X will buy computer, regardless of age, income, …
 P(X): probability that sample data is observed
 P(X|H) (likelihood): the probability of observing the sample X, given
that the hypothesis holds
 E.g., given that X will buy computer, the prob. that X is
31..40 with medium income
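 Illustration (not in the original slides): a minimal Python sketch of Bayes' theorem; the prior, likelihood, and evidence values below are assumed for the example.

# Minimal sketch of Bayes' theorem with assumed numbers.
# H = "customer buys a computer", X = "customer is 31..40 with medium income".
p_h = 0.6          # prior P(H): assumed fraction of customers who buy a computer
p_x_given_h = 0.3  # likelihood P(X|H): assumed prob. of this profile among buyers
p_x = 0.25         # evidence P(X): assumed prob. of this profile overall

p_h_given_x = p_x_given_h * p_h / p_x  # posterior P(H|X) by Bayes' theorem
print(p_h_given_x)                     # 0.72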
Prediction Based on Bayes’ Theorem
 Given training data X, the posterior probability of a
hypothesis H, P(H | X), follows Bayes' theorem:
P(H | X) = P(X | H) P(H) / P(X)
 Informally, this can be viewed as
posterior = likelihood × prior / evidence
 Predict that X belongs to C_i iff the probability P(C_i|X) is the
highest among all the P(C_k|X) for the k classes
 Practical difficulty: It requires initial knowledge of many
probabilities, involving significant computational cost
Classification Is to Derive the Maximum
Posterior
 Let D be a training set of tuples and their associated
class labels, and each tuple is represented by an n-D
attribute vector X = (x1, x2, …, xn)
 Suppose there are m classes C1, C2, …, Cm.
 Classification is to derive the maximum posterior, i.e.,
the maximal P(C_i|X)
 This can be derived from Bayes’ theorem
P(C_i | X) = P(X | C_i) P(C_i) / P(X)
 Since P(X) is constant for all classes, only
P(X | C_i) P(C_i)
needs to be maximized
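 Illustration (not in the original slides): a Python sketch of this decision rule, picking the class that maximizes P(X|C_i) P(C_i); the class priors match the 14-tuple dataset on the training-data slide below, while the likelihood values are hypothetical placeholders.

# Pick the class C_i maximizing P(X|C_i) * P(C_i); P(X) is a common factor and is dropped.
priors = {"yes": 9/14, "no": 5/14}         # P(C_i), from the example training set
likelihoods = {"yes": 0.044, "no": 0.019}  # P(X|C_i): hypothetical values for one tuple X

scores = {c: likelihoods[c] * priors[c] for c in priors}
prediction = max(scores, key=scores.get)
print(prediction, scores)                  # "yes" wins here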
Naïve Bayes Classifier
 A simplifying assumption: attributes are conditionally
independent given the class (i.e., no dependence relation between
attributes):
P(X | C_i) = Π_{k=1}^{n} P(x_k | C_i) = P(x_1 | C_i) × P(x_2 | C_i) × ... × P(x_n | C_i)
 This greatly reduces the computation cost: only the class
distributions need to be counted
 If Ak is categorical, P(xk|Ci) is the # of tuples in Ci having
value xk for Ak divided by |Ci, D| (# of tuples of Ci in D)
 If A_k is continuous-valued, P(x_k|C_i) is usually computed
based on a Gaussian distribution with mean μ and
standard deviation σ:
g(x, μ, σ) = (1 / (√(2π) σ)) exp(−(x − μ)² / (2σ²))
and P(x_k|C_i) = g(x_k, μ_{C_i}, σ_{C_i})
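 Illustration (not in the original slides): a Python sketch of the Gaussian estimate for a continuous attribute; the attribute values used to fit μ and σ are hypothetical.

import math

def gaussian(x, mu, sigma):
    # Gaussian density g(x, mu, sigma) used as P(x_k | C_i) for continuous attributes
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

# Estimate mu and sigma of an attribute (e.g., age) from the tuples of class C_i,
# then evaluate the density at the test value x_k.
ages_in_class = [25, 30, 35, 40, 45]   # hypothetical age values of the class C_i tuples
mu = sum(ages_in_class) / len(ages_in_class)
sigma = (sum((a - mu) ** 2 for a in ages_in_class) / len(ages_in_class)) ** 0.5
print(gaussian(32, mu, sigma))         # P(age = 32 | C_i)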
Naïve Bayes Classifier: Training Dataset
age      income   student  credit_rating  buys_computer
<=30     high     no       fair           no
<=30     high     no       excellent      no
31…40    high     no       fair           yes
>40      medium   no       fair           yes
>40      low      yes      fair           yes
>40      low      yes      excellent      no
31…40    low      yes      excellent      yes
<=30     medium   no       fair           no
<=30     low      yes      fair           yes
>40      medium   yes      fair           yes
<=30     medium   yes      excellent      yes
31…40    medium   no       excellent      yes
31…40    high     yes      fair           yes
>40      medium   no       excellent      no
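 Illustration (not in the original slides): a Python sketch of the naïve Bayes classifier trained by counting over the 14 tuples above, then classifying the unseen tuple X = (age <= 30, income = medium, student = yes, credit_rating = fair); the helper names and value encodings are mine.

from collections import Counter

# Training tuples from the table: (age, income, student, credit_rating, buys_computer)
data = [
    ("<=30",  "high",   "no",  "fair",      "no"),
    ("<=30",  "high",   "no",  "excellent", "no"),
    ("31-40", "high",   "no",  "fair",      "yes"),
    (">40",   "medium", "no",  "fair",      "yes"),
    (">40",   "low",    "yes", "fair",      "yes"),
    (">40",   "low",    "yes", "excellent", "no"),
    ("31-40", "low",    "yes", "excellent", "yes"),
    ("<=30",  "medium", "no",  "fair",      "no"),
    ("<=30",  "low",    "yes", "fair",      "yes"),
    (">40",   "medium", "yes", "fair",      "yes"),
    ("<=30",  "medium", "yes", "excellent", "yes"),
    ("31-40", "medium", "no",  "excellent", "yes"),
    ("31-40", "high",   "yes", "fair",      "yes"),
    (">40",   "medium", "no",  "excellent", "no"),
]

class_counts = Counter(row[-1] for row in data)  # |C_i,D|: 9 "yes", 5 "no"

def p_xk_given_ci(k, value, ci):
    # P(x_k | C_i): tuples of C_i with this value for attribute k, divided by |C_i,D|
    return sum(1 for row in data if row[-1] == ci and row[k] == value) / class_counts[ci]

def classify(x):
    # Return the class maximizing P(C_i) * product of P(x_k | C_i)
    scores = {}
    for ci, n in class_counts.items():
        score = n / len(data)                    # prior P(C_i)
        for k, value in enumerate(x):
            score *= p_xk_given_ci(k, value, ci)
        scores[ci] = score
    return max(scores, key=scores.get), scores

print(classify(("<=30", "medium", "yes", "fair")))   # predicts "yes"

For this tuple the counts give P(X|yes)P(yes) ≈ 0.028 and P(X|no)P(no) ≈ 0.007, so "yes" (buys_computer) is predicted.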
Naïve Bayes Classifier: Comments
 Advantages
 Easy to implement
 Good results obtained in most cases
 Disadvantages
 Assumption: class conditional independence, which
can cause a loss of accuracy
 Practically, dependencies exist among variables
 E.g., in hospitals, a patient's Profile (age, family history, etc.),
Symptoms (fever, cough, etc.), and Disease (lung
cancer, diabetes, etc.)
 Dependencies among these cannot be modeled by Naïve Bayes Classifier