8. Introduction to Artificial Intelligence 2
The document provides an introduction to the Naive Bayes algorithm, explaining Bayes' theorem and its application in medical diagnosis and classification problems. It details how to calculate conditional probabilities and the workings of the Naive Bayes classifier, including exercises on discrete data and handling zero probabilities. The document emphasizes the effectiveness of the Naive Bayes classifier in various machine learning tasks, particularly in natural language classification.
• Bayes’ theorem is used to calculate the probability that a certain event will occur, or that a certain proposition is true, given that part of the information (the evidence) is already known. For events A and B:
P(B ∣ A) = P(A ∣ B) × P(B) / P(A)
• P(B) is called the prior probability of B.
• P(B ∣ A) is called the posterior probability of B.
Exercise 1: Medical Diagnosis
• Example: Suppose you have a high temperature. What is the likelihood that you have a cold?
• A high temperature occurs as a symptom in 80% of cold cases. Also, 1 in 10,000 people has a cold, and 1 in every 1,000 people has a high temperature. We can use A to represent “high temperature” and B to represent “cold”.
1. Likelihood: P(A ∣ B) = 0.8
2. Prior probability: P(B) = 0.0001
3. Marginal probability: P(A) = 0.001
4. Posterior: P(B ∣ A) = P(A ∣ B) × P(B) / P(A) = (0.8 × 0.0001) / 0.001 = 0.08, the probability that a patient has a cold, given the high temperature.
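The same calculation, written as a minimal Python sketch (the variable names are illustrative, not from the original slides):

```python
# Bayes' theorem applied to the cold example above (values taken from the text).
p_temp_given_cold = 0.8     # likelihood  P(A | B)
p_cold = 0.0001             # prior       P(B)
p_temp = 0.001              # marginal    P(A)

p_cold_given_temp = p_temp_given_cold * p_cold / p_temp    # posterior P(B | A)
print(round(p_cold_given_temp, 2))                         # 0.08
```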
Conditional Probabilities Comparison
• We can compare the probabilities of all hypotheses.
• Example: when making a diagnosis from a set of evidence, one often has to choose among several possible hypotheses.
• Let us extend the previous medical example by using hypothesis C to represent the “plague”. Note: the rate of infection with the plague is 0.000000001, and 99% of those infected with the plague have a high temperature.
1. P(A) = 0.001 (A: high temperature)
2. P(B) = 0.0001 (B: cold)
3. P(B ∣ A) = 0.8 (the probability that a patient has a cold, given the high temperature)
4. P(C) = 0.000000001 (C: plague)
5. P(A ∣ C) = 0.99
The probability of having the plague, given a high temperature:
P(C ∣ A) = P(A ∣ C) × P(C) / P(A) = (0.99 × 0.000000001) / 0.001 = 0.00000099
To find which of B and C is more likely given A, we can eliminate P(A) from these equations and determine the relative likelihood of B and C as follows:
P(B ∣ A) / P(C ∣ A) = [P(A ∣ B) × P(B)] / [P(A ∣ C) × P(C)] = (0.8 × 0.0001) / (0.99 × 0.000000001) ≈ 80,808
• The probability of having a cold, given the patient's high temperature, is therefore tens of thousands of times higher (about 80,000 times) than the probability of having the plague.
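A minimal Python sketch of the comparison above, dropping the shared marginal P(A); all numbers come from the example, and the variable names are illustrative:

```python
# Relative likelihood of the two hypotheses, without the shared marginal P(A):
# P(B | A) / P(C | A) = [P(A | B) * P(B)] / [P(A | C) * P(C)]
cold_score   = 0.8  * 0.0001          # P(A | B) * P(B)
plague_score = 0.99 * 0.000000001     # P(A | C) * P(C)
print(round(cold_score / plague_score))   # about 80808: the cold is far more likely
```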
Naïve Bayes Classifier
• The naïve Bayes classifier is a simple but effective machine learning method for many problems, especially those related to natural language classification.
• It classifies data based on Bayes' theorem, but assumes that the attributes (the pieces of evidence) are independent of one another, which is why it is called “naïve”.
• A data set consists of several data points; each data point contains a set of attributes, each of which can take one of several possible values (categorical or numeric), and has a specific classification.
• To identify the best classification for a particular item of data (d1, ..., dn), the posterior probability of each classification is calculated: P(ci ∣ d1, ..., dn).
• Here ci is one of the classifications in the set of possible hypotheses or classifications (for example, if the set of classifications is {Pass, Fail, Apologetic}, ci might represent the classification Pass).
• The hypothesis that has the highest posterior probability is known as the maximum a posteriori (MAP) hypothesis. By Bayes' theorem:
P(ci ∣ d1, ..., dn) = P(d1, ..., dn ∣ ci) × P(ci) / P(d1, ..., dn)
• Since the evidence term P(d1, ..., dn) is the same for every classification, it can be dropped, leaving:
P(ci ∣ d1, ..., dn) ∝ P(d1, ..., dn ∣ ci) × P(ci)
• The naïve Bayes classifier now assumes that each of the attributes in the data item is independent of the others, in which case this can be rewritten as:
P(ci ∣ d1, ..., dn) ∝ P(d1 ∣ ci) × ... × P(dn ∣ ci) × P(ci)
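As a rough illustration of this factorisation, here is a small Python sketch that scores each class by its prior multiplied by the per-attribute likelihoods estimated from raw counts; the function name and data layout are assumptions made for this example, not part of the original material:

```python
from collections import Counter

def naive_bayes_posteriors(training, new_point):
    """Score each class c by P(c) * prod_i P(d_i | c), using raw counts
    (maximum-likelihood estimates, no smoothing)."""
    class_counts = Counter(label for label, _ in training)
    total = len(training)
    scores = {}
    for c, n_c in class_counts.items():
        score = n_c / total                      # prior P(c)
        for i, value in enumerate(new_point):
            # number of class-c examples whose i-th attribute equals `value`
            matches = sum(1 for label, attrs in training
                          if label == c and attrs[i] == value)
            score *= matches / n_c               # likelihood P(d_i | c)
        scores[c] = score
    return scores
```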
Exercise 1: The Naïve Bayes Classifier (Discrete Data)
• For example, suppose that each data item consists of the attributes x, y, z, where x, y, and z are integers in the range 1 to 4. The available classifications are A, B, and C.
• The table below contains 7 training examples (3 classified as A, 2 as B, 2 as C):

Classification   X   Y   Z
A                2   3   2
B                2   3   4
C                1   3   4
A                2   4   3
B                4   3   1
C                2   1   3
A                1   2   4

• To classify a new data item (x = 2, y = 3, z = 4), the posterior probability must be calculated for each classification (A, B, C) based on the given training data; the classification with the highest probability is then chosen.
• Calculate the posterior probability for A based on the attributes of the new data item:
P(A ∣ x = 2, y = 3, z = 4) ∝ P(A) × P(x = 2 ∣ A) × P(y = 3 ∣ A) × P(z = 4 ∣ A)
= 3/7 × 2/3 × 1/3 × 1/3 ≈ 0.43 × 0.67 × 0.33 × 0.33 ≈ 0.032
• Calculate the posterior probability for B:
P(B ∣ x = 2, y = 3, z = 4) ∝ P(B) × P(x = 2 ∣ B) × P(y = 3 ∣ B) × P(z = 4 ∣ B)
= 2/7 × 1/2 × 2/2 × 1/2 ≈ 0.29 × 0.5 × 1 × 0.5 ≈ 0.071
• Calculate the posterior probability for C:
P(C ∣ x = 2, y = 3, z = 4) ∝ P(C) × P(x = 2 ∣ C) × P(y = 3 ∣ C) × P(z = 4 ∣ C)
= 2/7 × 1/2 × 1/2 × 1/2 ≈ 0.29 × 0.5 × 0.5 × 0.5 ≈ 0.036
• By comparing the probabilities of all classifications, we find that the correct classification is B.
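The hand calculation above can be reproduced with a short, self-contained Python script (the data layout is an assumption made for this sketch); the printed values should match the fractions above, with B scoring highest:

```python
from collections import Counter

# Training data from the table: (class, (x, y, z))
training = [
    ("A", (2, 3, 2)), ("B", (2, 3, 4)), ("C", (1, 3, 4)),
    ("A", (2, 4, 3)), ("B", (4, 3, 1)), ("C", (2, 1, 3)),
    ("A", (1, 2, 4)),
]
query = (2, 3, 4)

class_counts = Counter(label for label, _ in training)    # A: 3, B: 2, C: 2
total = len(training)

for c in sorted(class_counts):
    n_c = class_counts[c]
    score = n_c / total                                   # prior P(c)
    for i, value in enumerate(query):
        matches = sum(1 for label, attrs in training
                      if label == c and attrs[i] == value)
        score *= matches / n_c                            # P(attribute_i = value | c)
    print(c, round(score, 3))
# Expected output: A 0.032, B 0.071, C 0.036  ->  B is chosen
```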
Exercise 2: The Naïve Bayes Classifier (Probability of Zero)
• The zero-probability problem occurs when no training example has a given attribute value together with a particular classification.
• For example, in the table above no training example classified as B has x = 1, y = 2, or z = 2, so the corresponding likelihoods are zero. This problem can be avoided by using the m-estimate:
(a + m·p) / (b + m)
where:
• a = the number of training examples that exactly match our requirements.
• b = the number of training examples that belong to the current classification.
• p = an estimate of the probability that we are trying to obtain.
• m = a constant value, known as the equivalent sample size, taken here as 1% of the size of the training data.
• Calculate the probabilities using the m-estimate:
• To calculate the value of p: each of the variables x, y, z can take the 4 values 1, 2, 3, 4, so p = 1/4 = 0.25. Since 1% of the 7 training examples is less than one, we take m = 1.
• P(B ∣ x = 1, y = 2, z = 2) ∝ P(B) × P(x = 1 ∣ B) × P(y = 2 ∣ B) × P(z = 2 ∣ B)
= (2 + 0.25)/(7 + 1) × (0 + 0.25)/(2 + 1) × (0 + 0.25)/(2 + 1) × (0 + 0.25)/(2 + 1)
= 2.25/8 × 0.25/3 × 0.25/3 × 0.25/3 ≈ 0.00016,
which is small but no longer zero.
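A self-contained Python sketch of the m-estimate calculation above, with m = 1 and p = 0.25 as in the exercise; the helper name m_estimate and the data layout are assumptions made for this sketch:

```python
# Same training table as in Exercise 1: (class, (x, y, z))
training = [
    ("A", (2, 3, 2)), ("B", (2, 3, 4)), ("C", (1, 3, 4)),
    ("A", (2, 4, 3)), ("B", (4, 3, 1)), ("C", (2, 1, 3)),
    ("A", (1, 2, 4)),
]
query = (1, 2, 2)      # no training example classified as B matches any of these values
m, p = 1, 0.25         # equivalent sample size and prior estimate (4 possible values per attribute)

def m_estimate(a, b):
    """m-estimate of probability: (a + m*p) / (b + m)."""
    return (a + m * p) / (b + m)

n_b = sum(1 for label, _ in training if label == "B")     # 2 examples classified as B
score = m_estimate(n_b, len(training))                    # smoothed prior: (2 + 0.25) / (7 + 1)
for i, value in enumerate(query):
    matches = sum(1 for label, attrs in training
                  if label == "B" and attrs[i] == value)  # 0 for every attribute here
    score *= m_estimate(matches, n_b)                     # (0 + 0.25) / (2 + 1)
print(round(score, 6))                                    # about 0.000163 instead of an unusable 0
```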