
EXPERIMENT NO. 03
Aim: To implement and evaluate the following using Python:
a) Classification Algorithm – Naïve Bayes

Date of Performance: Date of Submission:

THEORY
Naive Bayes Classifier Algorithm
The Naive Bayes algorithm is a supervised learning algorithm based on Bayes' theorem and used for solving classification problems. It is mainly used in text classification, which involves high-dimensional training datasets. The Naïve Bayes classifier is one of the simplest and most effective classification algorithms, and it helps in building fast machine learning models that can make quick predictions. It is a probabilistic classifier, which means it predicts on the basis of the probability of an object. Popular applications of the Naïve Bayes algorithm include spam filtering, sentiment analysis, and article classification.
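To make the text-classification use case concrete, here is a minimal sketch of a spam filter built with scikit-learn's MultinomialNB; the example sentences, labels, and variable names are invented purely for illustration:

# A tiny, made-up spam-filtering sketch. MultinomialNB is commonly
# paired with word-count features from CountVectorizer.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

texts = ["win a free prize now", "meeting at noon tomorrow",
         "free offer click now", "project report attached"]
labels = [1, 0, 1, 0]  # 1 = spam, 0 = not spam (invented labels)

vectorizer = CountVectorizer()
X_counts = vectorizer.fit_transform(texts)  # word-count features

model = MultinomialNB()
model.fit(X_counts, labels)
print(model.predict(vectorizer.transform(["free prize tomorrow"])))  # likely [1]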

Why is it called Naïve Bayes?


The name Naïve Bayes combines the two words Naïve and Bayes, which can be described as follows:
Naïve: It is called naïve because it assumes that the occurrence of a certain feature is independent of the occurrence of the other features. For example, if a fruit is identified on the basis of colour, shape, and taste, then a red, spherical, and sweet fruit is recognized as an apple. Each feature individually contributes to identifying it as an apple, without depending on the others.
Bayes: It is called Bayes because it relies on the principle of Bayes' theorem.
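To make the independence assumption concrete, here is a small numeric sketch for the apple example above; all the probability values are invented for illustration:

# Under the naive assumption, the class-conditional probability of a
# feature combination factorizes into per-feature probabilities.
# All numbers below are invented for illustration.
p_red_given_apple = 0.8
p_spherical_given_apple = 0.9
p_sweet_given_apple = 0.7
# P(red, spherical, sweet | apple) under the naive assumption:
p_features_given_apple = p_red_given_apple * p_spherical_given_apple * p_sweet_given_apple
print(p_features_given_apple)  # 0.504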

Bayes' Theorem
Bayes' theorem, also known as Bayes' rule or Bayes' law, is used to determine the probability of a hypothesis given prior knowledge. It depends on conditional probability. The formula for Bayes' theorem is given as:

P(A|B) = [P(B|A) × P(A)] / P(B)

Where,
P(A|B) is the Posterior probability: the probability of hypothesis A given the observed event B.
P(B|A) is the Likelihood: the probability of the evidence B given that hypothesis A is true.



P(A) is the Prior probability: the probability of the hypothesis before observing the evidence.
P(B) is the Marginal probability: the probability of the evidence.
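As a quick numeric check of the formula, the sketch below computes a posterior directly; every probability value is invented for illustration:

# Bayes' theorem: P(A|B) = P(B|A) * P(A) / P(B)
# All numbers below are invented for illustration.
p_a = 0.3           # prior P(A)
p_b_given_a = 0.8   # likelihood P(B|A)
p_b = 0.5           # marginal P(B)
p_a_given_b = p_b_given_a * p_a / p_b
print(p_a_given_b)  # 0.48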

Python Implementation of the Naïve Bayes algorithm


Now we will implement the Naive Bayes algorithm using Python. For this, we will use the "user_data" dataset, which we have used in our other classification models, so we can easily compare the Naive Bayes model with the other models.
Steps to implement:
o Data Pre-processing step
o Fitting Naive Bayes to the Training set
o Predicting the test result
o Test accuracy of the result (Creation of Confusion matrix)
o Visualizing the test set result.

Data Pre-processing step


In this step, we will pre-process/prepare the data so that we can use it efficiently in our code, similar to the data pre-processing done in earlier experiments. The code for this is given below:
# Data Preprocessing
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder

# Load the dataset
try:
    user_data = pd.read_csv("userdata.csv")  # Change the file path accordingly
except FileNotFoundError:
    print("Error: File not found.")
    exit()

# Check if the 'target' column exists
if 'target' not in user_data.columns:
    print("Error: 'target' column not found in the dataset.")
    exit()

# Split dataset into features and labels
X = user_data.drop(columns=['target'])  # Features
y = user_data['target']                 # Labels

# Encode categorical labels
label_encoder = LabelEncoder()
y = label_encoder.fit_transform(y)

# Split the dataset into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
In the above code, we load the dataset with pd.read_csv("userdata.csv"), split it into the feature matrix X and the label vector y, encode the categorical labels with LabelEncoder, and then divide the data into training and test sets.
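GaussianNB does not require feature scaling, but if scaling is wanted (for example, to compare against scale-sensitive classifiers), a StandardScaler step could be added; the sketch below is an optional extension and not part of the pipeline above:

# Optional: feature scaling (not required by GaussianNB; shown only as a
# possible extension, using new variable names so the pipeline above is
# left unchanged).
from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)  # fit on training data only
X_test_scaled = scaler.transform(X_test)        # reuse training statistics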

The output for the dataset is given as:

Fitting Naive Bayes to the Training Set:


After the pre-processing step, we will now fit the Naive Bayes model to the training set. Below is the code for it:
# Fitting Naive Bayes to the Training set
from sklearn.naive_bayes import GaussianNB

# Create a Naive Bayes classifier
classifier = GaussianNB()

# Train the classifier
classifier.fit(X_train, y_train)
In the above code, we have used the GaussianNB classifier and fitted it to the training dataset. We can also use other classifiers as per our requirement.
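For instance, scikit-learn provides other Naive Bayes variants that can be swapped in with no other change to the pipeline; the sketch below is illustrative, and which variant is appropriate depends on the feature types in the dataset:

# Illustrative sketch: swapping in another Naive Bayes variant.
# MultinomialNB suits non-negative count features (e.g., word counts);
# BernoulliNB suits binary features (it binarizes inputs at 0 by default).
from sklearn.naive_bayes import BernoulliNB

alt_classifier = BernoulliNB()
alt_classifier.fit(X_train, y_train)
print(alt_classifier.score(X_test, y_test))  # accuracy of the alternative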



Output:

Prediction of the test set result:


Now we will predict the test set results. For this, we will create a new variable y_pred to hold the predictions and use the predict function to make them.
# Predicting the test result
y_pred = classifier.predict(X_test)
Creating Confusion Matrix:
Now we will check the accuracy of the Naive Bayes classifier using the Confusion
matrix. Below is the code for it:
# Test accuracy of the result (Creation of Confusion matrix)
from sklearn.metrics import confusion_matrix, accuracy_score

# Calculate confusion matrix
cm = confusion_matrix(y_test, y_pred)

# Calculate accuracy score
accuracy = accuracy_score(y_test, y_pred)

# Print confusion matrix and accuracy
print("Confusion Matrix:")
print(cm)
print("\nAccuracy:", accuracy)

Output:



As we can see in the above confusion matrix output, there are 7 + 3 = 10 incorrect predictions and 65 + 25 = 90 correct predictions, which corresponds to the 90% accuracy score.
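That accuracy can also be recomputed directly from the confusion matrix stored in cm, as a quick sanity check:

# Accuracy from the confusion matrix: correct predictions lie on the
# diagonal, so accuracy = trace(cm) / total predictions.
import numpy as np

correct = np.trace(cm)    # 65 + 25 = 90 correct predictions
total = cm.sum()          # 100 test samples in all
print(correct / total)    # 0.9, matching accuracy_score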

Visualizing the test set result


Next, we will visualize the test set results produced by the Naïve Bayes classifier. Below is the code for it:
# Visualizing the test set result
import matplotlib.pyplot as plt
import numpy as np

# Define function to plot decision regions
def plot_decision_regions(X, y, classifier, resolution=0.02):
    markers = ('s', 'x', 'o', '^', 'v')
    colors = ('red', 'blue', 'lightgreen', 'gray', 'cyan')
    cmap = plt.get_cmap('Pastel2')

    x1_min, x1_max = X[:, 0].min() - 1, X[:, 0].max() + 1
    x2_min, x2_max = X[:, 1].min() - 1, X[:, 1].max() + 1
    xx1, xx2 = np.meshgrid(np.arange(x1_min, x1_max, resolution),
                           np.arange(x2_min, x2_max, resolution))
    Z = classifier.predict(np.array([xx1.ravel(), xx2.ravel()]).T)
    Z = Z.reshape(xx1.shape)
    plt.contourf(xx1, xx2, Z, alpha=0.4, cmap=cmap)
    plt.xlim(xx1.min(), xx1.max())
    plt.ylim(xx2.min(), xx2.max())
    for idx, cl in enumerate(np.unique(y)):
        plt.scatter(x=X[y == cl, 0], y=X[y == cl, 1],
                    alpha=0.8, c=[colors[idx]],
                    marker=markers[idx], label=cl)

# Plot decision regions (assuming only two features)
if X_test.shape[1] == 2:
    plt.figure(figsize=(10, 6))
    plot_decision_regions(X_test.values, y_test, classifier=classifier)
    plt.title('Naive Bayes - Test set')
    plt.xlabel('Feature 1')
    plt.ylabel('Feature 2')
    plt.legend(loc='upper right')
    plt.show()
else:
    print("Cannot visualize decision regions as the dataset has more than two features.")



Output:

In the above output, we can see that the Naïve Bayes classifier has separated the data points with a fine boundary. The boundary is smooth because we used the GaussianNB classifier, which models each feature with a Gaussian distribution per class.
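Because GaussianNB fits one Gaussian per feature and class, the learned parameters can be inspected directly; note that recent scikit-learn releases expose the variances as var_, while older releases used sigma_:

# Inspect the per-class Gaussians learned by GaussianNB.
print(classifier.theta_)  # per-class feature means
print(classifier.var_)    # per-class feature variances (sigma_ in older versions)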
CONCLUSION
