
Name: Dhruv Jayant Tillu    Roll No.: 6107
Subject: 510302 - BDS

ASSIGNMENT: 01
Aim: Implement the Naive Bayes algorithm in Java/Python/R to classify a dataset from the UCI repository (do not
use built-in functions for Naive Bayes). Compare the performance of your implementation with the Naive Bayes
classifier from the Weka tool/R/Python. Present the confusion matrix for each classifier. For measuring
performance, use at least five metrics such as accuracy, precision, recall, F-measure, etc.

Requirements:
• Software: PyCharm Professional
• Libraries: Pandas, Scikit-Learn, Seaborn, Matplotlib, and NumPy
• Dataset: Iris dataset from UCI repository.

Theory: Naive Bayes is a probabilistic classifier based on Bayes' Theorem, assuming conditional independence
between features. It computes the posterior probability of each class by combining the class prior with the
likelihood of the observed feature values. Despite its simplicity, Naive Bayes is highly effective for
classification tasks, especially in text classification and medical diagnosis, due to its efficiency and
reasonable accuracy even on small datasets.
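
In symbols (a standard statement of the Gaussian Naive Bayes decision rule, added here for clarity; the notation is mine, not from the assignment brief), the predicted class for a sample x = (x_1, ..., x_n) is:

\hat{y} = \arg\max_{c}\Big[\log P(c) + \sum_{i=1}^{n} \log \mathcal{N}(x_i \mid \mu_{c,i}, \sigma_{c,i}^2)\Big],
\qquad
\mathcal{N}(x \mid \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}}\exp\!\Big(-\frac{(x-\mu)^2}{2\sigma^2}\Big)

where P(c) is the class prior and the per-class means and variances are estimated from the training data, exactly as the fit method below does (the code adds a small eps for numerical stability).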

Code:
import pandas as pd
from sklearn.model_selection import train_test_split

# Load the Iris dataset from the UCI repository
url = "https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data"
names = ['sepal_length', 'sepal_width', 'petal_length', 'petal_width', 'class']
dataset = pd.read_csv(url, names=names)

# Split into features/labels, then into training and test sets (80/20)
X = dataset.iloc[:, :-1].values
y = dataset.iloc[:, -1].values
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

import numpy as np
from collections import defaultdict

class NaiveBayes:
    def __init__(self):
        self.classes = None
        self.mean = defaultdict(list)
        self.variance = defaultdict(list)
        self.priors = {}

    def fit(self, X, y):
        # Estimate per-class Gaussian parameters and class priors
        self.classes = np.unique(y)
        for cls in self.classes:
            X_cls = X[y == cls]
            self.mean[cls] = X_cls.mean(axis=0)
            self.variance[cls] = X_cls.var(axis=0)
            self.priors[cls] = X_cls.shape[0] / X.shape[0]

    def calculate_likelihood(self, mean, var, x):
        # Gaussian probability density of each feature value
        eps = 1e-4  # small constant to avoid division by zero
        coeff = 1.0 / np.sqrt(2.0 * np.pi * var + eps)
        exponent = np.exp(-((x - mean) ** 2) / (2 * var + eps))
        return coeff * exponent

    def calculate_posterior(self, x):
        # Log-posterior for each class: log prior + sum of log-likelihoods
        posteriors = []
        for cls in self.classes:
            prior = np.log(self.priors[cls])
            likelihood = np.sum(np.log(self.calculate_likelihood(
                self.mean[cls], self.variance[cls], x)))
            posteriors.append(prior + likelihood)
        return self.classes[np.argmax(posteriors)]

    def predict(self, X):
        # Predict the most probable class for every sample
        y_pred = [self.calculate_posterior(x) for x in X]
        return np.array(y_pred)

# Train the manual implementation
nb_manual = NaiveBayes()
nb_manual.fit(X_train, y_train)
y_pred_manual = nb_manual.predict(X_test)
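
To see what fit estimated, the learned parameters can be printed (an illustrative check, not part of the original submission):

# Inspect the per-class priors and feature means learned by fit()
for cls in nb_manual.classes:
    print(cls, "prior:", round(nb_manual.priors[cls], 2),
          "means:", np.round(nb_manual.mean[cls], 2))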

from sklearn.naive_bayes import GaussianNB

# Train scikit-learn's Gaussian Naive Bayes for comparison
nb_sklearn = GaussianNB()
nb_sklearn.fit(X_train, y_train)
y_pred_sklearn = nb_sklearn.predict(X_test)
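
As a quick sanity check before computing metrics (an addition, not in the original code), the two prediction arrays can be compared directly:

# Fraction of test samples on which the two classifiers agree
agreement = np.mean(y_pred_manual == y_pred_sklearn)
print(f"Prediction agreement: {agreement:.2%}")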

from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, confusion_matrix)

def evaluate(y_true, y_pred, model_name):
    # Compute the headline metrics; 'weighted' averaging accounts for class sizes
    accuracy = accuracy_score(y_true, y_pred)
    precision = precision_score(y_true, y_pred, average='weighted')
    recall = recall_score(y_true, y_pred, average='weighted')
    f1 = f1_score(y_true, y_pred, average='weighted')
    cm = confusion_matrix(y_true, y_pred)

    print(f"Evaluation for {model_name}:")
    print(f"Accuracy: {accuracy:.2f}")
    print(f"Precision: {precision:.2f}")
    print(f"Recall: {recall:.2f}")
    print(f"F1 Score: {f1:.2f}")
    print(f"Confusion Matrix:\n{cm}\n")
    return accuracy, precision, recall, f1, cm

# Evaluate both models
metrics_manual = evaluate(y_test, y_pred_manual, "Manual Naive Bayes")
metrics_sklearn = evaluate(y_test, y_pred_sklearn, "Scikit-Learn Naive Bayes")
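
The aim calls for at least five metrics; one way to add a fifth (a sketch added here, not part of the original submission) is Cohen's kappa via scikit-learn's cohen_kappa_score, which measures agreement with the true labels corrected for chance:

from sklearn.metrics import cohen_kappa_score

# Cohen's kappa as a fifth performance metric for each classifier
kappa_manual = cohen_kappa_score(y_test, y_pred_manual)
kappa_sklearn = cohen_kappa_score(y_test, y_pred_sklearn)
print(f"Cohen's kappa (manual): {kappa_manual:.2f}")
print(f"Cohen's kappa (sklearn): {kappa_sklearn:.2f}")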

Evaluation for Manual Naive Bayes:
Accuracy: 1.00
Precision: 1.00
Recall: 1.00
F1 Score: 1.00
Confusion Matrix:
[[10  0  0]
 [ 0  9  0]
 [ 0  0 11]]

Evaluation for Scikit-Learn Naive Bayes:
Accuracy: 1.00
Precision: 1.00
Recall: 1.00
F1 Score: 1.00
Confusion Matrix:
[[10  0  0]
 [ 0  9  0]
 [ 0  0 11]]

# Plot the confusion matrix for both models
import matplotlib.pyplot as plt
import seaborn as sns

sns.heatmap(metrics_manual[4], annot=True, cmap='Blues', fmt='g')
plt.title("Confusion Matrix for Manual Naive Bayes")
plt.xlabel("Predicted")
plt.ylabel("True")
plt.show()

plt.figure()  # start a fresh figure for the second plot
sns.heatmap(metrics_sklearn[4], annot=True, cmap='Blues', fmt='g')
plt.title("Confusion Matrix for Scikit-Learn Naive Bayes")
plt.xlabel("Predicted")
plt.ylabel("True")
plt.show()

Conclusion: Naive Bayes is a simple yet powerful probabilistic classifier that assumes conditional independence
between features. By combining class priors with feature likelihoods via Bayes' Theorem, it is particularly
effective in text classification and medical diagnosis thanks to its efficiency and reasonable accuracy. On the
Iris test set, the manual implementation and scikit-learn's GaussianNB produced identical confusion matrices and
perfect scores on all reported metrics, confirming that the from-scratch implementation matches the library
classifier.
