MLT Lab 06

The document outlines a practical assignment for using the Naïve Bayesian Classifier to classify a set of documents, detailing the steps for text preprocessing, training, prediction, and evaluation metrics such as accuracy, precision, and recall. It explains the theoretical foundation of the classifier based on Bayes' Theorem and includes source code for implementation in Python using libraries like pandas and sklearn. Key assumptions of the model are also discussed, emphasizing the independence of features and the need for a representative training dataset.

Uploaded by

ponete3977

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views3 pages

MLT Lab 06

Uploaded by

ponete3977

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Name – Amit Shukla

Roll No. – 2200971640010

Branch – AIML
Subject – Machine Learning Technique Lab

Practical-06

AIM - Assuming a set of documents that need to be classified, use the naïve Bayesian Classifier model to
perform this task. Built-in Java classes/API can be used to write the program. Calculate the accuracy,
precision, and recall for your data set

Theory:- The Naïve Bayesian Classifier is a probabilis c machine learning model used for text classifica on
tasks, such as spam detec on or sen ment analysis. It is based on Bayes' Theorem, with the "naïve" assump
on that all features (words in a document) are independent of each other given the class label. Despite this
simplifica on, it performs remarkably well in prac cal applica ons.

Key Concepts:
 Bayes’ Theorem:
It provides a way to calculate the probability of a hypothesis given the evidence.

Prior Probability P(H):

Probability of a class (e.g., posi ve or nega ve) before seeing the data.
Likelihood P(E∣H):
Probability of observing a word in a document, given the class.
Posterior Probability P(H∣E):
Final probability of the class given the observed features (words).
Feature Independence Assump on:
Assumes each word in the document contributes independently to the class probability.

How the Naïve Bayesian Classifier Works for Document Classifica on:
1. Preprocess the Text:
Convert documents into tokens (words), remove stopwords, and vectorize the data using techniques
like Bag of Words or TF-IDF.
2. Training Phase:
Use the training documents and their labels to calculate the prior and likelihood probabili es for
each class.
3. Predic on Phase:
For a new/unseen document, compute the posterior probability for each class, and assign the class
with the highest probability.
4. Evalua on:
Use metrics such as Accuracy, Precision, and Recall to evaluate model performance.

Assump ons of Naïve Bayesian Classifier:

• The features (words) are condi onally independent given the class.

• The training dataset is representa ve of the real-world distribu on.

• The input text is already preprocessed (cleaned and vectorized).

Source Code :-

import pandas as pd
msg = pd.read_csv('/content/sample_data/document.csv', names=['message', 'label'])
print("Total Instances of Dataset: ", msg.shape[0]) msg['labelnum'] =
msg.label.map({'pos': 1, 'neg': 0})

X = msg.message
y = msg.labelnum
from sklearn.model_selec on import train_test_split Xtrain,
Xtest, ytrain, ytest = train_test_split(X, y)
from sklearn.feature_extrac on.text import CountVectorizer

count_v = CountVectorizer()
Xtrain_dm = count_v.fit_transform(Xtrain)
Xtest_dm = count_v.transform(Xtest)
……………………………………………………………………………

df = pd.DataFrame(Xtrain_dm.toarray(), columns=count_v.get_feature_names_out())
print(df[0:5])
from sklearn.naive_bayes import Mul nomialNB clf = Mul nomialNB()
clf.fit(Xtrain_dm, ytrain)
pred = clf.predict(Xtest_dm)
…………………………………………………
………………………… for doc, p in
zip(Xtrain, pred): p = 'pos' if p == 1 else 'neg'
print("%s -> %s" % (doc, p))

from
sklearn.metrics import accuracy_score, confusion_matrix, precision_score, recall_score
print('Accuracy Metrics: \n') print('Accuracy: ', accuracy_score(ytest, pred)) print('Recall: ',
recall_score(ytest, pred)) print('Precision: ', precision_score(ytest, pred))
print('Confusion Matrix: \n', confusion_matrix(ytest, pred))

05 Text Classification - Naive Bayes
No ratings yet
05 Text Classification - Naive Bayes
64 pages
Naive Bayes Algorithm For Classification Tasks: Sana Badagan 1MS24RAI09
No ratings yet
Naive Bayes Algorithm For Classification Tasks: Sana Badagan 1MS24RAI09
31 pages
Metalsa Supplier Manual Rev 4 1
No ratings yet
Metalsa Supplier Manual Rev 4 1
58 pages
Lec 09
No ratings yet
Lec 09
50 pages
BAI601 Module 3 PDF
No ratings yet
BAI601 Module 3 PDF
19 pages
Lec 09
No ratings yet
Lec 09
50 pages
M. Ali Asdar Departement of Pulmonology and Respiratory Medicine Faculty of Medicine University of Indonesia - Persahabatan General Hospital Jakarta
No ratings yet
M. Ali Asdar Departement of Pulmonology and Respiratory Medicine Faculty of Medicine University of Indonesia - Persahabatan General Hospital Jakarta
30 pages
Paper I Telugu 8th Jan 2025 Shift 1
No ratings yet
Paper I Telugu 8th Jan 2025 Shift 1
88 pages
Lecture13 Nbayes
No ratings yet
Lecture13 Nbayes
56 pages
NLP NB
No ratings yet
NLP NB
52 pages
AIML - Ex.3 Manual
No ratings yet
AIML - Ex.3 Manual
4 pages
Lecture3 Linear Classifiers
No ratings yet
Lecture3 Linear Classifiers
36 pages
16 - Naïve Bayes Classifier
No ratings yet
16 - Naïve Bayes Classifier
21 pages
Naive Bayes Classifier in Machine Learning Javatpoint
No ratings yet
Naive Bayes Classifier in Machine Learning Javatpoint
23 pages
24 Shivangi DMDW
No ratings yet
24 Shivangi DMDW
12 pages
Naive Bayes
No ratings yet
Naive Bayes
12 pages
Naive Bayes Explanation Cleaned
No ratings yet
Naive Bayes Explanation Cleaned
2 pages
Ame: Waqar Ali
No ratings yet
Ame: Waqar Ali
22 pages
Naïve Bayes Classifier Algorithm
No ratings yet
Naïve Bayes Classifier Algorithm
24 pages
Naïve Bayes Classifier
No ratings yet
Naïve Bayes Classifier
8 pages
Naive Bayes Algorithm
No ratings yet
Naive Bayes Algorithm
46 pages
Naive Bayes Classifier Presentation
No ratings yet
Naive Bayes Classifier Presentation
10 pages
Cryptography
No ratings yet
Cryptography
201 pages
Naïve Bayes Classifier
No ratings yet
Naïve Bayes Classifier
16 pages
Vamshi ml-4
No ratings yet
Vamshi ml-4
3 pages
UNIT 2 AAM Notes
No ratings yet
UNIT 2 AAM Notes
38 pages
Practical 3
No ratings yet
Practical 3
11 pages
Machine Ass
No ratings yet
Machine Ass
33 pages
77 4001 StaSaf
No ratings yet
77 4001 StaSaf
20 pages
Naive Bayes Classifier Overview
No ratings yet
Naive Bayes Classifier Overview
7 pages
Guidanc CTspection
No ratings yet
Guidanc CTspection
17 pages
07 Naive Bayes
No ratings yet
07 Naive Bayes
6 pages
Ai&Ml Lab: Dept of CSE, SUK
No ratings yet
Ai&Ml Lab: Dept of CSE, SUK
3 pages
Large Scale Production Fermenter Design
No ratings yet
Large Scale Production Fermenter Design
15 pages
PIL - 3rd Sem LLB
No ratings yet
PIL - 3rd Sem LLB
68 pages
Myppt
No ratings yet
Myppt
14 pages
LM3 - Naive Bayes Model
No ratings yet
LM3 - Naive Bayes Model
21 pages
Naïve Bayes Classifier
No ratings yet
Naïve Bayes Classifier
18 pages
ML Lab Experiments (1) - Pages-3
No ratings yet
ML Lab Experiments (1) - Pages-3
11 pages
CSL0777 L24
No ratings yet
CSL0777 L24
38 pages
Equlibrium
No ratings yet
Equlibrium
20 pages
ML CLassification Naive Bayes
No ratings yet
ML CLassification Naive Bayes
6 pages
3.1 Tuple Relational Calculus
No ratings yet
3.1 Tuple Relational Calculus
11 pages
Photoluminescence FBG
No ratings yet
Photoluminescence FBG
13 pages
Mapping Pulling Cable Grounding System
No ratings yet
Mapping Pulling Cable Grounding System
1 page
NOTES
No ratings yet
NOTES
15 pages
Secret of Anti-Aging Anti-Aging Food Con
No ratings yet
Secret of Anti-Aging Anti-Aging Food Con
5 pages
Lab7&8 NaiveBayes
No ratings yet
Lab7&8 NaiveBayes
5 pages
DC-6 Om
100% (4)
DC-6 Om
522 pages
Lab2 - Bayes Classification
No ratings yet
Lab2 - Bayes Classification
4 pages
Naive Bayes Classifier: Fundamentals and Applications
From Everand
Naive Bayes Classifier: Fundamentals and Applications
Fouad Sabry
No ratings yet
FICM Unit 3
No ratings yet
FICM Unit 3
6 pages
4as Tle7 LC4
No ratings yet
4as Tle7 LC4
5 pages
NB Slides
No ratings yet
NB Slides
29 pages
Naive Bayes
No ratings yet
Naive Bayes
9 pages
Naive Bayes
No ratings yet
Naive Bayes
4 pages
Financial Kake Da Hotel (N)
No ratings yet
Financial Kake Da Hotel (N)
10 pages
DWM Exp 4-2
No ratings yet
DWM Exp 4-2
4 pages
Prac4 AAM
No ratings yet
Prac4 AAM
2 pages
Mechine Learning
No ratings yet
Mechine Learning
7 pages
Naive Bayes Classifier in Machine Learning
No ratings yet
Naive Bayes Classifier in Machine Learning
16 pages
Risk Assessment Table New Version
No ratings yet
Risk Assessment Table New Version
4 pages
Introduction To Soil Ecology
No ratings yet
Introduction To Soil Ecology
15 pages
6d7701 - Bayesean Classifer
No ratings yet
6d7701 - Bayesean Classifer
8 pages
Unit2 - 5 - Part 2
No ratings yet
Unit2 - 5 - Part 2
1 page
Naive Bates Classifier
No ratings yet
Naive Bates Classifier
18 pages
Practical-3 Ritesh
No ratings yet
Practical-3 Ritesh
5 pages
How Human Behaviour Amplifies The Bullwhip Effect A Study Based On The Beer Distribution Game Online
No ratings yet
How Human Behaviour Amplifies The Bullwhip Effect A Study Based On The Beer Distribution Game Online
12 pages
Tackling The Poor Assumptions of Naive Bayes Text Classifiers
No ratings yet
Tackling The Poor Assumptions of Naive Bayes Text Classifiers
8 pages
Lab5 NaiveBayes Full
No ratings yet
Lab5 NaiveBayes Full
5 pages
Experiment No 6
No ratings yet
Experiment No 6
3 pages
Naive Bayes Classifier in Machine Learning - Javatpoint
No ratings yet
Naive Bayes Classifier in Machine Learning - Javatpoint
19 pages
Naive Bayes Classifiers - Parta
No ratings yet
Naive Bayes Classifiers - Parta
17 pages
Naïve Bayes Classifier Algorithm
No ratings yet
Naïve Bayes Classifier Algorithm
11 pages
New Microsoft Office Word Document
No ratings yet
New Microsoft Office Word Document
6 pages
Naïve Bayes
No ratings yet
Naïve Bayes
15 pages
Data Security
No ratings yet
Data Security
13 pages
ASIC Implementation of Efficient 16-Parallel Fast FIR Algorithm Filter Structure
No ratings yet
ASIC Implementation of Efficient 16-Parallel Fast FIR Algorithm Filter Structure
5 pages
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
Hazard Identification: 2. Risk Analysis/Evaluation 3. Risk Control
No ratings yet
Hazard Identification: 2. Risk Analysis/Evaluation 3. Risk Control
2 pages
Navies Bayes
No ratings yet
Navies Bayes
18 pages
Keralauniversity of Fisheries & Ocean Studies: Panangad P.O., Kochi 682 506, Kerala, India
No ratings yet
Keralauniversity of Fisheries & Ocean Studies: Panangad P.O., Kochi 682 506, Kerala, India
13 pages
Naive Bayes Classification
100% (3)
Naive Bayes Classification
10 pages
ARINC Meteorological Data Collection and Reporting System (MDCRS)
No ratings yet
ARINC Meteorological Data Collection and Reporting System (MDCRS)
16 pages
PHP Yii JSP Servlet - 2 - Md. Shibly Forkani
No ratings yet
PHP Yii JSP Servlet - 2 - Md. Shibly Forkani
4 pages
An Approach of The Naive Bayes Classifier For The Document Classification
No ratings yet
An Approach of The Naive Bayes Classifier For The Document Classification
4 pages
Na Ive Bayes Classifier
No ratings yet
Na Ive Bayes Classifier
3 pages
The Act
No ratings yet
The Act
2 pages
Quick Start Guide: Register Your Product and Get Support at
No ratings yet
Quick Start Guide: Register Your Product and Get Support at
6 pages
Material Test Report: Cse. Chiang Sung Enterprise Co., LTD
No ratings yet
Material Test Report: Cse. Chiang Sung Enterprise Co., LTD
3 pages

MLT Lab 06

Uploaded by

MLT Lab 06

Uploaded by

Name – Amit Shukla

Roll No. – 2200971640010

Prior Probability P(H):

Assump ons of Naïve Bayesian Classifier:

• The training dataset is representa ve of the real-world distribu on.

• The input text is already preprocessed (cleaned and vectorized).

You might also like