dwm_06
Theory:
The Decision Tree classifier uses a flowchart-like structure where internal nodes represent a "test" on an
attribute, branches represent the outcome of the test, and each leaf node represents a class label (decision
outcome).
ID3 algorithm:
The ID3 (Iterative Dichotomiser 3) algorithm is a classic decision tree algorithm for classification tasks. It
builds a decision tree by recursively partitioning the dataset into smaller and smaller subsets until all data
points in each subset belong to the same class. It employs a top-down approach, recursively selecting
features to split the dataset based on information gain.
The algorithm works by selecting the attribute that best classifies the training data using a metric called
"Information Gain." The attribute with the highest Information Gain is chosen as the root node, and this
process is repeated recursively for each branch until the tree fully represents the data or satisfies a stopping
criterion.
● Information Gain: The reduction in entropy achieved by partitioning the data according to a
particular attribute. The attribute with the highest Information Gain is selected for a decision node:

Entropy(S) = - Σ_i p_i · log2(p_i)
Gain(S, A) = Entropy(S) - Σ_{v ∈ Values(A)} (|Sv| / |S|) · Entropy(Sv)

(S is the original set of examples, p_i is the proportion of examples in S that belong to class i, A is the
attribute being tested, and Sv is the subset of S for which attribute A has value v.)
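To make the metric concrete, here is a minimal sketch of the information-gain computation and the ID3 recursion on a tiny made-up table (the column names and values are illustrative, not taken from the lab dataset):

import math
from collections import Counter

def entropy(labels):
    # Shannon entropy of a list of class labels
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    # Gain(S, A) = Entropy(S) - sum over v of (|Sv|/|S|) * Entropy(Sv)
    n, gain = len(labels), entropy(labels)
    for v in set(row[attr] for row in rows):
        sv = [lbl for row, lbl in zip(rows, labels) if row[attr] == v]
        gain -= (len(sv) / n) * entropy(sv)
    return gain

def id3(rows, labels, attrs):
    # Recursively build a tree as nested dicts; leaves are class labels
    if len(set(labels)) == 1:                        # pure subset -> leaf
        return labels[0]
    if not attrs:                                    # no attributes left -> majority leaf
        return Counter(labels).most_common(1)[0][0]
    best = max(attrs, key=lambda a: information_gain(rows, labels, a))
    tree = {best: {}}
    for v in set(row[best] for row in rows):
        keep = [(r, l) for r, l in zip(rows, labels) if r[best] == v]
        tree[best][v] = id3([r for r, _ in keep], [l for _, l in keep],
                            [a for a in attrs if a != best])
    return tree

# Toy data (made-up): attribute 0 = Income_Group, attribute 1 = Age_Group
rows   = [('High', 'Young'), ('High', 'Old'), ('Low', 'Young'), ('Low', 'Old')]
labels = ['High', 'High', 'Low', 'Low']
print(information_gain(rows, labels, 0))   # 1.0 -- Income_Group separates the classes perfectly
print(information_gain(rows, labels, 1))   # 0.0 -- Age_Group carries no information here
print(id3(rows, labels, [0, 1]))           # {0: {'High': 'High', 'Low': 'Low'}}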
Step 1: Data Preparation- The dataset is read using pandas and the target variable (Spending_Score) is
separated from the features. The categorical columns are defined for encoding.
Step 2: Encoding Categorical Features- The categorical features are converted to numerical format to
prepare the data for the Decision Tree algorithm using LabelEncoder from the sklearn.preprocessing
module.
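As a quick illustration of what this encoding step produces (the category strings below are made up for demonstration), LabelEncoder assigns each distinct value an integer code, sorted alphabetically, and keeps the mapping in classes_:

from sklearn.preprocessing import LabelEncoder

le = LabelEncoder()
codes = le.fit_transform(['Low', 'High', 'Average', 'High'])  # hypothetical values
print(codes)                        # [2 1 0 1]
print(le.classes_)                  # ['Average' 'High' 'Low']; code i decodes to classes_[i]
print(le.inverse_transform(codes))  # recovers the original strings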
Step 3: Splitting the Data- The dataset is split into training and test sets using train_test_split with 70%
of the data used for training and 30% for testing.
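The split itself is a single call; a sketch with the same variable names as the main listing (random_state and stratify are optional additions, not requirements of the experiment):

from sklearn.model_selection import train_test_split

# 70/30 split; random_state fixes the shuffle for reproducible runs, and
# stratify=y keeps the class proportions similar in both sets (optional).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y)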
Step 4: Model Training- A DecisionTreeClassifier is created with criterion='entropy', so that splits are
chosen by entropy and information gain as in ID3 (scikit-learn's trees are CART-based and binary, but this
criterion mirrors ID3's split selection). The model is trained using the training dataset.
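Once fitted, the tree object exposes a few useful diagnostics; a short sketch, assuming clf has been trained as in the Code section below:

# Assumes clf = DecisionTreeClassifier(criterion='entropy').fit(X_train, y_train)
print(clf.get_depth(), clf.get_n_leaves())    # size of the learned tree
for name, imp in zip(X_train.columns, clf.feature_importances_):
    print(name, round(imp, 3))                # impurity-based feature importances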
Step 5: Model Prediction and Evaluation- The trained model predicts the target values for the test dataset.
The accuracy of the model is then evaluated using accuracy_score.
Code:
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import accuracy_score
from sklearn.preprocessing import LabelEncoder
import matplotlib.pyplot as plt

# Step 1: Data preparation - separate the target variable from the features
df = pd.read_csv('final_customer.csv')
X = df.drop(['Spending_Score'], axis=1)
y = df['Spending_Score']
print(X.info())

# Step 2: Encode the categorical features (and the target) as integers
categorical_columns = X.columns
label_encoders = {}
for col in categorical_columns:
    label_encoders[col] = LabelEncoder()
    X[col] = label_encoders[col].fit_transform(X[col])
label_encoders['Spending_Score'] = LabelEncoder()
y = label_encoders['Spending_Score'].fit_transform(y)

# Step 3: 70/30 train-test split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Step 4: Train an entropy-based (ID3-style) decision tree
clf = DecisionTreeClassifier(criterion='entropy')
clf.fit(X_train, y_train)

# Step 5: Predict on the test set and evaluate accuracy
y_pred = clf.predict(X_test)
print('Accuracy:', accuracy_score(y_test, y_pred))

# Visualise the top three levels of the learned tree
plt.figure(figsize=(20, 10))
plot_tree(clf, max_depth=3, filled=True, feature_names=X_train.columns,
          class_names=label_encoders['Spending_Score'].classes_, rounded=True)
plt.show()
Output:
Conclusion:
In this experiment, we implemented a Decision Tree using the ID3 approach, which leverages entropy and
information gain to select the attribute that best splits the data at each node, beginning with the root. Using
this approach, we developed a decision tree to predict customer spending scores based on several factors,
such as gender, age group, profession, income group, family size, and work experience.