dev id3.ipynb - Colab

The document outlines the ID3 algorithm for building decision trees, emphasizing the calculation of entropy and information gain to select the best attributes. It provides a step-by-step guide to manually create a decision tree using a dataset and includes a Python implementation using the sklearn library to train and visualize the tree. Additionally, it demonstrates how to test the decision tree with a sample query and presents the classification report for model evaluation.

Experiment-8

1. State the ID3 algorithm for decision trees
2. Create a decision tree for the given dataset on paper using ID3
3. Implement the ID3 algorithm in Python for the same dataset
4. Visualise the decision tree
5. Test/validate the tree on a sample query

1. ID3 Algorithm (Conceptual Summary)

ID3 (Iterative Dichotomiser 3) builds a decision tree by selecting the feature with the highest Information Gain at each node.

Steps:

1. Calculate Entropy of the dataset.

2. For each attribute, calculate the Information Gain:

Information Gain = Entropy(Parent) − Σ ( |Subset| / |Total| × Entropy(Subset) )

3. Choose the attribute with the highest Information Gain.

4. Repeat recursively for each subset until:

- All samples belong to one class, or
- No attributes are left
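
The following is a minimal Python sketch of steps 1 and 2, assuming the data lives in a pandas DataFrame of categorical columns; entropy and information_gain are illustrative helper names, not library functions:

import numpy as np
import pandas as pd

def entropy(labels: pd.Series) -> float:
    # Entropy of a class column: -sum(p * log2(p)) over the class proportions
    p = labels.value_counts(normalize=True)
    return float(-(p * np.log2(p)).sum())

def information_gain(df: pd.DataFrame, attribute: str, target: str) -> float:
    # Parent entropy minus the size-weighted entropy of each attribute subset
    weighted = sum(
        len(subset) / len(df) * entropy(subset[target])
        for _, subset in df.groupby(attribute)
    )
    return entropy(df[target]) - weighted

Called on the raw (string-valued) PlayTennis DataFrame loaded below, information_gain(df, 'Outlook', 'Play Tennis') reproduces the hand-computed gain for Outlook.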

2. Manually Solving (on Paper)

You can build the decision tree on paper using the following steps:

- Start with the entropy of the full dataset (target = "PlayTennis")
- For each attribute (Outlook, Humidity, etc.), compute the information gain
- Choose the attribute with the maximum gain as the root
- Repeat for each branch (subset) until every subset is fully classified
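
For the 14-row PlayTennis dataset used below (9 Yes, 5 No), the root-level calculation works out as:

Entropy(S) = −(9/14)·log2(9/14) − (5/14)·log2(5/14) ≈ 0.940

Gain(Outlook) = 0.940 − (5/14)·0.971 − (4/14)·0 − (5/14)·0.971 ≈ 0.247

(Sunny has 2 Yes/3 No, Overcast 4 Yes/0 No, Rain 3 Yes/2 No.) The remaining gains are smaller (Humidity ≈ 0.152, Wind ≈ 0.048, Temperature ≈ 0.029), so Outlook becomes the root.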

import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import classification_report
import matplotlib.pyplot as plt

# Load the dataset
df = pd.read_csv("/content/PlayTennis.csv")
print("Dataset:\n", df)

# Encode categorical features as integers
label_encoders = {}
for col in df.columns:
    if df[col].dtype == 'object':
        le = LabelEncoder()
        df[col] = le.fit_transform(df[col])
        label_encoders[col] = le

# Features and target
X = df.drop('Play Tennis', axis=1)
y = df['Play Tennis']

Dataset:
Outlook Temperature Humidity Wind Play Tennis
0 Sunny Hot High Weak No
1 Sunny Hot High Strong No
2 Overcast Hot High Weak Yes
3 Rain Mild High Weak Yes
4 Rain Cool Normal Weak Yes
5 Rain Cool Normal Strong No
6 Overcast Cool Normal Strong Yes
7 Sunny Mild High Weak No
8 Sunny Cool Normal Weak Yes
9 Rain Mild Normal Weak Yes
10 Sunny Mild Normal Strong Yes
11 Overcast Mild High Strong Yes
12 Overcast Hot Normal Weak Yes
13 Rain Mild High Strong No
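
As an optional sanity check (not part of the original notebook), you can inspect how each column was encoded; LabelEncoder assigns integers alphabetically, e.g. Outlook becomes Overcast → 0, Rain → 1, Sunny → 2:

for col, le in label_encoders.items():
    # Map each original category to its integer code
    print(col, dict(zip(le.classes_, le.transform(le.classes_))))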

# Train the Decision Tree using ID3-style splits (entropy criterion)
clf = DecisionTreeClassifier(criterion="entropy", random_state=0)
clf.fit(X, y)

# Visualize the decision tree
plt.figure(figsize=(12, 8))
plot_tree(clf, feature_names=X.columns, class_names=label_encoders['Play Tennis'].classes_)
plt.title("Decision Tree using ID3 (Entropy)")
plt.show()

# Predict on the training data (or a query below)
y_pred = clf.predict(X)
print("\nClassification Report:\n", classification_report(y, y_pred, target_names=label_encoders['Play Tennis'].classes_))

Classification Report:
              precision    recall  f1-score   support

          No       1.00      1.00      1.00         5
         Yes       1.00      1.00      1.00         9

    accuracy                           1.00        14
   macro avg       1.00      1.00      1.00        14
weighted avg       1.00      1.00      1.00        14

The scores are perfect because the model is evaluated on the same data it was trained on; unseen queries would not necessarily be classified correctly.
# A sample query to test the tree
query = {
    'Outlook': 'Sunny',
    'Temperature': 'Cool',
    'Humidity': 'High',
    'Wind': 'Strong'
}

# Encode the query with the same label encoders, then predict
query_encoded = [label_encoders[col].transform([query[col]])[0] for col in X.columns]
prediction = clf.predict([query_encoded])
predicted_label = label_encoders['Play Tennis'].inverse_transform(prediction)

print("Prediction for query:", predicted_label[0])

Prediction for query: No


/usr/local/lib/python3.11/dist-packages/sklearn/utils/validation.py:2739: UserWarning
  warnings.warn(

(This UserWarning appears because clf was fitted on a DataFrame with named columns, while predict received a plain Python list without feature names; it does not affect the prediction.)
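
One optional way to avoid the warning (not part of the original notebook) is to pass the encoded query as a one-row DataFrame so the feature names match those seen during fit:

query_df = pd.DataFrame([query_encoded], columns=X.columns)  # align column names with the training data
print("Prediction for query:", label_encoders['Play Tennis'].inverse_transform(clf.predict(query_df))[0])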

