Implement the ID3 Algorithm

Step 1: Prepare the PlayTennis Dataset

The full dataset (14 examples) looks like this:


    Outlook Temperature Humidity   Wind PlayTennis
 0    Sunny         Hot     High   Weak         No
 1    Sunny         Hot     High Strong         No
 2 Overcast         Hot     High   Weak        Yes
 3     Rain        Mild     High   Weak        Yes
 4     Rain        Cool   Normal   Weak        Yes
 5     Rain        Cool   Normal Strong         No
 6 Overcast        Cool   Normal Strong        Yes
 7    Sunny        Mild     High   Weak         No
 8    Sunny        Cool   Normal   Weak        Yes
 9     Rain        Mild   Normal   Weak        Yes
10    Sunny        Mild   Normal Strong        Yes
11 Overcast        Mild     High Strong        Yes
12 Overcast         Hot   Normal   Weak        Yes
13     Rain        Mild     High Strong         No

Step 2: Implement the ID3 Algorithm


In [8]: import pandas as pd
import numpy as np

# PlayTennis dataset (the full 14 examples shown in the table above)
data = {
    'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rain', 'Rain', 'Rain', 'Overcast',
                'Sunny', 'Sunny', 'Rain', 'Sunny', 'Overcast', 'Overcast', 'Rain'],
    'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool',
                    'Mild', 'Cool', 'Mild', 'Mild', 'Mild', 'Hot', 'Mild'],
    'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal',
                 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal', 'High'],
    'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong',
             'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Strong'],
    'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes',
                   'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No']
}

df = pd.DataFrame(data)

In [9]: # Step 1: Calculate Entropy

def entropy(target_column):
    # Entropy of a target column: -sum over classes of p_i * log2(p_i)
    elements, counts = np.unique(target_column, return_counts=True)
    entropy_value = sum(
        (-counts[i] / np.sum(counts)) * np.log2(counts[i] / np.sum(counts))
        for i in range(len(elements))
    )
    return entropy_value
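
As a quick illustrative check (an added cell, not in the original notebook): the PlayTennis column has 9 'Yes' and 5 'No' examples, so its entropy should come out to about 0.940 bits:

In [ ]: # Illustrative check: -(9/14)*log2(9/14) - (5/14)*log2(5/14) ≈ 0.940
print(round(entropy(df['PlayTennis']), 3))  # expected: 0.94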

In [10]: # Step 2: Calculate Information Gain

def information_gain(data, feature, target):
    # Information gain = total entropy of the target minus the
    # weighted entropy of the subsets produced by splitting on the feature
    total_entropy = entropy(data[target])
    values, counts = np.unique(data[feature], return_counts=True)
    weighted_entropy = 0
    for i in range(len(values)):
        subset = data[data[feature] == values[i]]
        weighted_entropy += (counts[i] / np.sum(counts)) * entropy(subset[target])

    return total_entropy - weighted_entropy
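
To see which attribute ID3 will pick first, the gain of every feature can be printed directly (an added illustrative cell; the expected values are the well-known ones for this dataset):

In [ ]: # Illustrative check: information gain of each feature on the full dataset
for feature in ['Outlook', 'Temperature', 'Humidity', 'Wind']:
    print(feature, round(information_gain(df, feature, 'PlayTennis'), 3))
# Expected: Outlook 0.247, Temperature 0.029, Humidity 0.152, Wind 0.048,
# so Outlook is chosen at the root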

In [15]: # Step 3: ID3 Algorithm to Build Tree

def id3(data, target, features):
    # Base case 1: all examples share one class -> return that class
    if len(np.unique(data[target])) == 1:
        return np.unique(data[target])[0]
    # Base case 2: no features left -> return the majority class
    if len(features) == 0:
        values, counts = np.unique(data[target], return_counts=True)
        return values[np.argmax(counts)]

    # Select the feature with the highest information gain
    gains = [information_gain(data, feature, target) for feature in features]
    best_feature = features[np.argmax(gains)]

    # Create a new decision tree node
    tree = {best_feature: {}}

    # Recurse for each value of the best feature
    remaining_features = [f for f in features if f != best_feature]
    for value in np.unique(data[best_feature]):
        subset = data[data[best_feature] == value]
        tree[best_feature][value] = id3(subset, target, remaining_features)

    return tree

# Train the decision tree using ID3
target_column = 'PlayTennis'
features = [col for col in df.columns if col != target_column]
tree = id3(df, target_column, features)
print("Decision Tree:", tree)

Decision Tree: {'Outlook': {'Overcast': 'Yes', 'Rain': {'Wind': {'Strong': 'No', 'Weak': 'Yes'}}, 'Sunny': {'Humidity': {'High': 'No', 'Normal': 'Yes'}}}}
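
The nested dictionary is hard to read on one line; the standard-library pprint module (a cosmetic addition, not in the original notebook) prints it with indentation:

In [ ]: # Optional: a more readable, indented view of the nested-dict tree
from pprint import pprint
pprint(tree)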

Step 3: Testing the Algorithm

Once the tree is built, we can evaluate its performance and verify its inductive bias.

Step 4: Inductive Bias of ID3

The inductive bias of the ID3 algorithm is its preference for splitting on the feature that maximizes information gain. Because high-gain attributes are placed near the root, the algorithm favors shorter trees over deeper ones that fit the data equally well. On the PlayTennis dataset, ID3 chooses Outlook at the root because it has the highest information gain, and uses Humidity and Wind only in the subtrees where they remain informative.

Explanation:

Entropy: Measures the disorder or impurity of the dataset. Higher entropy means more disorder.

Information Gain: Measures the effectiveness of an attribute in classifying the dataset. The attribute with the highest information gain is chosen for the decision tree node.
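
In symbols, the two quantities computed by the code above are the standard definitions:

$$\mathrm{Entropy}(S) = -\sum_{i} p_i \log_2 p_i, \qquad \mathrm{Gain}(S, A) = \mathrm{Entropy}(S) - \sum_{v \in \mathrm{Values}(A)} \frac{|S_v|}{|S|}\,\mathrm{Entropy}(S_v)$$

where $p_i$ is the fraction of examples in class $i$, and $S_v$ is the subset of $S$ on which attribute $A$ takes the value $v$.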

Step 5: Testing the Model

Once the tree is built, you can make predictions on new data by traversing the tree according to the sample's feature values. Here's an example of how you can test the model on a new sample:
In [2]: def predict(tree, sample):
    # Traverse the decision tree: a dict node tests a feature,
    # anything else is a leaf holding the predicted class
    if not isinstance(tree, dict):
        return tree
    feature = list(tree.keys())[0]
    feature_value = sample[feature]
    # Note: a feature value never seen during training would raise a KeyError here
    return predict(tree[feature][feature_value], sample)

# Test the model with a sample
test_sample = {'Outlook': 'Sunny', 'Temperature': 'Hot', 'Humidity': 'High',
               'Wind': 'Weak'}  # 'Wind' value assumed; the Sunny/High branch never tests Wind
prediction = predict(tree, test_sample)
print(f"Prediction for test sample: {prediction}")

Prediction for test sample: No
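
As a final sanity check (an added sketch, not in the original notebook), the tree can be applied back to the training data; because ID3 grows the tree until every leaf is pure and this dataset contains no contradictory rows, it should reproduce all 14 training labels:

In [ ]: # Sanity check: training-set accuracy (shows consistency, not generalization)
correct = sum(
    predict(tree, row.to_dict()) == row['PlayTennis']
    for _, row in df.iterrows()
)
print(f"Training accuracy: {correct}/{len(df)}")  # expected: 14/14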

Conclusion

This implementation provides a basic ID3 algorithm that builds a decision tree using information gain and can be used to predict the target variable (PlayTennis) based on the features. The inductive bias of the ID3 algorithm is revealed through its preference for attributes that maximize information gain, which influences the structure of the tree and the model's generalization.
