PRACTICAL - 6
Experiment: Identify a data set for executing the Decision Tree algorithm, implement it using Python, and analyse the same with cross validation and percentage split.
Aim:
To implement the Decision Tree algorithm using Python on a suitable dataset and analyze its performance
using Cross Validation and Percentage Split techniques, thereby evaluating the model’s accuracy and
generalization capability.
Theory:
Decision Tree
A Decision Tree is a simple, intuitive model that maps out choices and their possible outcomes in a tree-like structure, breaking a complex decision-making process into smaller parts. Each internal node tests an attribute, each branch corresponds to an outcome of that test, and each leaf node gives a final decision or prediction.
It lets users visualize decisions and their consequences clearly and supports decision-making by analyzing the different possible outcomes.
Example:
Suppose you are deciding whether to drink coffee based on the time of day and how tired you are. The root node checks the time: if it is not morning, the tree leads directly to No Coffee. If it is morning, the next node checks tiredness: if tired → Coffee; if not → No Coffee.
This kind of logic flow is exactly how a decision tree operates.
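The same flow can be written as a few nested conditions. A minimal sketch, where the conditions and labels are purely illustrative:

def should_drink_coffee(time_of_day, is_tired):
    """Mirror the coffee decision tree: the root node tests the time,
    the next node tests tiredness, and each return is a leaf."""
    if time_of_day == "morning":   # root node: check the time
        if is_tired:               # internal node: check tiredness
            return "Coffee"        # leaf
        return "No Coffee"         # leaf
    return "No Coffee"             # leaf: not morning

print(should_drink_coffee("morning", True))   # Coffee
print(should_drink_coffee("evening", True))   # No Coffee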
Program:
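The program listing itself is not reproduced here; below is a minimal sketch of one way such a program could look, assuming the Iris dataset bundled with scikit-learn and an 80/20 percentage split:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

# Load a suitable dataset (the Iris dataset is assumed here for illustration)
iris = load_iris()
X, y = iris.data, iris.target

# Percentage split: 80% of the data for training, 20% for testing
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Train the decision tree classifier
clf = DecisionTreeClassifier(criterion="entropy", random_state=42)
clf.fit(X_train, y_train)

# Print the learned tree structure and the accuracy on unseen data
print(export_text(clf, feature_names=iris.feature_names))
print("Test accuracy:", clf.score(X_test, y_test))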
Output:
Analyzing Decision Tree Algorithm using Cross Validation and Percentage Split
To assess the effectiveness and generalization capability of the decision tree model, we use evaluation techniques such as Train-Test Split and Cross Validation. These methods show how well the model performs on unseen data and help detect overfitting or underfitting.
1. Train-Test Split (Percentage Split)
The Train-Test Split is a simple and commonly used method to evaluate model performance. In this
method:
• The dataset is divided into two parts:
  o Training Set: used to train the model (typically 70% or 80% of the data).
  o Testing Set: used to test the model's prediction accuracy on unseen data (the remaining 30% or 20%).
• The model is trained on the training set and then evaluated on the testing set.
• Evaluation metrics such as Accuracy, the Confusion Matrix, and the Classification Report are used.
Advantage: easy and quick to implement.
Limitation: performance can vary depending on how the data is split.
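To make this concrete, here is a short sketch of a percentage-split evaluation reporting the metrics named above, again assuming the Iris data and a 70/30 split:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

X, y = load_iris(return_X_y=True)

# 70/30 percentage split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

clf = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
y_pred = clf.predict(X_test)

# Report the standard evaluation metrics on the held-out test set
print("Accuracy:", accuracy_score(y_test, y_pred))
print("Confusion matrix:\n", confusion_matrix(y_test, y_pred))
print(classification_report(y_test, y_pred))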
2. Cross Validation
Cross Validation is a more robust technique for model evaluation. It reduces the variance associated with
random Train-Test splits by averaging performance across multiple splits.
In k-Fold Cross Validation:
• The dataset is divided into k equal parts (folds).
• The model is trained on k-1 folds and tested on the remaining fold.
• This process is repeated k times, each time with a different fold as the test set.
• The final accuracy is the mean of the accuracies across all folds.
Advantage: provides a more reliable and generalized estimate of model performance.
Limitation: computationally more expensive than a single Train-Test split.
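A minimal sketch of k-fold cross validation for the same classifier, assuming k = 10 and scikit-learn's cross_val_score:

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
clf = DecisionTreeClassifier(random_state=0)

# 10-fold cross validation: train on 9 folds, test on the remaining one,
# repeated 10 times with a different fold serving as the test set each time
scores = cross_val_score(clf, X, y, cv=10)

print("Per-fold accuracies:", scores)
print("Mean accuracy:", scores.mean())
print("Standard deviation:", scores.std())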
Why Analyze with Both?
Using both Train-Test Split and Cross Validation provides a comprehensive understanding of model
performance:
• Train-Test Split shows how the model performs in one specific split scenario.
• Cross Validation checks whether that performance is consistent and generalizable across different subsets of the data.
Combining both methods helps in selecting the best model parameters and ensures that the model is not biased or overfitted to a particular dataset split.
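One common way to combine the two for parameter selection is to tune the tree with cross validation on the training portion and then confirm on the held-out test portion. A sketch assuming scikit-learn's GridSearchCV, with an illustrative grid over max_depth:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Cross validation on the training set picks max_depth;
# the untouched test split then gives the final, unbiased check
grid = GridSearchCV(DecisionTreeClassifier(random_state=0),
                    param_grid={"max_depth": [2, 3, 4, 5, None]},
                    cv=5)
grid.fit(X_train, y_train)

print("Best max_depth:", grid.best_params_)
print("Cross-validation accuracy:", grid.best_score_)
print("Held-out test accuracy:", grid.score(X_test, y_test))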
Program:
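As above, the actual listing is not reproduced here; a minimal sketch of how the analysis program could compare both techniques, under the same Iris assumption:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# 1) Percentage split (80/20)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)
split_acc = DecisionTreeClassifier(random_state=0).fit(
    X_train, y_train).score(X_test, y_test)

# 2) 10-fold cross validation on the full dataset
cv_scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=10)

print(f"Percentage-split accuracy: {split_acc:.3f}")
print(f"Cross-validation accuracy: {cv_scores.mean():.3f} ± {cv_scores.std():.3f}")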
Output: