Lab 02: Decision Tree With Scikit-Learn: About The Mushroom Data Set

This document describes a lab assignment to build decision tree classifiers on a mushroom dataset using scikit-learn. The tasks are to: 1) Prepare training and test datasets by splitting the mushroom data; 2) Build decision tree classifiers on different train/test splits; 3) Evaluate the classifiers using classification reports and confusion matrices; 4) Analyze how decision tree depth affects accuracy on an 80/20 split. Graphs and tables must be used to visualize the decision trees and report accuracy scores for trees of varying depths.

Uploaded by

trung le

Class 20CLC – Term III/2021-2022

Course: CSC14003 – Artificial Intelligence

Lab 02: Decision Tree with scikit-learn

In this assignment, you are going to build a decision tree on the Mushroom dataset with support
from the scikit-learn library.

About the Mushroom data set

This data set includes descriptions of hypothetical samples corresponding to 23 species of gilled
mushrooms. Each species is identified as either edible or poisonous. There are 8124 samples, each of
which is characterized by 22 attributes and a target attribute.
The data file is provided along with this assignment and is posted on Moodle. Note that:
• The target attribute is located in the first column.
• The original data comes from the UCI Machine Learning Repository. You may refer to this page
for more information.
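As a rough illustration of the layout described above (target in the first column, attributes after it), the following sketch separates the two with pandas. The in-memory sample, the helper name load_mushrooms, and the assumption that the file is comma-separated with no header row are all hypothetical; check them against the file actually posted on Moodle.

```python
import io

import pandas as pd

# Hypothetical stand-in for the provided data file: four rows in the same
# layout (target first, then attributes). The real file has 22 attributes.
SAMPLE_CSV = io.StringIO(
    "p,x,s,n\n"
    "e,x,s,y\n"
    "e,b,s,w\n"
    "p,x,y,w\n"
)


def load_mushrooms(source):
    """Read the data and split it into attribute columns and the target column."""
    # Assumption: comma-separated values without a header row.
    data = pd.read_csv(source, header=None)
    labels = data.iloc[:, 0]     # first column: edible (e) or poisonous (p)
    features = data.iloc[:, 1:]  # remaining columns: the attributes
    return features, labels


features, labels = load_mushrooms(SAMPLE_CSV)
print(features.shape)
print(labels.value_counts().to_dict())
```

With the real file, the same function could be called with the file path instead of the in-memory sample.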

Assignment requirements
You are asked to write a Python program, with appropriate calls to scikit-learn functions, to fulfill the
following tasks. Although there is no strict rule on how to organize the code, each task should be
clearly noted.

Preparing the data sets

This task prepares the training sets and test sets for the upcoming experiments.
You need to organize the original Mushroom dataset into four subsets:
• feature_train: a set of training examples, each of which is a tuple of 22 attribute values
(target attribute excluded).
• label_train: a set of labels corresponding to the examples in feature_train.
• feature_test: a set of test examples with the same structure as feature_train.
• label_test: a set of labels corresponding to the examples in feature_test.
You need to shuffle the data before splitting, and the split must be stratified. Other
parameters (if any) are left at their defaults.
There will be experiments on training sets and test sets of different proportions, including
(train/test) 40/60, 60/40, 80/20, and 90/10, and thus you need 16 subsets (four per proportion).
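One possible way to produce the 16 subsets is to call sklearn.model_selection.train_test_split once per proportion with shuffle=True and stratify set to the labels. The sketch below uses a small synthetic data set as a stand-in for the Mushroom data; the dictionary names test_sizes and splits are illustrative choices, not part of the assignment.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the Mushroom data: 200 rows, 4 attributes,
# and a balanced two-class label (e = edible, p = poisonous).
rng = np.random.default_rng(0)
features = pd.DataFrame(rng.integers(0, 3, size=(200, 4)), columns=list("abcd"))
labels = pd.Series(np.where(np.arange(200) % 2 == 0, "e", "p"))

# Train/test proportions from the assignment, expressed as test fractions.
test_sizes = {"40/60": 0.6, "60/40": 0.4, "80/20": 0.2, "90/10": 0.1}

splits = {}
for name, test_size in test_sizes.items():
    # train_test_split returns (X_train, X_test, y_train, y_test) in that order.
    feature_train, feature_test, label_train, label_test = train_test_split(
        features, labels, test_size=test_size, shuffle=True, stratify=labels
    )
    splits[name] = (feature_train, label_train, feature_test, label_test)

for name, (f_tr, l_tr, f_te, l_te) in splits.items():
    print(name, len(f_tr), len(f_te))
```

Because the split is stratified, each subset keeps roughly the same edible/poisonous ratio as the full data set.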

Building the decision tree classifiers
This task conducts experiments on the designated train/test proportions listed above.
You need to fit an instance of sklearn.tree.DecisionTreeClassifier (with information
gain, i.e. criterion="entropy") to each training set and visualize the resulting decision tree
using graphviz.

The figure alongside gives an example of a decision tree built on the Iris dataset (3 classes).
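A minimal sketch of this step, shown on the Iris data set to match the example figure: criterion="entropy" selects splits by information gain, and sklearn.tree.export_graphviz returns the tree in DOT format when out_file=None. The random_state value is only there to make the sketch reproducible and is not required by the assignment. Note also that the Mushroom attributes are categorical letters, so they would need a numeric encoding (for example one-hot) before fitting; that step is an assumption and is not shown here.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_graphviz

iris = load_iris()

# Information gain corresponds to the "entropy" criterion in scikit-learn.
clf = DecisionTreeClassifier(criterion="entropy", random_state=0)
clf.fit(iris.data, iris.target)

# With out_file=None the DOT description is returned as a string.
dot_source = export_graphviz(
    clf,
    out_file=None,
    feature_names=iris.feature_names,
    class_names=list(iris.target_names),
    filled=True,
    rounded=True,
)
print(dot_source[:80])

# Optional rendering, assuming the graphviz Python package and binaries are installed:
# import graphviz
# graphviz.Source(dot_source).render("iris_tree", format="png")
```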

Evaluating the decision tree classifiers

For each of the above decision tree classifiers, predict the examples in the corresponding test set,
and make a report using classification_report and confusion_matrix.
The following figure gives an example of a classification report and a confusion matrix for a
classifier on the Iris dataset (3 classes).

How do you interpret the classification report and the confusion matrix? From that, make your
own comments on the performance of those decision tree classifiers.
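The evaluation step might look like the sketch below, again on the Iris data set to match the example figure. The 80/20 split and the random_state values are illustrative choices. In scikit-learn's confusion matrix, rows correspond to the true classes and columns to the predicted classes, so the diagonal counts the correctly classified examples.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

iris = load_iris()
feature_train, feature_test, label_train, label_test = train_test_split(
    iris.data, iris.target, test_size=0.2, shuffle=True,
    stratify=iris.target, random_state=0,
)

clf = DecisionTreeClassifier(criterion="entropy", random_state=0)
clf.fit(feature_train, label_train)
label_pred = clf.predict(feature_test)

# Per-class precision, recall, F1-score and support.
report = classification_report(label_test, label_pred, target_names=iris.target_names)
# Rows: true classes; columns: predicted classes.
matrix = confusion_matrix(label_test, label_pred)

print(report)
print(matrix)
```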

The depth and accuracy of a decision tree

This task works on the 80/20 training set and test set. You need to consider how the decision
tree’s depth affects the classification accuracy.
You can specify the maximum depth of a decision tree by varying the parameter max_depth of
sklearn.tree.DecisionTreeClassifier.
You need to try the following values for parameter max_depth: None, 2, 3, 4, 5, 6, and 7.
Then:
• Provide the decision tree drawn by graphviz for each max_depth value.
• Report in the following table the accuracy_score (on the test set) of the decision tree
classifier when changing the value of parameter max_depth.

max_depth   None   2   3   4   5   6   7
Accuracy

• Make your own comment on the above statistics.
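The depth experiment can be sketched as a loop over the listed max_depth values that records accuracy_score on the test set. The sketch uses the Iris data set as a stand-in, and the random_state values are only for reproducibility; with the Mushroom data, the 80/20 subsets prepared earlier would be used instead.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

iris = load_iris()
feature_train, feature_test, label_train, label_test = train_test_split(
    iris.data, iris.target, test_size=0.2, shuffle=True,
    stratify=iris.target, random_state=0,
)

depths = [None, 2, 3, 4, 5, 6, 7]
accuracies = {}
for depth in depths:
    # max_depth=None lets the tree grow until all leaves are pure.
    clf = DecisionTreeClassifier(criterion="entropy", max_depth=depth, random_state=0)
    clf.fit(feature_train, label_train)
    accuracies[depth] = accuracy_score(label_test, clf.predict(feature_test))

# Print the two rows of the table requested in the assignment.
print("max_depth", *[str(d) for d in depths], sep="\t")
print("Accuracy", *[f"{accuracies[d]:.3f}" for d in depths], sep="\t")
```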

References

[1] Scikit-learn decision trees: https://scikit-learn.org/stable/modules/tree.html

[2] Analysis and classification of Mushrooms: https://www.kaggle.com/haimfeld87/analysis-and-classification-of-mushrooms

Grading

No.  Specifications                                  Scores (%)

1    Preparing the data sets                         20
2    Building the decision tree classifiers          20
3    Evaluating the decision tree classifiers
       Classification report and confusion matrix    20
       Comments                                      10
4    The depth and accuracy of a decision tree
       Trees, tables, and charts                     20
       Comments                                      10
     Total                                           100

Notice

• This is an INDIVIDUAL assignment.

• Your program must be written in Python. Write your report as a PDF file.
• A program with syntax or runtime errors will not be accepted.
