DSML Practical
Assignment 3
AIM: A dataset collected in a cosmetics shop, showing details of customers and whether or not
they responded to a special offer to buy a new lipstick, is given in the table below. Write a
program that uses this dataset to build a decision tree, with Buys as the target variable, to help
guide future lipstick sales. Find the root node of the decision tree. According to the decision
tree built from this training data, what is the decision for the test data [Age < 21, Income = Low,
Gender = Female, Marital Status = Married]?
OBJECTIVE: Describe the data science process and explore how its components interact. Apply a
specific supervised machine learning algorithm to a particular problem.
ALGORITHM:
Below is a step-by-step procedure for building a decision tree using ID3 (Iterative
Dichotomiser 3), a popular algorithm for constructing decision trees in machine learning:
Input:
- Data: The training dataset containing features and corresponding target labels.
- Attributes: The set of attributes/features available for classification.
- Target Variable: The variable we want to predict (e.g., "Buys" in the given cosmetics shop dataset).
Output:
- Decision Tree: A tree structure representing a sequence of decisions that can be followed to make
predictions.
Algorithm Steps:
1. If all examples in the dataset belong to the same class, return a leaf node with that
class label.
2. If the attribute set is empty, return a leaf node with the majority class label.
3. Calculate the entropy (or Gini index) of the dataset based on the target variable.
4. For each remaining attribute, calculate the information gain obtained by splitting the
dataset on that attribute.
5. Select the attribute with the highest information gain as the decision node (the root
node on the first call).
6. For each value of the chosen attribute, create a branch and recursively apply steps 1-6
to the corresponding subset of the data, with the chosen attribute removed from the
attribute set.
7. Return the resulting tree.
Additional Notes:
- Entropy Calculation:
- Entropy measures the impurity or disorder of a dataset. For a binary classification
problem, entropy is calculated as \( -p_+ \log_2(p_+) - p_- \log_2(p_-) \), where \(p_+\) and
\(p_-\) are the probabilities of the positive and negative classes, respectively.
- Information Gain:
- Information gain measures the effectiveness of an attribute in classifying the
dataset. It is calculated as the difference between the entropy of the original dataset
and the weighted sum of the entropies of the subsets produced by splitting on the
attribute. A worked example follows this list.
- Stopping Criteria:
- Tree construction stops when either all data points in a branch belong to the same
class or there are no more attributes to split on.
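As a quick worked example, suppose a training set \(S\) of 14 examples contains 9 positive and
5 negative labels (illustrative numbers, not taken from the assignment's table). Its entropy is
\( H(S) = -\frac{9}{14}\log_2\frac{9}{14} - \frac{5}{14}\log_2\frac{5}{14} \approx 0.940 \).
If an attribute \(A\) partitions \(S\) into subsets \(S_v\), one per attribute value \(v\), the
information gain is \( \text{Gain}(S, A) = H(S) - \sum_{v} \frac{|S_v|}{|S|} H(S_v) \), and ID3
places the attribute with the largest gain at the current node.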
Code:
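The original listing is not reproduced here, so the following is a minimal from-scratch ID3
sketch in Python. The 14 training rows are illustrative placeholders, not the assignment's
actual table, and should be replaced with the real values; the attribute names and the test
sample follow the problem statement.

import math
from collections import Counter

# Illustrative training data: (Age, Income, Gender, Marital Status, Buys).
# Placeholder rows standing in for the table given in the assignment.
data = [
    ("<21",   "High",   "Male",   "Single",  "No"),
    ("<21",   "High",   "Male",   "Married", "No"),
    ("21-35", "High",   "Male",   "Single",  "Yes"),
    (">35",   "Medium", "Male",   "Single",  "Yes"),
    (">35",   "Low",    "Female", "Single",  "Yes"),
    (">35",   "Low",    "Female", "Married", "No"),
    ("21-35", "Low",    "Female", "Married", "Yes"),
    ("<21",   "Medium", "Male",   "Single",  "No"),
    ("<21",   "Low",    "Female", "Married", "Yes"),
    (">35",   "Medium", "Female", "Single",  "Yes"),
    ("<21",   "Medium", "Female", "Married", "Yes"),
    ("21-35", "Medium", "Male",   "Married", "Yes"),
    ("21-35", "High",   "Female", "Single",  "Yes"),
    (">35",   "Medium", "Male",   "Married", "No"),
]
attributes = ["Age", "Income", "Gender", "Marital Status"]

def entropy(rows):
    """Shannon entropy of the Buys labels in the given rows."""
    counts = Counter(row[-1] for row in rows)
    total = len(rows)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def info_gain(rows, attr_idx):
    """Entropy reduction from splitting rows on the attribute at attr_idx."""
    total = len(rows)
    remainder = 0.0
    for value in set(row[attr_idx] for row in rows):
        subset = [row for row in rows if row[attr_idx] == value]
        remainder += len(subset) / total * entropy(subset)
    return entropy(rows) - remainder

def id3(rows, attr_indices):
    """Recursively build the tree as nested dicts; leaves are class labels."""
    labels = [row[-1] for row in rows]
    if len(set(labels)) == 1:          # step 1: pure node -> leaf
        return labels[0]
    if not attr_indices:               # step 2: no attributes left -> majority leaf
        return Counter(labels).most_common(1)[0][0]
    # Steps 3-5: pick the attribute with the highest information gain.
    best = max(attr_indices, key=lambda i: info_gain(rows, i))
    tree = {attributes[best]: {}}
    remaining = [i for i in attr_indices if i != best]
    # Step 6: branch on each observed value and recurse on the subset.
    for value in set(row[best] for row in rows):
        subset = [row for row in rows if row[best] == value]
        tree[attributes[best]][value] = id3(subset, remaining)
    return tree

def predict(tree, sample):
    """Walk the nested-dict tree using the sample's attribute values.
    (Sketch only: attribute values unseen during training raise KeyError.)"""
    while isinstance(tree, dict):
        attr = next(iter(tree))
        tree = tree[attr][sample[attr]]
    return tree

tree = id3(data, list(range(len(attributes))))
print("Decision tree:", tree)
print("Root node:", next(iter(tree)))

test = {"Age": "<21", "Income": "Low", "Gender": "Female",
        "Marital Status": "Married"}
print("Decision for test data:", predict(tree, test))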
Output:
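With the illustrative placeholder data above, the script prints the nested-dict tree, reports
Age as the root node, and outputs the decision Yes for the test data [Age < 21, Income = Low,
Gender = Female, Marital Status = Married]; the actual output depends on the dataset used.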
CONCLUSION:
The Python code demonstrates the construction of a decision tree classifier for the cosmetics
shop dataset. The dataset records each customer's age, income, gender, and marital status, along
with whether they purchased a lipstick (the "Buys" column); the resulting tree identifies the
root attribute and yields a decision for the given test data.