Decision Tree Practice


Question 1: CGPA Prediction (Using Gini Index)

Dataset:

Hours of Study | Attendance | CGPA (> 3.0)
---------------|------------|-------------
Low            | Low        | No
High           | High       | Yes
Medium         | Medium     | Yes
Low            | High       | No
Medium         | Low        | No
High           | Medium     | Yes
Low            | Low        | No
High           | Low        | Yes

Step 1: Calculate Gini Index for the root node:

Gini Index formula:

Gini = 1 − ∑ (p_i)²

where p_i is the probability of each class (Yes or No in this case).

● Total instances = 8
● Yes (CGPA > 3.0) = 4 instances, p(Yes) = 4/8 = 0.5
● No (CGPA ≤ 3.0) = 4 instances, p(No) = 4/8 = 0.5

Gini(root) = 1 − (0.5² + 0.5²) = 1 − (0.25 + 0.25) = 1 − 0.5 = 0.5
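
This root value is easy to verify in a few lines of Python. The sketch below is illustrative only; the `gini` helper and the hard-coded label list are not part of the exercise:

```python
from collections import Counter

def gini(labels):
    """Gini index: 1 - sum(p_i^2) over the class probabilities."""
    total = len(labels)
    return 1 - sum((count / total) ** 2 for count in Counter(labels).values())

# Root node: 4 'Yes' and 4 'No' out of 8 instances
root_labels = ["No", "Yes", "Yes", "No", "No", "Yes", "No", "Yes"]
print(gini(root_labels))  # 0.5
```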

Step 2: Calculate Gini Index for splits based on Hours of Study:

1. For 'Low' Hours of Study:
   ○ 3 instances, all 'No'
   ○ Gini(Low) = 1 − (0/3)² − (3/3)² = 0
2. For 'Medium' Hours of Study:
   ○ 2 instances, 1 'Yes', 1 'No'
   ○ Gini(Medium) = 1 − (1/2)² − (1/2)² = 1 − 0.25 − 0.25 = 0.5
3. For 'High' Hours of Study:
   ○ 3 instances, all 'Yes'
   ○ Gini(High) = 1 − (3/3)² − (0/3)² = 0

Weighted Gini for Hours of Study:

Weighted Gini = (3/8) × 0 + (2/8) × 0.5 + (3/8) × 0 = 0.125

Gini Gain for Hours of Study:

Gini Gain = 0.5 − 0.125 = 0.375
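
The same arithmetic can be reproduced with a short, self-contained sketch. The dictionary keys and function names below are made up for illustration; the second print anticipates the Attendance result derived in Step 3:

```python
from collections import Counter

def gini(labels):
    total = len(labels)
    return 1 - sum((c / total) ** 2 for c in Counter(labels).values())

def gini_gain(rows, feature, target):
    """Parent Gini minus the size-weighted Gini of each child node."""
    parent = [r[target] for r in rows]
    gain = gini(parent)
    for value in set(r[feature] for r in rows):
        child = [r[target] for r in rows if r[feature] == value]
        gain -= len(child) / len(rows) * gini(child)
    return gain

data = [
    {"study": "Low", "attend": "Low", "cgpa": "No"},
    {"study": "High", "attend": "High", "cgpa": "Yes"},
    {"study": "Medium", "attend": "Medium", "cgpa": "Yes"},
    {"study": "Low", "attend": "High", "cgpa": "No"},
    {"study": "Medium", "attend": "Low", "cgpa": "No"},
    {"study": "High", "attend": "Medium", "cgpa": "Yes"},
    {"study": "Low", "attend": "Low", "cgpa": "No"},
    {"study": "High", "attend": "Low", "cgpa": "Yes"},
]
print(gini_gain(data, "study", "cgpa"))   # 0.375
print(gini_gain(data, "attend", "cgpa"))  # 0.1875 (see Step 3)
```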

Step 3: Calculate Gini Index for splits based on Attendance:

1. For 'Low' Attendance:
   ○ 4 instances, 1 'Yes', 3 'No'
   ○ Gini(Low) = 1 − (1/4)² − (3/4)² = 1 − 0.0625 − 0.5625 = 0.375
2. For 'Medium' Attendance:
   ○ 2 instances, all 'Yes'
   ○ Gini(Medium) = 1 − (2/2)² − (0/2)² = 0
3. For 'High' Attendance:
   ○ 2 instances, 1 'Yes', 1 'No'
   ○ Gini(High) = 1 − (1/2)² − (1/2)² = 0.5

Weighted Gini for Attendance:

Weighted Gini = (4/8) × 0.375 + (2/8) × 0 + (2/8) × 0.5 = 0.3125

Gini Gain for Attendance:

Gini Gain = 0.5 − 0.3125 = 0.1875


Conclusion: Since Hours of Study has the highest Gini Gain (0.375 vs 0.1875 for Attendance), it is chosen as the root split.

          Hours of Study
          /      |      \
       Low    Medium     High
     (3 No) (1 Yes, 1 No) (3 Yes)
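
As a cross-check against a library implementation, here is a minimal sketch using scikit-learn and pandas (assuming both are installed; column names are illustrative). Note that scikit-learn builds binary splits over one-hot indicator columns, so the printed tree is shaped differently from the multiway split above, but the root test should land on one of the Hours-of-Study columns:

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier, export_text

df = pd.DataFrame({
    "study":  ["Low", "High", "Medium", "Low", "Medium", "High", "Low", "High"],
    "attend": ["Low", "High", "Medium", "High", "Low", "Medium", "Low", "Low"],
    "cgpa":   ["No", "Yes", "Yes", "No", "No", "Yes", "No", "Yes"],
})

# scikit-learn trees need numeric input, so one-hot encode the categories.
X = pd.get_dummies(df[["study", "attend"]])
clf = DecisionTreeClassifier(criterion="gini", random_state=0)
clf.fit(X, df["cgpa"])
print(export_text(clf, feature_names=list(X.columns)))
```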


Question 2: Fruit Classification (Using Entropy)

Dataset:

Color  | Size   | Fruit (Apple/NotApple)
-------|--------|-----------------------
Red    | Small  | Apple
Green  | Large  | NotApple
Yellow | Medium | NotApple
Red    | Medium | Apple
Green  | Small  | Apple
Yellow | Large  | NotApple
Red    | Large  | Apple
Green  | Medium | NotApple

Step 1: Calculate Entropy for the root node:

Entropy formula:

Entropy = −∑ p_i · log₂(p_i)

where p_i is the probability of each class.

● Total instances = 8
● Apple = 4 instances, p(Apple) = 4/8 = 0.5
● NotApple = 4 instances, p(NotApple) = 4/8 = 0.5

Entropy(root) = −(0.5 × log₂(0.5) + 0.5 × log₂(0.5)) = −(0.5 × −1 + 0.5 × −1) = 1
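
As with the Gini index, the root entropy takes only a few lines of Python to verify (a minimal sketch; the function name and label list are illustrative):

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy: -sum(p_i * log2(p_i)) over the class probabilities."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

# Root node: 4 'Apple' and 4 'NotApple' out of 8 instances
root = ["Apple", "NotApple", "NotApple", "Apple", "Apple", "NotApple", "Apple", "NotApple"]
print(entropy(root))  # 1.0
```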

Step 2: Calculate Entropy for splits based on Color:

1. For 'Red' Color:
   ○ 3 instances, all 'Apple'
   ○ Entropy(Red) = 0 (since all are 'Apple')
2. For 'Green' Color:
   ○ 3 instances, 1 'Apple', 2 'NotApple'
   ○ Entropy(Green) = −(1/3 × log₂(1/3) + 2/3 × log₂(2/3)) ≈ 0.918
3. For 'Yellow' Color:
   ○ 2 instances, all 'NotApple'
   ○ Entropy(Yellow) = 0

Weighted Entropy for Color:

Weighted Entropy = (3/8) × 0 + (3/8) × 0.918 + (2/8) × 0 ≈ 0.344

Information Gain for Color:

Information Gain = 1 − 0.344 ≈ 0.656
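
A compact, self-contained sketch reproducing this number (the dict keys and helper names are made up for the example):

```python
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def info_gain(rows, feature, target):
    """Parent entropy minus the size-weighted entropy of each child node."""
    parent = [r[target] for r in rows]
    gain = entropy(parent)
    for value in set(r[feature] for r in rows):
        child = [r[target] for r in rows if r[feature] == value]
        gain -= len(child) / len(rows) * entropy(child)
    return gain

fruits = [
    {"color": "Red", "size": "Small", "fruit": "Apple"},
    {"color": "Green", "size": "Large", "fruit": "NotApple"},
    {"color": "Yellow", "size": "Medium", "fruit": "NotApple"},
    {"color": "Red", "size": "Medium", "fruit": "Apple"},
    {"color": "Green", "size": "Small", "fruit": "Apple"},
    {"color": "Yellow", "size": "Large", "fruit": "NotApple"},
    {"color": "Red", "size": "Large", "fruit": "Apple"},
    {"color": "Green", "size": "Medium", "fruit": "NotApple"},
]
print(round(info_gain(fruits, "color", "fruit"), 3))  # 0.656
```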

Step 3: Calculate Entropy for splits based on Size:

1. For 'Small' Size:
   ○ 2 instances, all 'Apple'
   ○ Entropy(Small) = 0
2. For 'Medium' Size:
   ○ 3 instances, 1 'Apple', 2 'NotApple'
   ○ Entropy(Medium) = −(1/3 × log₂(1/3) + 2/3 × log₂(2/3)) ≈ 0.918
3. For 'Large' Size:
   ○ 3 instances, 1 'Apple', 2 'NotApple'
   ○ Entropy(Large) ≈ 0.918

Weighted Entropy for Size:

Weighted Entropy = (2/8) × 0 + (3/8) × 0.918 + (3/8) × 0.918 ≈ 0.689

Information Gain for Size:

Information Gain = 1 − 0.689 ≈ 0.311

Conclusion:

Since Color has the highest Information Gain (0.656 vs 0.311 for Size), it is chosen as the root split.

               Color
            /    |     \
         Red   Green    Yellow
   (3 Apple) (1 Apple, 2 NotApple) (2 NotApple)
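
The greedy procedure used in both questions (pick the attribute with the highest gain, then recurse on each branch) can also be sketched end to end. This is a minimal ID3-style illustration under the entropy criterion, not production code; all names are made up for the example:

```python
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def info_gain(rows, feature, target):
    parent = [r[target] for r in rows]
    gain = entropy(parent)
    for value in set(r[feature] for r in rows):
        child = [r[target] for r in rows if r[feature] == value]
        gain -= len(child) / len(rows) * entropy(child)
    return gain

def build_tree(rows, features, target):
    """ID3-style: return a leaf for pure nodes, else split on the best feature."""
    labels = [r[target] for r in rows]
    if len(set(labels)) == 1 or not features:
        return Counter(labels).most_common(1)[0][0]  # majority-class leaf
    best = max(features, key=lambda f: info_gain(rows, f, target))
    rest = [f for f in features if f != best]
    return {best: {value: build_tree([r for r in rows if r[best] == value], rest, target)
                   for value in set(r[best] for r in rows)}}

fruits = [
    {"color": "Red", "size": "Small", "fruit": "Apple"},
    {"color": "Green", "size": "Large", "fruit": "NotApple"},
    {"color": "Yellow", "size": "Medium", "fruit": "NotApple"},
    {"color": "Red", "size": "Medium", "fruit": "Apple"},
    {"color": "Green", "size": "Small", "fruit": "Apple"},
    {"color": "Yellow", "size": "Large", "fruit": "NotApple"},
    {"color": "Red", "size": "Large", "fruit": "Apple"},
    {"color": "Green", "size": "Medium", "fruit": "NotApple"},
]
print(build_tree(fruits, ["color", "size"], "fruit"))
# Expected shape: Color at the root, with Red -> Apple and Yellow -> NotApple leaves
```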
