Decision Tree Assignment
Supervised By
Dr Mohamed Abo Rizka
Prepared By
Saif Allah Mohamed Bakry
1. How to handle training data with missing attribute values?
Decision trees handle missing attribute values in the following ways (see the sketch after this list):
Fill the missing attribute value with the most common value of that attribute.
Fill the missing value by assigning each possible value of the attribute a probability estimated from the other samples, then choosing among the values accordingly.
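A minimal Python sketch of both strategies, using pandas and NumPy; the "Outlook" attribute, its values, and the class column are hypothetical toy data, not part of the assignment:

import numpy as np
import pandas as pd

# Toy training data with one missing value in the "Outlook" attribute.
data = pd.DataFrame({
    "Outlook": ["Sunny", "Rain", np.nan, "Sunny", "Rain", "Sunny"],
    "Play":    ["No",    "Yes",  "Yes",  "No",    "Yes",  "No"],
})

# Strategy 1: fill with the most common value (mode) of the attribute.
most_common = data["Outlook"].mode()[0]
print(data["Outlook"].fillna(most_common).tolist())

# Strategy 2: assign each possible value a probability proportional to its
# frequency among the non-missing samples, then sample the fill value.
probs = data["Outlook"].value_counts(normalize=True)
rng = np.random.default_rng(0)
sampled = data["Outlook"].apply(
    lambda v: rng.choice(probs.index.to_numpy(), p=probs.values) if pd.isna(v) else v
)
print(sampled.tolist())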
2. Which attribute selection measures are used to split the nodes?
Information Gain:
Information gain measures the reduction in entropy after a dataset is split on an attribute.
It tells us how much information a feature provides about the class.
We split the node and build the decision tree on the attribute with the highest information gain.
Information Gain formula: Gain(S, A) = Entropy(S) − Σv (|Sv| / |S|) × Entropy(Sv), i.e., the entropy of S minus the weighted-average entropy of the subsets Sv produced by splitting S on attribute A.
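A short Python sketch of this computation; the weather-style feature and class labels are hypothetical:

import numpy as np

def entropy(labels):
    # Entropy(S) = -sum over classes of p_i * log2(p_i).
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def information_gain(feature, labels):
    # Gain(S, A) = Entropy(S) - sum over values v of (|Sv|/|S|) * Entropy(Sv).
    weighted = 0.0
    for v in np.unique(feature):
        subset = labels[feature == v]
        weighted += len(subset) / len(labels) * entropy(subset)
    return entropy(labels) - weighted

feature = np.array(["Sunny", "Sunny", "Rain", "Rain", "Rain"])
labels  = np.array(["No", "No", "Yes", "Yes", "No"])
print(information_gain(feature, labels))  # gain from splitting on this feature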
Gini Index:
The Gini index is a measure of impurity (or purity) used when building a decision tree with the CART (Classification and Regression Trees) algorithm. For a set S, Gini(S) = 1 − Σi pi², where pi is the proportion of class i in S.
An attribute with a low Gini index is preferred over one with a high Gini index.
CART creates only binary splits, and it uses the Gini index to choose between candidate splits, as the sketch below shows.
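A matching Python sketch, computing the Gini index of a set and the weighted Gini of a candidate binary split in the CART style; the data is again hypothetical:

import numpy as np

def gini(labels):
    # Gini(S) = 1 - sum over classes of p_i^2.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def gini_of_binary_split(feature, labels, value):
    # Weighted Gini of the binary split "feature == value" vs. "feature != value".
    mask = feature == value
    left, right = labels[mask], labels[~mask]
    n = len(labels)
    return len(left) / n * gini(left) + len(right) / n * gini(right)

feature = np.array(["Sunny", "Sunny", "Rain", "Rain", "Overcast"])
labels  = np.array(["No", "No", "Yes", "Yes", "Yes"])
for v in np.unique(feature):
    print(v, gini_of_binary_split(feature, labels, v))

The candidate split with the lowest weighted Gini index would be chosen.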
3. When should we stop splitting to avoid overfitting?
To avoid overfitting in a decision tree, we can use one of two pruning approaches:
Reduced error pruning: Replace a subtree with a single leaf node labeled with the most common classification of the training examples at that node, keeping the change only if it does not reduce accuracy on a validation set (see the sketch after this list).
Rule post-pruning: Convert the tree into rules, then prune each rule by removing any precondition whose removal improves the rule's estimated accuracy.
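A minimal sketch of reduced error pruning over a toy tree, assuming dict-based samples and a held-out validation set; the Node class, the "Outlook"/"Wind" features, and the data are all hypothetical. Rule post-pruning follows the same validation-driven idea but operates on the extracted rules instead of subtrees:

class Node:
    # A node splits on one categorical feature; `children` maps a feature
    # value to a subtree, and `label` is the majority class of the training
    # examples that reached this node (used when the node acts as a leaf).
    def __init__(self, feature=None, children=None, label=None):
        self.feature = feature
        self.children = children or {}
        self.label = label

    def is_leaf(self):
        return not self.children

    def predict(self, sample):
        if self.is_leaf():
            return self.label
        child = self.children.get(sample.get(self.feature))
        return child.predict(sample) if child else self.label

def accuracy(tree, X, y):
    return sum(tree.predict(s) == t for s, t in zip(X, y)) / len(y)

def reduced_error_prune(root, node, val_X, val_y):
    # Bottom-up: prune the children first, then try replacing this subtree
    # with a leaf carrying its majority label; keep the replacement only if
    # validation accuracy does not drop.
    for child in node.children.values():
        if not child.is_leaf():
            reduced_error_prune(root, child, val_X, val_y)
    before = accuracy(root, val_X, val_y)
    saved = node.children
    node.children = {}                    # temporarily make this node a leaf
    if accuracy(root, val_X, val_y) < before:
        node.children = saved             # pruning hurt accuracy -> undo it

tree = Node("Outlook", {
    "Sunny": Node(label="No"),
    "Rain":  Node("Wind", {"Strong": Node(label="No"),
                           "Weak":   Node(label="Yes")}, label="Yes"),
}, label="Yes")
val_X = [{"Outlook": "Sunny"},
         {"Outlook": "Rain", "Wind": "Strong"},
         {"Outlook": "Rain", "Wind": "Weak"}]
val_y = ["No", "Yes", "Yes"]
reduced_error_prune(tree, tree, val_X, val_y)
print(tree.children["Rain"].is_leaf())  # True: the "Wind" subtree was pruned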