Decision Trees
Agenda
3. Regularization
4. Case studies
Decision Trees -
1. Decision Tree is one of the most commonly used models in the data science world
3. It can be used for both regression and classification, though it is more often used for
classification
4. It can be used for binary classification, for example whether a loan applicant is likely to
default or whether a customer is likely to churn
5. It can also be used for multi-class classification, for example identifying a character of
the English alphabet
6. The Decision Tree algorithm finds the relation between the target column and the
independent variables and expresses it as a tree structure
Decision Trees: Training/Building Tree

Training data (parent node):

HorsePower   Weight   Car Type
270          2500     L
200          2900     L
130          3500     L
70           2530     S
90           2000     S
80           2200     S
90           1500     S
150          3000     L
215          2000     L
100          1700     S

[Figure: the parent node is split on Weight > 2000 into two smaller nodes.]

2. The smaller node on the bottom has "S" in the majority in the target column and hence gets the label "S"
3. The homogeneity of the target column in both smaller nodes has increased compared to the parent node
Decision Trees: Training/Building Tree
[Figure: the two nodes from the first split are split again on HorsePower (e.g. HorsePower > 100 for the node with majority class "L"), producing four smaller nodes.]

1. The smaller node on the top is now perfectly homogeneous in the target column and belongs to class "S"
2. The second node similarly belongs to class "L"
3. The third node belongs to class "S"
4. The fourth node belongs to class "L"
Decision Trees: Training/Building Tree
If wt > 2000
    if hp > 100
        class = "L"
    else
        class = "S"
If wt <= 2000
    if hp > 200
        class = "L"
    else
        class = "S"
Note: The CART algorithm employed by scikit-learn creates only binary trees, i.e., each node is split into
two sub-nodes
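As an illustration, the sketch below fits scikit-learn's DecisionTreeClassifier (which implements CART) on the toy car data shown above and prints the learned rules. The feature names hp and wt and the random_state value are choices made for this sketch, and the learned thresholds may differ from the hand-built tree, since CART picks its own cut points.

    # Minimal sketch: fit a CART tree (scikit-learn) on the toy car data above
    from sklearn.tree import DecisionTreeClassifier, export_text

    # [horsepower, weight] -> car type ("L" = large, "S" = small)
    X = [[270, 2500], [200, 2900], [130, 3500], [70, 2530], [90, 2000],
         [80, 2200], [90, 1500], [150, 3000], [215, 2000], [100, 1700]]
    y = ["L", "L", "L", "S", "S", "S", "S", "L", "L", "S"]

    tree = DecisionTreeClassifier(criterion="gini", random_state=0)
    tree.fit(X, y)

    # Print the learned binary tree as nested if/else rules
    print(export_text(tree, feature_names=["hp", "wt"]))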
Decision Trees: Predicting
To predict the class of a new data point, we apply the decision functions starting from the root node,
for example first the function on the weight column, and follow the branches until the data point
reaches a leaf. The label of that leaf (here "Large Car") is the predicted class of the new data point.
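Continuing the sketch above (reusing the fitted tree), prediction routes a new data point through the decision functions until it reaches a leaf; the horsepower and weight values below are hypothetical.

    # Predict the class of new cars by routing them through the fitted tree
    new_points = [[180, 2800],   # hypothetical heavy, high-horsepower car
                  [90, 1800]]    # hypothetical light, low-horsepower car
    print(tree.predict(new_points))   # expected: ['L' 'S']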
Decision Trees – Training Errors
2. Classification errors can occur during both training and testing; they are called training
errors and testing errors respectively. This is true for any algorithm
3. By default, the decision tree algorithm will try to build a tree where the smallest child
nodes are perfectly homogeneous in the target column
4. To achieve perfect homogeneity in the target column, the algorithm may build a large
tree where each leaf has only 1 record! Such models are overfit models. They give
zero errors on training but perform poorly on test data
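The sketch below illustrates this on a synthetic, noisy dataset (the dataset, its parameters, and the split sizes are illustrative, not from the slides): an unconstrained tree reaches near-perfect training accuracy but noticeably lower test accuracy.

    # Illustrative sketch: an unconstrained tree memorises noisy training data
    from sklearn.datasets import make_classification
    from sklearn.model_selection import train_test_split
    from sklearn.tree import DecisionTreeClassifier

    X, y = make_classification(n_samples=500, n_features=10, flip_y=0.2, random_state=0)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

    full_tree = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)  # grown until leaves are pure
    print(full_tree.score(X_tr, y_tr))   # training accuracy: typically ~1.0 (zero training error)
    print(full_tree.score(X_te, y_te))   # test accuracy: noticeably lower on unseen data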
Decision Trees – Posterior Probability
5. Sometimes, when the algorithm runs out of independent attributes to use to break a
node into smaller nodes, or when it is forced to stop, we may find nodes where the target
column is not homogeneous, for example:

HorsePower   Weight   Car Type
90           2000     S
90           1500     S
215          2000     L
100          1700     S

6. In such a case, the label assigned to the node is based on the majority class, and the ratio of
classes indicates the posterior probability of the two classes at that node: P(S) = 3/4 and
P(L) = 1/4
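As a sketch of how this shows up in scikit-learn, the example below forces the node above to remain a single leaf (using min_samples_leaf purely as a device for illustration), so the class ratio is returned as the posterior probability.

    # Sketch: a node that cannot be split assigns the majority label, and its
    # class ratio becomes the posterior probability
    from sklearn.tree import DecisionTreeClassifier

    X = [[90, 2000], [90, 1500], [215, 2000], [100, 1700]]   # hp, weight (the node above)
    y = ["S", "S", "L", "S"]

    stub = DecisionTreeClassifier(min_samples_leaf=4).fit(X, y)  # no split possible -> one leaf
    print(stub.classes_)                      # ['L' 'S']
    print(stub.predict([[120, 1800]]))        # majority class -> ['S']
    print(stub.predict_proba([[120, 1800]]))  # [[0.25 0.75]] i.e. P(L) = 1/4, P(S) = 3/4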
Decision Trees – Structure & Node types
1. Decision tree classifiers utilize a tree structure to model the relationships among the features
and the potential outcomes
2. Decision trees consist of nodes and branches. A node represents a decision function,
while a branch represents the result of the function. Thus, the tree is a flow chart for deciding
how to classify a new observation
3. The nodes are of three types: the Root Node (representing the original data), Branch
Nodes (each representing a function), and Leaf Nodes (each holding the result of all the previous
functions that connect to it)
Decision Trees - Structure & Node types
4. For classification problems, the posterior probability of all the classes is reflected in the
leaf node, and the Leaf Node is labelled with the majority class
5. After executing all the functions from the Root Node to a Leaf Node, the class of a data
point is decided by the leaf node it reaches
6. For regression, the average/median value of the target attribute in the leaf node is assigned
to the query point
7. Tree creation splits data into subsets and subsets into further smaller subsets. The
algorithm stops splitting data when the data within the subsets are sufficiently
homogeneous or some other stopping criterion is met
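For the regression case in point 6, here is a minimal sketch with scikit-learn's DecisionTreeRegressor (the one-feature data and the max_depth value are purely illustrative): each leaf predicts the mean target value of the training records that fall into it.

    # Sketch: a regression tree predicts the mean target value of the leaf
    # that a query point falls into
    from sklearn.tree import DecisionTreeRegressor

    X = [[1], [2], [3], [10], [11], [12]]     # one illustrative feature
    y = [1.0, 1.2, 0.9, 8.0, 8.5, 7.9]        # target values

    reg = DecisionTreeRegressor(max_depth=1).fit(X, y)
    print(reg.predict([[2.5]]))    # ~1.03, the mean of the left leaf
    print(reg.predict([[11.0]]))   # ~8.13, the mean of the right leaf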
Decision Trees - Structure & Node types
1. The decision tree algorithm learns (i.e. creates the decision tree from the data set)
through optimization of a loss function
2. The loss function represents the impurity in the target column. The
requirement here is to minimize the impurity as much as possible at the leaf nodes
Suppose we wish to find whether shipping mode and order priority have any influence on
customer location. Customer location is the target column and is like the bag of coloured balls
in the Shannon's Entropy example below.

[Figure: the Sales Data is split first on Shipping Mode (Regular Air vs. Express Air) and then on Order Priority (Low vs. High).]
When sub-branches are created, the total entropy of the sub-branches should be
less than the entropy of the parent node. The greater the drop in entropy, the more
information is gained
Decision Trees – Shannon's Entropy
a. Imagine a bag contains 6 red and 4 black balls
b. Let the two classes be Red -> class 0 and Black -> class 1
c. The entropy of the bag is H(X) = -p(red)*log2(p(red)) - p(black)*log2(p(black))
   = -0.6*log2(0.6) - 0.4*log2(0.4) ≈ 0.971
d. Suppose we remove all red balls from the bag; the entropy then becomes
   H(X) = -1.0*log2(1.0) - 0.0*log2(0) = 0  ## Entropy is 0! i.e. we have complete information (no uncertainty)
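A small sketch of the same calculation in Python (the entropy helper below is written for this example, not taken from any library):

    # Shannon entropy from class proportions (log base 2)
    from math import log2

    def entropy(proportions):
        # H(X) = -sum(p * log2(p)) over the non-zero class proportions
        return -sum(p * log2(p) for p in proportions if p > 0)

    print(entropy([0.6, 0.4]))   # mixed bag, 6 red and 4 black balls -> ~0.971
    print(entropy([1.0, 0.0]))   # only black balls left -> 0.0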
Decision Trees – Entropy and Information Gain

[Figure: the Shipping Mode / Order Priority tree annotated with the entropy and information gain at each level.]

Level 0: Shipping Mode node (1000 records), entropy E0 = maximum entropy, say 1. Information gain = 0.
Level 1: Regular Air (700 records, entropy E1a) and Express Air (300 records, entropy E1b).
    E1 = E1a * 700/1000 + E1b * 300/1000
    Information gain = E0 - E1
Level 2: Low Priority (500 records, E2a) and High Priority (200 records, E2b) under Regular Air;
         Low Priority (100 records, E2c) and High Priority (200 records, E2d) under Express Air.
    E2 = E2a * 500/700 + E2b * 200/700 + E2c * 100/300 + E2d * 200/300
    Information gain = E1 - E2
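As a sketch, the information gain of a split can be computed as the parent entropy minus the size-weighted average of the child entropies; the helper functions and the class counts below are made up for illustration (the slide does not give the actual E1a/E1b values).

    # Information gain = parent entropy - weighted average of child entropies
    from math import log2

    def entropy(counts):
        total = sum(counts)
        return -sum(c / total * log2(c / total) for c in counts if c > 0)

    def information_gain(parent_counts, children_counts):
        total = sum(parent_counts)
        weighted = sum(sum(child) / total * entropy(child) for child in children_counts)
        return entropy(parent_counts) - weighted

    # In the spirit of the Shipping Mode split: 1000 records, children of 700 and 300
    parent = [500, 500]                    # evenly distributed target -> entropy 1.0
    children = [[450, 250], [50, 250]]     # illustrative class counts per child
    print(information_gain(parent, children))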
The tree will stop growing when a stopping criterion for splitting is reached, which could be:
a. The tree has reached a certain pre-fixed depth (the longest path from root node to leaf node)
b. The tree has reached the maximum number of nodes (tree size)
c. All attributes to split on have been exhausted
d. A split would create a leaf node with fewer than a predefined number of data points
Decision Trees - Information Gain using Entropy
1. Gini index – calculated by subtracting the sum of the squared probabilities of each
class from one
a. Uses the squared proportion of classes
b. For a perfectly classified node, the Gini Index is zero
c. For an evenly distributed node, it is 1 – (1/#classes)
d. You want a variable split that has a low Gini Index
e. Used in the CART algorithm
2. Entropy –
a. Favors splits with small counts but many unique values
b. Weights the probability of each class by log (base 2) of the class probability
c. A smaller entropy at the child nodes is better; it makes the difference from the parent node's
entropy larger
d. Information Gain is the entropy of the parent node minus the weighted entropy of the child nodes
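A quick sketch of the Gini calculation described in point 1 (the helper below is written for this example):

    # Gini index = 1 - sum of squared class proportions
    def gini(proportions):
        return 1.0 - sum(p ** 2 for p in proportions)

    print(gini([1.0, 0.0]))   # perfectly classified node -> 0.0
    print(gini([0.5, 0.5]))   # evenly distributed, 2 classes -> 0.5 = 1 - 1/2
    print(gini([0.6, 0.4]))   # the 6 red / 4 black bag -> 0.48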
Decision Trees – Gini , Entropy , Misclassification Error
3. C5.0 is Quinlan’s latest version and it uses less memory and builds smaller
rulesets than C4.5 while being more accurate
Advantages -
1. Simple, fast in processing, and effective
2. Does well with noisy data and missing data
3. Handles numeric and categorical variables
4. Interpretation of results does not require mathematical or statistical knowledge
Disadvantages -
1. Often biased towards splits on features that have a large number of levels
2. May not be optimal, as some relations are hard to model with axis-parallel splits
3. Small changes in the training data can result in large changes to the decision logic
4. Large trees can be difficult to interpret
Decision Trees - Preventing overfitting through regularization
3. If left unconstrained, decision trees can build tree structures that adapt to the training
data, leading to overfitting
4. To avoid overfitting, we need to restrict the DT's freedom during tree
creation. This is called regularization
1. max_depth – the maximum length of a path from root to leaf (in terms of the
number of decision points). A leaf node at the maximum depth is not split further. This could lead to
a tree where a leaf node contains many observations on one side of the tree,
whereas on the other side, nodes containing far fewer observations get
split further
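A minimal sketch of how such constraints look in scikit-learn (the specific values are illustrative starting points, not recommendations, and X_train/y_train stand for your own training data):

    # Restricting the tree's freedom (regularization) with scikit-learn constraints
    from sklearn.tree import DecisionTreeClassifier

    regularized_tree = DecisionTreeClassifier(
        max_depth=4,            # longest root-to-leaf path (number of decision points)
        min_samples_split=20,   # a node needs at least this many records to be split
        min_samples_leaf=10,    # every leaf must keep at least this many records
        max_leaf_nodes=16,      # cap on the total number of leaves (tree size)
        random_state=0,
    )
    # regularized_tree.fit(X_train, y_train)   # X_train, y_train: your training data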