Machine Learning Chapter 4
Machine Learning Chapter 4
Definition
Decision trees are a popular and powerful tool used in various fields
such as machine learning, data mining, and statistics. They provide a
clear and intuitive way to make decisions based on data by modeling
the relationships between different variables
What is a Decision Tree?
Root Node: Represents the entire dataset and the initial decision to be
made.
Internal Nodes: Represent decisions or tests on attributes. Each
internal node has one or more branches.
Branches: Represent the outcome of a decision or test, leading to
another node.
Leaf Nodes: Represent the final decision or prediction. No further splits
occur at these nodes.
How Decision Trees Work?
Overfitting: Decision trees can easily over fit the training data,
especially if they are deep with many nodes.
Instability: Small variations in the data can result in a completely
different tree being generated.
Bias towards Features with More Levels: Features with more levels
can dominate the tree structure.