Machine Learning Lecture 2,3,4

Decision Trees and Ensemble Methods in Machine Learning
Lecture #2
Dr. Sadaqat Ali
Introduction to Decision Trees

● Decision trees are used for both classification and regression.
● They split data into subsets based on feature values.
● The splits form a tree-like structure for making decisions.
● A decision tree is a flowchart-like structure used to make decisions or predictions.
Structure of a Decision Tree

1. Root Node: Represents the entire dataset and the initial decision to be made.
2. Internal Nodes: Represent decisions or tests on attributes. Each internal node has one or more branches.
3. Branches: Represent the outcome of a decision or test, leading to another node.
4. Leaf Nodes: Represent the final decision or prediction. No further splits occur at these nodes.
How Decision Trees Work

The process of creating a decision tree involves:

1. Selecting the Best Attribute: Using a metric such as Gini impurity, entropy, or information gain, the best attribute to split the data on is selected (a short sketch of these measures follows below).
2. Splitting the Dataset: The dataset is split into subsets based on the selected attribute.
3. Repeating the Process: The process is repeated recursively for each subset, creating a new internal node or leaf node until a stopping criterion is met (e.g., all instances in a node belong to the same class).
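
A minimal sketch of the two impurity measures named above, assuming NumPy is available; the function names gini_impurity and entropy are illustrative, not taken from any particular library:

```python
import numpy as np

def gini_impurity(labels):
    # Gini impurity: 1 minus the sum of squared class probabilities.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def entropy(labels):
    # Entropy: -sum of p * log2(p) over the classes present in the node.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

labels = np.array(["yes", "yes", "yes", "no", "no"])
print(gini_impurity(labels))  # 0.48
print(entropy(labels))        # ~0.971
```

A pure node (all samples in one class) scores 0 on both measures; the split that most reduces impurity (i.e., gives the largest information gain) is chosen.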
Classification Trees

● Goal: Classify data into categories.
● Use measures such as Gini impurity or entropy.
● Output: A class label (see the sketch below).
● Can you think of a situation where you'd want to classify something?
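
A minimal classification-tree sketch using scikit-learn's DecisionTreeClassifier; the Iris dataset and the hyperparameter choices are illustrative assumptions, not part of the slides:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Gini impurity is the default splitting criterion; "entropy" is also available.
clf = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=42)
clf.fit(X_train, y_train)
print("Test accuracy:", clf.score(X_test, y_test))  # output is a class label per sample
```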
Regression Trees

● Goal: Predict continuous numeric values.
● Use measures such as Mean Squared Error (MSE).
● Output: A predicted value (see the sketch below).
● When might predicting a number be useful in real life?
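
A minimal regression-tree sketch with scikit-learn's DecisionTreeRegressor; the Diabetes dataset and depth limit are illustrative assumptions, and the "squared_error" criterion name assumes a recent scikit-learn release:

```python
from sklearn.datasets import load_diabetes
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

X, y = load_diabetes(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# "squared_error" (MSE) is the default split criterion for regression trees.
reg = DecisionTreeRegressor(criterion="squared_error", max_depth=4, random_state=42)
reg.fit(X_train, y_train)
preds = reg.predict(X_test)  # continuous numeric predictions
print("Test MSE:", mean_squared_error(y_test, preds))
```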
Advantages of Decision Trees

● Simplicity and Interpretability: Decision trees are easy to understand and interpret. The visual representation closely mirrors human decision-making processes.
● Versatility: Can be used for both classification and regression tasks.
● No Need for Feature Scaling: Decision trees do not require normalization or scaling of the data.
● Handles Non-linear Relationships: Capable of capturing non-linear relationships between features and target variables.
Disadvantages of Decision Trees

● Prone to overfitting.
● Sensitive to small data changes: small variations in the data can result in a completely different tree being generated.
Introduction to Ensemble Methods

● Combine multiple models to improve predictions.
● Often use "weak learners" as base models.
● Aim to increase accuracy and robustness.

"Ensemble means 'a collection of things'. In machine learning terminology, ensemble learning refers to the approach of combining multiple ML models to produce a more accurate and robust prediction than any individual model."
Introduction to Ensemble Methods

What is ensemble learning?

● Ensemble learning is a machine learning technique that combines the predictions from multiple individual models to obtain better predictive performance than any single model.
Introduction to Ensemble Methods

● Ensemble learning combines multiple models (weak or strong learners) to improve overall predictive performance, making it more accurate and robust than individual models. The main idea is to reduce errors by leveraging the strengths of diverse models.
Types of Ensemble Methods

● Bagging: Uses bootstrap samples of the data.
● Boosting: Builds models sequentially.
● Stacking: Combines predictions with a meta-model.
● Which method sounds most interesting to you, and why?
Types of Ensemble Methods
1. Bagging (Bootstrap Aggregating):
○ Trains models independently on random subsets of data.
○ Reduces variance and prevents overfitting.
○ Example: Random Forest (majority vote for classification or averaging for regression).
2. Boosting:
○ Builds models sequentially, where each corrects the errors of the previous one.
○ Reduces bias and variance.
○ Examples: AdaBoost, XGBoost (popular in predictive tasks).
3. Stacking:
○ Combines predictions from multiple base models using a meta-model.
○ Uses diverse models to optimize performance.
○ Example: Logistic Regression as a meta-model over Decision Trees and SVM.
4. Voting:
○ Aggregates predictions from independent models (a short sketch follows below).
○ Hard Voting: Majority vote.
○ Soft Voting: Weighted average of predicted probabilities.
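
A minimal voting-ensemble sketch with scikit-learn's VotingClassifier; the three base models and the Iris dataset are illustrative assumptions:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Hard voting: each model casts one vote and the majority class wins.
# Soft voting: averages predicted probabilities, so every base model
# must support predict_proba (hence probability=True for the SVC).
voter = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("tree", DecisionTreeClassifier(max_depth=3)),
        ("svm", SVC(probability=True)),
    ],
    voting="soft",
)
voter.fit(X, y)
print(voter.predict(X[:5]))
```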
Random Forests

● Based on bagging with decision trees.
● Uses random sampling of both data and features.
● Combines predictions from multiple trees (see the sketch below).
● How might this reduce overfitting compared to a single tree?
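
A minimal random-forest sketch with scikit-learn's RandomForestClassifier; the dataset and hyperparameter values are illustrative assumptions:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Each tree is trained on a bootstrap sample of the rows, and each split
# considers only a random subset of the features (max_features).
forest = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=42)
forest.fit(X_train, y_train)
print("Test accuracy:", forest.score(X_test, y_test))
```

Averaging many decorrelated trees smooths out the quirks any single tree picks up from its particular sample, which is why the ensemble usually overfits less than one deep tree.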
Advantages of Random Forests

● Reduces overfitting compared to single trees.
● Handles both numerical and categorical data well.
● Scalable to large datasets.
● Which advantage do you think is most important?
Disadvantages of Random Forests

● Less interpretable than a single decision tree.
● Can be computationally expensive.
● May require more memory for large datasets.
● Why might interpretability be important in some cases?
Introduction to Boosting

● Iterative technique to improve weak models.
● Adjusts weights of data points and models.
● Aims to minimize errors over time.
● How is this different from random forests?
AdaBoost (Adaptive Boosting)

● Combines weak learners (e.g., shallow trees).
● Increases the weight of misclassified samples.
● Final prediction is a weighted vote (see the sketch below).
● Why might focusing on misclassified samples be helpful?
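
A minimal AdaBoost sketch with scikit-learn's AdaBoostClassifier; the Breast Cancer dataset and hyperparameters are illustrative assumptions:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# By default each weak learner is a decision stump (a depth-1 tree).
# After every round, misclassified samples receive larger weights, so the
# next stump concentrates on the hard cases; the final prediction is a
# weighted vote over all stumps.
ada = AdaBoostClassifier(n_estimators=100, learning_rate=0.5, random_state=42)
ada.fit(X_train, y_train)
print("Test accuracy:", ada.score(X_test, y_test))
```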
XGBoost (Extreme Gradient Boosting)

● Advanced, scalable version of boosting.
● Uses gradient boosting with optimizations.
● Handles missing data and uses parallel computation (see the sketch below).
● How might these features be useful for big datasets?
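
A minimal XGBoost sketch using the library's scikit-learn-style wrapper; it assumes the separate xgboost package is installed (pip install xgboost), and the dataset and hyperparameters are illustrative:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier  # external package, not part of scikit-learn

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Gradient boosting with built-in regularization; at each split, samples
# with missing feature values are routed to a learned default direction.
model = XGBClassifier(n_estimators=200, learning_rate=0.1, max_depth=3)
model.fit(X_train, y_train)
print("Test accuracy:", model.score(X_test, y_test))
```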
Advantages of Boosting

● Improves weak models iteratively.
● Works well with smaller datasets (AdaBoost).
● Highly efficient and scalable (XGBoost).
● Which advantage stands out to you most?
Disadvantages of Boosting

● Can be sensitive to noisy data.
● Prone to overfitting if not regularized.
● May require parameter tuning (XGBoost).
● How might these disadvantages affect real-world use?
Introduction to Stacking

● Combines predictions from multiple base models.
● Uses a meta-model trained on the base models' outputs.
● Allows blending of diverse models.
● How is this different from boosting and bagging?
How Stacking Works

● Train multiple base models (e.g., trees, SVMs).
● Use their predictions as features for a meta-model.
● Final prediction comes from the meta-model (see the sketch below).
● Can you think of a real-world analogy for this process?
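
A minimal stacking sketch with scikit-learn's StackingClassifier; the choice of base models, meta-model, and dataset is an illustrative assumption:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Base models are trained with cross-validation; their out-of-fold
# predictions become the input features for the logistic-regression meta-model.
stack = StackingClassifier(
    estimators=[
        ("tree", DecisionTreeClassifier(max_depth=3)),
        ("svm", SVC(probability=True)),
    ],
    final_estimator=LogisticRegression(max_iter=1000),
    cv=5,
)
stack.fit(X, y)
print(stack.predict(X[:5]))
```

This mirrors the echoed example from the earlier list: logistic regression acting as a meta-model over a decision tree and an SVM.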
Advantages and Disadvantages of Stacking

● Advantage: Can outperform individual models.
● Advantage: Allows blending diverse models.
● Disadvantage: Complex and time-consuming.
● Disadvantage: Risk of overfitting the meta-model.
● Which do you think is more significant: the advantages or the disadvantages?
Comparing Ensemble Methods

● Random Forests: Robust but less interpretable.
● AdaBoost: Improves weak models but is sensitive to noise.
● XGBoost: Scalable but requires tuning.
● Stacking: Blends models but is complex.
● Based on this comparison, which method interests you most?
