
Machine Learning

Lecture 15: Ensemble Learning Methods


COURSE CODE: CSE451
2023
Course Teacher
Dr. Mrinal Kanti Baowaly
Associate Professor
Department of Computer Science and Engineering,
Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Bangladesh.

Email: [email protected]
Ensemble Learning
 A powerful way to improve the performance of your model
 Construct a set of classifiers from training data
 Predict class label of test data by combining the predictions made
by multiple classifiers or models
 Examples: Random Forest, AdaBoost, Stochastic Gradient Boosting, Gradient Boosting Machine (GBM), XGBoost, LightGBM, CatBoost
General Approach
Step 1: Create multiple data sets D1, D2, ..., Dt-1, Dt from the original training data D.
Step 2: Build a classifier on each data set, giving C1, C2, ..., Ct-1, Ct.
Step 3: Combine the classifiers into a single ensemble classifier C*.
Simple Ensemble Techniques
 Max Voting
 Averaging
 Weighted Averaging
Max Voting
 Multiple models are used to make predictions for each data point
 The predictions by each model are considered as a ‘vote’
 The prediction made by the majority of the models is used as the final prediction
 Generally used for classification problems
 For example, suppose you ask 5 of your colleagues to rate your movie (out of 5): three of them rate it 4 while the other two give it a 5. Since the majority gave a rating of 4, you can take the final rating of the movie as 4. You can consider this as taking the mode of all the predictions.
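A minimal scikit-learn sketch of max voting (illustrative only; the dataset and the three base models are assumptions, not from the lecture). VotingClassifier with voting='hard' takes the mode of the base models' predicted labels:

```python
# Hard (max) voting: the class predicted by the majority of base models wins.
from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

vote = VotingClassifier(
    estimators=[('lr', LogisticRegression(max_iter=1000)),
                ('knn', KNeighborsClassifier()),
                ('dt', DecisionTreeClassifier(random_state=0))],
    voting='hard')  # 'hard' = take the mode of the predicted class labels
vote.fit(X_train, y_train)
print("max-voting accuracy:", vote.score(X_test, y_test))
```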
Averaging
 Similar to the max voting technique, multiple predictions are made for each data point
 Take an average of predictions from all the models and use it to
make the final prediction.
 Averaging can be used in regression or classification problems.
 For example, in the previous case study of max voting, the averaging method
would take the average of all the values, i.e. (5+4+5+4+4)/5 = 4.4.
Hence, final rating of the movie is 4.4.
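A quick sketch of the averaging calculation from the slide (the numbers are the colleagues' ratings); for classifiers, the same idea is typically applied to the predicted probabilities:

```python
# Average the five colleagues' ratings to get the final rating.
import numpy as np

ratings = np.array([5, 4, 5, 4, 4])
print(ratings.mean())  # 4.4, as on the slide
```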
Weighted Averaging
 This is an extension of the averaging method.
 All models are assigned different weights defining the importance
of each model for prediction.
 For example, if two of your colleagues are critics while the others have no prior experience in this field, then the answers from these two friends are given more importance than the answers from the other people.
The result can be calculated as [(5*0.23) + (4*0.23) + (5*0.18) + (4*0.18) +
(4*0.18)] = 4.41.
Hence, final rating of the movie is 4.41.
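The same calculation with weights, as a small sketch (the weights 0.23 and 0.18 are the ones assumed on the slide):

```python
# Weighted average: the critics' ratings get weight 0.23, the others 0.18.
import numpy as np

ratings = np.array([5, 4, 5, 4, 4])
weights = np.array([0.23, 0.23, 0.18, 0.18, 0.18])  # weights sum to 1
print(np.average(ratings, weights=weights))  # ≈ 4.41, as on the slide
```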

Implementation: AnalyticsVidhya, GeeksForGeeks


Advanced Ensemble Techniques
 Bagging: The idea behind bagging is combining the results of
multiple models run in parallel (for instance, all decision trees) to
get a generalized result.
 Boosting: Boosting is a sequential process, where each subsequent
model attempts to correct the errors of the previous model.
 Stacking: Stacking is an ensemble learning technique that uses
multiple models’ (called base models) predictions as features to
build a new model (called meta-model).
Bagging
 Multiple subsets are created from the
original dataset, selecting observations
with replacement (called bootstrapping).
 A base model (weak model) is created on
each of these subsets.
 The models run in parallel and are
independent of each other.
 The final predictions are determined by combining the predictions from all the models (e.g., by voting or averaging).
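A minimal bagging sketch with scikit-learn (the toy dataset is an assumption; BaggingClassifier's default base model is a decision tree):

```python
# Bagging: bootstrap samples -> one weak model per sample -> combine by voting.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

bag = BaggingClassifier(n_estimators=50,   # number of bootstrapped base models
                        bootstrap=True,    # sample observations with replacement
                        random_state=0)    # default base model is a decision tree
bag.fit(X_train, y_train)
print("bagging accuracy:", bag.score(X_test, y_test))
```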
Boosting
1. A base (weak) learner takes all the distributions and assigns equal weight or attention to each observation.
2. If the base learning algorithm makes prediction errors, higher weight or attention is given to the observations that were predicted incorrectly.
3. Apply the next base learning algorithm.
4. Repeat steps 2 to 3 until the algorithm can correctly classify the output or the maximum number of iterations is reached.
5. The weak learners are combined to form a strong
learner that will predict a more accurate outcome.
An Example of Boosting (AdaBoost)
 B1 consists of 10 data points of two types, plus (+) and minus (-): 5 are plus (+) and the other 5 are minus (-), and each point is initially assigned an equal weight. The first model tries to classify the data points and generates a vertical separator line, but it wrongly classifies 3 pluses (+) as minuses (-).
 B2 consists of the 10 data points from the previous model, in which the 3 wrongly classified pluses (+) are weighted more, so the current model tries harder to classify these pluses (+) correctly. This model generates a vertical separator line that correctly classifies the previously misclassified pluses (+), but in this attempt it wrongly classifies three minuses (-).
 B3 consists of the 10 data points from the previous model, in which the 3 wrongly classified minuses (-) are weighted more, so the current model tries harder to classify these minuses (-) correctly. This model generates a horizontal separator line that correctly classifies the previously misclassified minuses (-).
 B4 combines B1, B2 and B3 in order to build a strong prediction model that is much better than any individual model used.
Another Example: Dataaspirant, Detailed Implementation: AnalyticsVidhya
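A minimal AdaBoost sketch mirroring the B1–B4 story above (the dataset is an assumption; scikit-learn's default base learner is a depth-1 decision tree, i.e. a stump):

```python
# AdaBoost: each round up-weights the misclassified points and fits another
# weak learner; the weighted weak learners form the final strong model.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

ada = AdaBoostClassifier(n_estimators=50, learning_rate=1.0, random_state=0)
ada.fit(X_train, y_train)
print("AdaBoost accuracy:", ada.score(X_test, y_test))
```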
HW: Difference between Bagging and
Boosting

Ref: QuantDare
Stacking Ensemble Learning

[Figure: base models trained on the original data form Level 0; their predictions are used as input features for the meta-model at Level 1.]

Source and Implementation: GeeksForGeeks, AnalyticsVidhya
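A minimal stacking sketch with scikit-learn (dataset and model choices are assumptions): the Level-0 base models' out-of-fold predictions become the features on which the Level-1 meta-model is trained:

```python
# Stacking: base models (Level 0) feed their predictions to a meta-model (Level 1).
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

stack = StackingClassifier(
    estimators=[('dt', DecisionTreeClassifier(random_state=0)),  # Level 0
                ('knn', KNeighborsClassifier())],                # Level 0
    final_estimator=LogisticRegression(max_iter=1000),           # Level 1
    cv=5)  # base-model predictions are generated out-of-fold to avoid leakage
stack.fit(X_train, y_train)
print("stacking accuracy:", stack.score(X_test, y_test))
```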
Random Forests Classifier
 The random forests algorithm
 How does the algorithm work?
 Its advantages and disadvantages
 Comparison between random forests and decision trees
 Finding important features
 Building a classifier with scikit-learn
Random Forests Algorithm
 It is a popular supervised learning algorithm.
 Random forest builds multiple decision trees (called a forest) on various random samples (or subsets) from a given dataset, takes the prediction from each tree, and predicts the final output based on the majority vote of the predictions.
 It is based on the ‘bagging’ ensemble method, which yields a more accurate and stable prediction.
 It can be used both for classification and regression.
How does the algorithm work?
 Select random samples from a given
dataset (using bootstrapping).
 Construct a decision tree for each
sample and get a prediction result
from each decision tree.
 Final prediction is made by selecting
the prediction with the most votes
(for classification) or averaging the
predictions (for regression).
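A minimal random-forest sketch with scikit-learn (the iris dataset is an assumed example):

```python
# Random forest: many bootstrapped decision trees; the final class is the
# majority vote across the trees (or the average for regression).
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X_train, y_train)
print("random-forest accuracy:", rf.score(X_test, y_test))
```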
Advantages of Random Forests
 Random forests are considered a highly accurate and robust method because of the number of decision trees participating in the process.
 They are much less prone to overfitting because they build multiple trees on random subsets and then average (or take the majority vote of) the trees' predictions; this randomness plus voting or averaging reduces the variance of the individual trees.
 It can handle missing data.
 It can be used in both classification and regression problems.
Disadvantages of Random Forests
 Random forests are slow because they build multiple decision trees and make the final prediction by combining the predictions of each individual tree.
 The model is difficult to interpret compared to a decision tree, where you can easily make a decision by following a path in the tree.
Random Forest vs Decision Tree
 A random forest is a set of multiple decision trees, whereas a decision tree is a single tree.
 A deep decision tree may suffer from overfitting, but a random forest reduces overfitting by creating multiple trees on random subsets.
 A decision tree is computationally faster, while a random forest is slower.
 A random forest is difficult to interpret, while a decision tree is easily interpretable and can be converted to rules.
Finding Important Features
 Random forests offer a good feature selection indicator.
 Scikit-learn provides an extra attribute (feature_importances_) with the model, which shows the relative importance or contribution of each feature to the prediction.
 It automatically computes the relevance score of each feature in the
training phase. Then it scales the relevance down so that the sum of all
scores is 1. The higher the score, the more important the feature.
 This score will help you choose the most important features and drop the
least important ones for model building.
 Random forest uses Gini importance (i.e., impurity-based feature importance) to calculate the importance of each feature.
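A small sketch of reading feature_importances_ from a fitted model (the iris dataset is an assumed example):

```python
# After fitting, feature_importances_ holds one score per feature; the scores sum to 1.
import pandas as pd
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

data = load_iris()
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(data.data, data.target)

importances = pd.Series(rf.feature_importances_, index=data.feature_names)
print(importances.sort_values(ascending=False))  # most important features first
```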
More on Random Forest (LAB)
 Build a Random Forest classifier with scikit-learn
 Find important features of a Random Forest classifier with scikit-
learn
 Build both Decision Tree and Random Forest classifiers and compare their performances (see the sketch below)
 Why does the Random Forest model outperform the Decision Tree?

Source: DataCamp, AnalyticsVidhya
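A starting-point sketch for the lab comparison (the breast-cancer dataset is an assumption; any tabular dataset works):

```python
# Train a single decision tree and a random forest on the same split and compare.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

dt = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)
print("decision tree accuracy:", dt.score(X_test, y_test))
print("random forest accuracy:", rf.score(X_test, y_test))
```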


Advanced Boosting Methods
 What is GBM?
 What is XGBoost?
 What is LightGBM?
 Advantages of using Light GBM and XGBoost
 Build classifiers using GBM, LightGBM and XGBoost
 Compare GBM, LightGBM and XGBoost
 Which algorithm takes the crown: LightGBM or XGBoost?
Source: AnalyticsVidhya [1], [2]
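A minimal sketch of the scikit-learn style wrappers for XGBoost and LightGBM (assumes the xgboost and lightgbm packages are installed; the dataset and hyperparameters are illustrative, not tuned):

```python
# Gradient-boosting classifiers via the sklearn-compatible XGBoost/LightGBM APIs.
from lightgbm import LGBMClassifier
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

X, y = make_classification(n_samples=1000, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

for name, model in [("XGBoost", XGBClassifier(n_estimators=200, learning_rate=0.1, random_state=0)),
                    ("LightGBM", LGBMClassifier(n_estimators=200, learning_rate=0.1, random_state=0))]:
    model.fit(X_train, y_train)
    print(name, "accuracy:", model.score(X_test, y_test))
```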
Advanced Boosting Methods (Cont..)
 What is CatBoost?
 Advantages of CatBoost library
 CatBoost in comparison to other boosting algorithms
 Installing CatBoost
 Solving ML challenge using CatBoost

Source: AnalyticsVidhya, Dataaspirant
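A minimal CatBoost sketch (assumes the catboost package is installed; the tiny toy DataFrame is purely illustrative). Passing cat_features lets CatBoost encode categorical columns internally, which is one of its main selling points:

```python
# CatBoost handles categorical features natively via the cat_features argument.
import pandas as pd
from catboost import CatBoostClassifier

X = pd.DataFrame({
    "city": ["dhaka", "sylhet", "dhaka", "khulna", "sylhet", "dhaka", "khulna", "sylhet"],
    "age":  [25, 32, 47, 51, 29, 36, 44, 58],
})
y = [0, 1, 0, 1, 1, 0, 1, 1]

model = CatBoostClassifier(iterations=100, verbose=0)
model.fit(X, y, cat_features=["city"])  # "city" is treated as categorical
print(model.predict(X))
```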


Comparison of CatBoost to other
boosting algorithms
A Comprehensive Course on Ensemble
Learning

Enroll now
Study Materials of Ensemble Methods
 AnalyticsVidhya: A Comprehensive Guide to Ensemble Learning
(with Python codes)
 GeeksForGeeks: Ensemble Method in Python
 AnalyticsVidhya: Basics of Ensemble Learning Explained in Simple
English
 Dataaspirant: How the Kaggle winners algorithm XGBoost algorithm
works
