33 - Assignment 7 - Implementation of Ensemble Techniques
Aim:
Select an appropriate dataset for a classification/regression problem, implement
the following ensemble techniques, and compare their performances:
1. Stacking
2. Blending
3. Random Forest
Theory:
Ensemble Techniques
There are several main techniques used in ensemble learning:
1. Bagging (Bootstrap Aggregating)
● Creates diversity by generating random samples from the training
data and fitting the same model to each sample.
● Produces a "homogeneous parallel ensemble" of models of the same
type.
● Examples include Random Forests which extend bagging with
decision trees.
2. Boosting
● Follows an iterative process, sequentially training each model on the
errors of the previous model.
● Produces an additive model to progressively reduce the final errors.
● Examples include AdaBoost and Gradient Boosting (a brief AdaBoost sketch is given after this list).
3. Stacking/Blending
● Combines different base models, each trained independently to be
diverse.
● Produces a "heterogeneous parallel ensemble" of different model
types.
● Combines the base models using a meta-model trained on their
outputs.
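Since boosting is only summarized above, a minimal illustrative sketch using scikit-learn's AdaBoostClassifier is given below; the synthetic dataset produced by make_classification is only a placeholder for the dataset chosen for this assignment.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import AdaBoostClassifier
from sklearn.metrics import accuracy_score

# Placeholder dataset; replace with the dataset selected for this assignment
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# AdaBoost trains weak learners sequentially, giving more weight to the
# samples that previous learners misclassified
booster = AdaBoostClassifier(n_estimators=100, random_state=42)
booster.fit(X_train, y_train)
print("AdaBoost accuracy:", accuracy_score(y_test, booster.predict(X_test)))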
Other ensemble techniques include:
● Majority voting for classification
● Averaging predictions for regression
● Weighted averaging based on model performance
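The snippet below is a minimal sketch of majority and weighted voting using scikit-learn's VotingClassifier; the particular base models, the weights, and the synthetic dataset are illustrative assumptions rather than prescribed choices.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Placeholder dataset; replace with the dataset selected for this assignment
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Hard (majority) voting over three diverse classifiers; the optional weights
# give more influence to models assumed to perform better
voter = VotingClassifier(
    estimators=[
        ('lr', LogisticRegression(max_iter=1000)),
        ('dt', DecisionTreeClassifier(random_state=42)),
        ('knn', KNeighborsClassifier()),
    ],
    voting='hard',
    weights=[2, 1, 1],
)
voter.fit(X_train, y_train)
print("Voting accuracy:", accuracy_score(y_test, voter.predict(X_test)))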
Applications
Ensemble learning has been successfully applied to a wide range of
machine learning tasks including:
● Classification
● Regression
● Clustering
● Anomaly detection
● Structured prediction
It is particularly effective for improving model performance on noisy,
complex, or imbalanced datasets. Widely used ensemble methods include
random forests, gradient boosting, and stacked models.
Stacking
Stacking, or stacked generalization, is an ensemble learning technique that
combines the predictions of multiple base models (also known as level 0
models) to improve predictive performance. Here’s how it works:
1. Base Models: Different machine learning algorithms are trained on
the same dataset. These models can be of different types (e.g.,
decision trees, support vector machines, etc.) to ensure diversity.
2. Meta-Model: A second-level model, called the meta-model (or level 1
model), is trained on the outputs (predictions) of the base models.
The meta-model learns how to best combine these predictions to
produce a final output.
3. Training Process: Typically, k-fold cross-validation is used to generate
predictions from the base models. Each base model is trained on k-1
folds and validated on the remaining fold, ensuring that the
meta-model is trained on predictions that are not biased by the
training data.
4. Final Prediction: Once the meta-model is trained, it can be used to
make predictions on new data by combining the predictions from the
base models.
Stacking is advantageous because it allows the meta-model to learn the
best way to combine the strengths of various models, potentially leading to
improved accuracy over any single model used alone.
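As a minimal sketch of this procedure, the snippet below uses scikit-learn's StackingClassifier, which internally performs the cross-validated generation of base-model predictions described above; the synthetic dataset and the specific base/meta models are assumptions standing in for the dataset and models chosen for this assignment.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

# Placeholder dataset; replace with the dataset selected for this assignment
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Diverse level-0 (base) models
base_models = [
    ('dt', DecisionTreeClassifier(random_state=42)),
    ('svm', SVC(probability=True, random_state=42)),
]

# Level-1 meta-model trained on out-of-fold predictions of the base models
stack = StackingClassifier(
    estimators=base_models,
    final_estimator=LogisticRegression(max_iter=1000),
    cv=5,  # k-fold cross-validation keeps base predictions unbiased
)
stack.fit(X_train, y_train)
print("Stacking accuracy:", accuracy_score(y_test, stack.predict(X_test)))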
Blending
Blending is a variation of stacking that simplifies the process by using a
holdout validation set instead of k-fold cross-validation. Here’s how
blending differs from stacking:
1. Training and Validation Split: In blending, the training dataset is split
into two parts: a training set and a validation set. The base models
are trained on the training set.
2. Predictions on Validation Set: Each base model makes predictions on
the validation set. These predictions are then used as features to
train the meta-model.
3. Final Prediction: The meta-model is trained on these predictions and
is then used to make predictions on the test dataset.
Blending is generally faster than stacking because it does not require the
computational overhead of k-fold cross-validation. However, it may be less
robust due to the potential for overfitting on the validation set, especially if
the dataset is small.
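Scikit-learn has no built-in blending estimator, so the following is a hand-rolled sketch of the holdout-based procedure described above; the dataset, split sizes, and model choices are all illustrative assumptions.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

# Placeholder dataset; replace with the dataset selected for this assignment
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# 1. Split the training data into a base-training set and a holdout validation set
X_base, X_val, y_base, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=42)

# 2. Train the base models on the base-training set only
base_models = [DecisionTreeClassifier(random_state=42), SVC(probability=True, random_state=42)]
for model in base_models:
    model.fit(X_base, y_base)

# 3. Use the base models' predictions on the validation set as meta-features
meta_features_val = np.column_stack([m.predict_proba(X_val)[:, 1] for m in base_models])
meta_model = LogisticRegression(max_iter=1000)
meta_model.fit(meta_features_val, y_val)

# 4. At prediction time, stack the base models' test-set predictions the same way
meta_features_test = np.column_stack([m.predict_proba(X_test)[:, 1] for m in base_models])
print("Blending accuracy:", accuracy_score(y_test, meta_model.predict(meta_features_test)))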
Random Forest
Random Forest is an ensemble learning method specifically designed for
classification and regression tasks. It is a type of bagging technique that
uses decision trees as its base learners. Here’s how it works:
1. Bootstrap Sampling: Random Forest creates multiple subsets of the
training data through bootstrapping (random sampling with
replacement). Each subset is used to train a separate decision tree.
2. Feature Randomness: When splitting nodes in each decision tree,
Random Forest randomly selects a subset of features rather than
considering all features. This introduces additional diversity among
the trees.
3. Aggregation: For classification tasks, the final prediction is made
through majority voting among the trees. For regression tasks, the
average of the predictions from all trees is taken.
Random Forest is robust against overfitting due to its ensemble nature and
the randomness introduced in both data sampling and feature selection. It
generally performs well on many datasets and is less sensitive to
hyperparameter tuning compared to other algorithms.
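A minimal sketch with scikit-learn's RandomForestClassifier is shown below; n_estimators and max_features correspond to the bootstrap-sampling and feature-randomness ideas above, and the dataset is again a synthetic placeholder for the one chosen for this assignment.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

# Placeholder dataset; replace with the dataset selected for this assignment
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# 100 trees, each fit on a bootstrap sample; only sqrt(n_features) features
# are considered at each split, adding diversity among the trees
forest = RandomForestClassifier(n_estimators=100, max_features='sqrt', random_state=42)
forest.fit(X_train, y_train)
print("Random Forest accuracy:", accuracy_score(y_test, forest.predict(X_test)))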
Comparison of Stacking, Blending, and Random Forest:

Final Model
● Stacking: trained on predictions of the base models
● Blending: trained on predictions of the base models
● Random Forest: aggregates predictions from multiple trees

Complexity
● Stacking: more computationally intensive due to cross-validation
● Blending: less computationally intensive
● Random Forest: more straightforward; less tuning required
Conclusion:
The differences in model performance can be explained by how each
ensemble method leverages the strengths of its base learners and how
sensitive each method is to hyperparameter settings. Blending, with its
optimal tuning, was able to best capitalize on the strengths of its base
models, while Random Forest’s inherent robustness allowed it to perform
well out of the box. Stacking, however, struggled due to potential
mismatches between its base and meta-models.