
Module II: Ensemble Learning

[8 Sessions] [Apply]

Ensemble Learning – using subset of instances – Bagging, Pasting; using subset of features – Random Patches and Random Subspaces methods; Voting Classifier, Random Forest; Boosting – AdaBoost, Gradient Boosting, Extremely Randomized Trees, Stacking.
Why Ensemble Learning?

Suppose you ask a complex question to thousands of random people, then aggregate their answers. In many cases you will find that this aggregated answer is better than an expert's answer.

Similarly, if you aggregate the predictions of a group of predictors (such as classifiers or regressors), you will often get better predictions than with the best individual predictor.

A group of predictors is called an ensemble; thus, this technique is called Ensemble Learning, and an Ensemble Learning algorithm is called an Ensemble method.
Voting Classifiers

Suppose you have trained a few classifiers, each one achieving about 80% accuracy.
You may have:
• Logistic Regression classifier
• SVM classifier
• Random Forest classifier
• K-Nearest Neighbors classifier
A very simple way to create an even better classifier is to aggregate the predictions of each classifier and predict the class that gets the most votes. This majority-vote classifier is called a hard voting classifier.

Surprisingly, this voting classifier often achieves a higher accuracy than the best classifier in the ensemble.

Hard Voting Classifier Predictions


The following code creates and trains a voting classifier in Scikit-Learn, composed of
three diverse classifiers:
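The code itself is not reproduced in this extract; the sketch below shows what such a hard voting classifier looks like in Scikit-Learn, assuming a training split x_train, y_train is available (for example, the split created in the Python coding example later in this module):

from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

# Three diverse base classifiers
log_clf = LogisticRegression()
rnd_clf = RandomForestClassifier()
svm_clf = SVC()

# Hard voting: predict the class that gets the most votes
voting_clf = VotingClassifier(
    estimators=[('lr', log_clf), ('rf', rnd_clf), ('svc', svm_clf)],
    voting='hard')
voting_clf.fit(x_train, y_train)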

The Voting Classifier outperforms the individual classifiers, with an accuracy of 90.4%.
Bagging and Pasting
Another approach is to use the same training algorithm for every predictor, but to train
them on different random subsets of the training set.
When sampling is performed with replacement, this method is called bagging (short for
bootstrap aggregating ).
When sampling is performed without replacement, it is called pasting.

Pasting/Bagging Training Set sampling and Training


Bagging Classifier
Python coding for ensemble learning
import pandas as pd
from pandas import read_csv
import numpy as np
from sklearn.metrics import accuracy_score
from sklearn import tree
from sklearn import svm
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Load the dataset and separate the features from the target column
df = read_csv("diabetes.csv")
x = np.array(df.drop(columns=["Outcome"]))
y = np.array(df["Outcome"])

# Hold out 20% of the data for testing
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.2, random_state=100)

# Three individual classifiers
model1 = tree.DecisionTreeClassifier()
model2 = svm.SVC(kernel='sigmoid', C=1, gamma=1)
model3 = LogisticRegression(solver='liblinear', random_state=0)

model1.fit(x_train, y_train)
model2.fit(x_train, y_train)
model3.fit(x_train, y_train)

prediction1 = model1.predict(x_test)
print("Decision Tree")
print(accuracy_score(prediction1, y_test)*100)

prediction2 = model2.predict(x_test)
print("support vector machine")
print(accuracy_score(prediction2, y_test)*100)

prediction3 = model3.predict(x_test)
print("LogisticRegression")
print(accuracy_score(prediction3, y_test)*100)

Output:
Decision Tree
70.77922077922078
support vector machine
65.5844155844156
LogisticRegression
74.02597402597402
# Ensemble of models
estimator = []
estimator.append(('LR', LogisticRegression(solver='lbfgs', multi_class='multinomial', max_iter=200)))
estimator.append(('SVC', svm.SVC(gamma='auto', probability=True)))
estimator.append(('DTC', tree.DecisionTreeClassifier()))

# Voting classifier with hard voting
from sklearn.ensemble import VotingClassifier
hard_voting = VotingClassifier(estimators=estimator, voting='hard')
hard_voting.fit(x_train, y_train)
prediction4 = hard_voting.predict(x_test)
print("Ensemble learning")
print(accuracy_score(prediction4, y_test)*100)

Output:
Ensemble learning
75.32467532467533


The following code trains an ensemble of 500 Decision Tree classifiers, each trained on 100 training instances randomly sampled from the training set with replacement (this is an example of bagging, but if you want to use pasting instead, just set bootstrap=False).
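The code itself does not appear in this extract; a minimal sketch under those settings, reusing the x_train, y_train split from the earlier diabetes example, might look like this:

from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

# 500 trees, each trained on 100 instances sampled with replacement (bagging);
# set bootstrap=False to sample without replacement (pasting) instead
bag_clf = BaggingClassifier(
    DecisionTreeClassifier(), n_estimators=500,
    max_samples=100, bootstrap=True, n_jobs=-1)
bag_clf.fit(x_train, y_train)
y_pred = bag_clf.predict(x_test)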
A single Decision Tree versus a bagging ensemble of 500 trees
Random Patches and Random Subspaces
The BaggingClassifier class supports sampling the features as well. This is controlled by two
hyperparameters: max_features and bootstrap_features.
They work the same way as max_samples and bootstrap, but for feature sampling instead of
instance sampling.
Thus, each predictor will be trained on a random subset of the input features.

Sampling both training instances and features is called the Random Patches method. Keeping all training instances but sampling features (i.e., bootstrap_features=True and/or max_features smaller than 1.0) is called the Random Subspaces method.

This is particularly useful when you are dealing with high-dimensional inputs (such as
images).
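As an illustration (not from the slides), a Random Subspaces ensemble can be configured with BaggingClassifier, again assuming the earlier x_train, y_train split; the 0.5 feature fraction is an arbitrary choice:

from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

# Random Subspaces: keep all training instances (bootstrap=False, max_samples=1.0)
# but train each tree on a random subset of the features
subspace_clf = BaggingClassifier(
    DecisionTreeClassifier(), n_estimators=500,
    bootstrap=False, max_samples=1.0,
    bootstrap_features=True, max_features=0.5, n_jobs=-1)
subspace_clf.fit(x_train, y_train)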
Random Forests

Random Forest is an ensemble of Decision Trees, generally trained via the bagging
method (or sometimes pasting).

Instead of building a Bagging Classifier and passing it a Decision Tree Classifier, you can use the Random Forest Classifier class, which is more convenient and optimized for Decision Trees.

Random Forest tends to combine hundreds of decision trees and then trains each decision tree on a different sample of the observations.
Decision Tree is a series of Nodes

Pruning - To reduce the complexity of the Decision Tree Algorithm

Random Forest - An improved version of Decision Tree
Bagging using Random Forest Trees
• Random Forest is a specific ensemble method that
utilizes bagging as its underlying technique.
• Random forest is one of the most popular tree-based supervised learning algorithms. It is also flexible and easy to use.
• The algorithm can be used to solve both classification
and regression problems.
• Random forest tends to combine hundreds of decision
trees and then trains each decision tree on a different
sample of the observations.
Algorithm
• Step 1: The algorithm selects random samples from the dataset provided.
• Step 2: The algorithm will create a decision tree for
each sample selected. Then it will get a prediction
result from each decision tree created.
• Step 3: Voting will then be performed for every
predicted result. For a classification problem, it will
use mode, and for a regression problem, it will
use mean.
• Step 4: And finally, the algorithm will select the
most voted prediction result as the final prediction.
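As an illustrative sketch (not from the slides), these steps correspond to Scikit-Learn's RandomForestClassifier; the hyperparameter values below are arbitrary and the earlier diabetes split is assumed:

from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

# 500 trees, each trained on a bootstrap sample of the training set;
# predictions are aggregated by majority vote
rf_clf = RandomForestClassifier(n_estimators=500, n_jobs=-1, random_state=100)
rf_clf.fit(x_train, y_train)
print(accuracy_score(rf_clf.predict(x_test), y_test)*100)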
Extremely Randomized Trees ensemble

When you are growing a tree in a Random Forest, at each node only a random subset of
the features is considered for splitting.
Trees can be made even more random by also using random thresholds for each feature rather than searching for the best possible thresholds.
A forest of such extremely random trees is simply called an Extremely Randomized Trees ensemble (Extra-Trees).
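A sketch of this in Scikit-Learn (illustrative; it assumes the same x_train, y_train split as before) uses the ExtraTreesClassifier class, which has essentially the same API as RandomForestClassifier:

from sklearn.ensemble import ExtraTreesClassifier

# Extra-Trees: random split thresholds per candidate feature instead of
# searching for the best possible threshold
extra_clf = ExtraTreesClassifier(n_estimators=500, n_jobs=-1, random_state=100)
extra_clf.fit(x_train, y_train)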
Boosting
Boosting (originally called hypothesis boosting) refers to any Ensemble method that
can combine several weak learners into a strong learner.

The general idea of most boosting methods is to train predictors sequentially, each
trying to correct its predecessor.

The most popular Boosting methods are:
• AdaBoost (Adaptive Boosting)
• Gradient Boosting
AdaBoost
A first base classifier (such as a Decision Tree) is trained and used to make predictions on the training set. The relative weight of misclassified training instances is then increased. A second classifier is trained using the updated weights and again makes predictions on the training set; the weights are updated, and so on.

AdaBoost sequential training with instance weight updates


The first classifier gets many instances wrong, so their weights get
boosted.
The second classifier therefore does a better job on these instances,
and so on.
The plot on the right represents the same sequence of predictors
except that the learning rate is halved (i.e., the misclassified instance
weights are boosted half as much at every iteration).
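A minimal sketch of AdaBoost in Scikit-Learn (illustrative, not the slide's code; the stump depth, number of estimators, and learning rate are arbitrary, and the earlier training split is assumed):

from sklearn.ensemble import AdaBoostClassifier
from sklearn.tree import DecisionTreeClassifier

# Sequentially trained decision stumps; learning_rate scales how strongly
# misclassified instance weights are boosted at each iteration
ada_clf = AdaBoostClassifier(
    DecisionTreeClassifier(max_depth=1), n_estimators=200,
    learning_rate=0.5, random_state=100)
ada_clf.fit(x_train, y_train)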
Gradient Boosting
• Gradient Boosting works by sequentially adding
predictors to an ensemble, each one correcting
its predecessor. However, instead of tweaking the
instance weights at every iteration like AdaBoost
does, this method tries to fit the new predictor to
the residual errors made by the previous
predictor.
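An illustrative sketch with Scikit-Learn's GradientBoostingClassifier (not from the slides; hyperparameter values are arbitrary and the earlier x_train, y_train split is assumed):

from sklearn.ensemble import GradientBoostingClassifier

# Each new shallow tree is fit to the residual errors of the ensemble built so far;
# learning_rate scales the contribution of each tree
gb_clf = GradientBoostingClassifier(
    n_estimators=200, learning_rate=0.1, max_depth=3, random_state=100)
gb_clf.fit(x_train, y_train)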

Stacking
Instead of using trivial functions (such as hard voting) to aggregate the predictions of
all predictors in an ensemble, we train a model to perform this aggregation.

Each of the bottom three predictors predicts a different value (3.1, 2.7, and 2.9), and then the final predictor (called a blender, or a meta learner) takes these predictions as inputs and makes the final prediction (3.0).
Training the First layer

The training set is split into two subsets. The first subset is used to train the predictors in the first layer.
Next, the first-layer predictors are used to make predictions on the second (held-out) subset.

Training the Blender
We can create a new training set using these predicted values as input features (which makes this new training set three-dimensional), while keeping the target values. The blender is trained on this new training set, so it learns to predict the target value given the first layer's predictions.
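As an illustrative sketch (not from the slides), Scikit-Learn's StackingClassifier implements this idea: it uses cross-validated predictions of the first-layer estimators to build the blender's training set. The estimator list and blender below are arbitrary choices, and the earlier x_train, y_train split is assumed:

from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# First layer: three diverse predictors; final_estimator is the blender (meta learner)
stack_clf = StackingClassifier(
    estimators=[('dt', DecisionTreeClassifier()),
                ('svc', SVC(probability=True)),
                ('lr', LogisticRegression(max_iter=200))],
    final_estimator=LogisticRegression(),
    cv=5)
stack_clf.fit(x_train, y_train)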

Predictions in a multilayer stacking ensemble
• The first training subset is used to train the first layer, the second one is used to create the training set used to train the second layer, and the third one is used to create the training set used to train the third layer.

• Once this is done, we can make a prediction for a new instance by going through each layer sequentially.
