Bagging and Boosting
Ensemble methods are machine learning techniques that combine several base models in order to
produce one optimal predictive model.
This is where ensemble methods come in handy. Rather than relying on one Decision Tree and hoping we made
the right decision at each split, ensemble methods allow us to take a sample of Decision Trees into
account, calculate which features to use or questions to ask at each split, and make a final prediction
based on the aggregated results of the sampled Decision Trees.
https://quantdare.com/what-is-the-difference-between-bagging-and-boosting/
https://towardsdatascience.com/ensemble-methods-in-machine-learning-what-are-they-and-why-use-them-68ec3f9fef5f
BAGGing, or Bootstrap AGGregating, gets its name because it combines Bootstrapping and
Aggregation to form one ensemble model. Given a sample of data, multiple bootstrapped subsamples
are pulled, and a Decision Tree is formed on each of the bootstrapped subsamples. After each subsample's
Decision Tree has been formed, an algorithm is used to aggregate over the Decision Trees to form the
most accurate predictor.
In short: given a dataset, bootstrapped subsamples are pulled, a Decision Tree is formed on each
bootstrapped sample, and the results of each tree are aggregated to yield the strongest, most accurate predictor.
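Below is a minimal sketch of that procedure. The toy dataset, the number of trees, and the majority-vote aggregation are assumptions for illustration; none of them come from the sources above.

```python
# BAGGing sketch: bootstrapped subsamples -> one Decision Tree per subsample -> majority vote.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=5, random_state=0)  # toy data (assumption)
rng = np.random.default_rng(0)
n_trees = 25
trees = []

for _ in range(n_trees):
    # Bootstrap: sample indices with replacement, same size as the original data
    idx = rng.integers(0, len(X), size=len(X))
    trees.append(DecisionTreeClassifier().fit(X[idx], y[idx]))

# Aggregate: majority vote over the individual trees' predictions
votes = np.stack([t.predict(X) for t in trees])      # shape: (n_trees, n_samples)
y_pred = (votes.mean(axis=0) >= 0.5).astype(int)     # binary majority vote
print("training accuracy of the bagged ensemble:", (y_pred == y).mean())
```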
Good Resources:
https://towardsdatascience.com/boosting-and-adaboost-clearly-explained-856e21152d3e
https://www.educba.com/bagging-and-boosting/?source=leftnav
https://towardsdatascience.com/ensemble-methods-bagging-boosting-and-stacking-c9214a10a205
Bootstrapping
Confidence Interval for a Proportion.
We take the sample data, copy it over and over, and treat the copies as the population; this is the concept of
the bootstrap. Each bootstrap sample is the same size as the original sample but is drawn from it at random.
The first bootstrap sample came out 24/16; this one comes out 20/20. In StatKey the original sample has a count
of 24 out of 40. We then generate many bootstrap samples and plot the resulting bootstrap distribution.
https://youtu.be/655X9eZGxls
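A small sketch of the bootstrap confidence interval for a proportion described above. The 24-out-of-40 sample comes from the video; the number of resamples and the 95% percentile interval are assumptions for illustration.

```python
# Bootstrap CI for a proportion: resample with replacement, record the proportion each time.
import numpy as np

sample = np.array([1] * 24 + [0] * 16)   # 24 "successes" out of 40 observations
rng = np.random.default_rng(0)

# Each bootstrap sample is the same size as the original and drawn with replacement
props = np.array([
    rng.choice(sample, size=sample.size, replace=True).mean()
    for _ in range(10_000)
])

# A 95% percentile confidence interval for the proportion
lo, hi = np.percentile(props, [2.5, 97.5])
print(f"observed proportion: {sample.mean():.2f}, 95% bootstrap CI: ({lo:.2f}, {hi:.2f})")
```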
Bagging
Bagging refers to Bootstrap Aggregating. There is another way we can build an ensemble of learners:
we can build them using the same learning algorithm but train each learner on a different set of data.
This is what we call bootstrap aggregating, or bagging.
What we do is create a number of subsets of the data; think of them as different bags of data, each one
a subset of the original data. We collect this data randomly: we create M different bags, where each bag
contains n' data instances chosen at random with replacement.
n: number of instances in the original data
M: number of bags
n' < n: about 60% of n, so each bag has about 60% as many training instances
Now we use each of these collections of data to train a different model. We now have M different models,
each one trained on slightly different data. Just as with an ensemble of different learning algorithms,
here we have an ensemble of different models that we query in the same way:
we query each model with the same X and collect all of their outputs.
We take the Y output of each model and take their mean, and that mean is the Y of the ensemble.
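A sketch of that procedure, under assumed choices (a toy regression dataset, decision-tree regressors, M = 20 bags of n' ≈ 0.6·n instances); the source describes the idea but gives no code.

```python
# Bagging: M bags drawn with replacement, one model per bag, mean of outputs as the ensemble's y.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(1)
X = rng.uniform(-3, 3, size=(300, 1))
y = np.sin(X[:, 0]) + rng.normal(scale=0.2, size=300)   # toy regression data (assumption)

n, M = len(X), 20
n_prime = int(0.6 * n)                                   # each bag holds ~60% as many instances

models = []
for _ in range(M):
    idx = rng.integers(0, n, size=n_prime)               # chosen at random with replacement
    models.append(DecisionTreeRegressor(max_depth=4).fit(X[idx], y[idx]))

# Query every model with the same X and average their outputs
X_query = np.linspace(-3, 3, 5).reshape(-1, 1)
y_ensemble = np.mean([m.predict(X_query) for m in models], axis=0)
print(y_ensemble)
```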
https://www.udacity.com/course/machine-learning-for-trading--ud501
https://youtu.be/2Mg8QD0F1dQ
Boosting
Boosting is a fairly simple variation on bagging that strives to improve the learners by focusing on areas
where the system is not performing well. One of the most well-known algorithms is AdaBoost,
where "Ada" stands for adaptive.
We build our first bag of data, D1, in the usual way: we select randomly from our training data and train a
model in the usual way. The next thing we do is different: we take all of our training data and use it to test
the model, in order to discover which points, which x's and y's, are not well predicted. There will be some
points in the training data with significant error.
Now, when we build our next bag of data, D2, we again choose randomly from the original data, but each
instance is weighted according to its error. So the points that had significant error are more likely to get
picked to go into D2. We build a model from this data, and when we test it, we again test the training data on
both models, M1 and M2, and again measure the error across the data. We repeat this until we have built the
number of models we want or the number of bags is exhausted.
So, bagging is simply choosing subsets of the data at random with replacement, creating each bag in the
same way. Boosting adds to this idea: in subsequent bags we preferentially choose those data instances
that the overall system so far has modeled poorly. A rough sketch of this is shown below.
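The sketch below follows the boosting-by-resampling description above: each new bag over-samples the training points the current ensemble predicts poorly. The dataset, learner, number of bags, and the simple re-weighting rule are assumptions for illustration; this is not the exact AdaBoost weight-update formula.

```python
# Simplified boosting by resampling: misclassified points get larger sampling weights.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=6, random_state=0)  # toy data (assumption)
rng = np.random.default_rng(0)
n, n_bags = len(X), 10
weights = np.full(n, 1.0 / n)          # start with uniform sampling weights
models = []

for _ in range(n_bags):
    # Build the next bag: points with larger error weights are more likely to be picked
    idx = rng.choice(n, size=n, replace=True, p=weights)
    models.append(DecisionTreeClassifier(max_depth=2).fit(X[idx], y[idx]))

    # Test the ensemble so far on ALL training data and re-weight the misclassified points
    votes = np.mean([m.predict(X) for m in models], axis=0) >= 0.5
    errors = (votes != y).astype(float)
    weights = errors + 0.1              # small floor so every point keeps some chance
    weights /= weights.sum()

ensemble_pred = np.mean([m.predict(X) for m in models], axis=0) >= 0.5
print("training accuracy of the boosted ensemble:", (ensemble_pred == y).mean())
```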
https://www.youtube.com/watch?v=GM3CDQfQ4sw