Boosting: An Ensemble Learning Technique

Kaleab Tadesse
FTP0848/14
Addis Ababa Science and Technology University
Email: [email protected]

INTRODUCTION

Boosting is an ensemble learning method. It involves combining a set of low-accuracy classifiers to create a highly accurate classifier. The core idea is to build models sequentially, where each subsequent model attempts to correct the errors of the model before it. This is done by building a model from the training data, then creating a second model that attempts to correct the errors of the first, and so on, adding models until the training set is predicted perfectly or a maximum number of models is reached. Boosting is primarily a bias reduction technique; in contrast, bagging (as in Random Forests) is primarily a variance reduction technique. While boosting reduces bias, boosting too much will eventually increase variance. Highly accurate classifiers produced by boosting can achieve an error rate close to 0. Boosting algorithms can track which base models failed to predict accurately, and the combined model is less affected by overfitting than the individual weak models.

Some of the boosting techniques used to train ML models are:

1. AdaBoost (Adaptive Boosting)

• AdaBoost is based on the observation that finding many rough rules of thumb can be easier than finding a single highly accurate rule. It is a general method for improving the accuracy of any given learning algorithm.
• The algorithm calls a base learning algorithm repeatedly, each time feeding it a different subset of training examples or, more precisely, a different distribution or weighting over the training examples.
• The key idea is to maintain a distribution or set of weights over the training set. On each round, the weights of incorrectly classified examples are increased, forcing the base learner to focus on the "hard" examples.
• Different models are obtained by reweighting the training data every iteration. This process aims to reduce underfitting by focusing on the "hard" training examples.
• After each round, AdaBoost chooses a parameter (αt) that measures the importance assigned to the base classifier produced in that round.
• The final or combined classifier is a weighted majority vote of the base classifiers, with (αt) being the weight assigned to the classifier (ht).
• Base models used in AdaBoost should be simple, so that different instance weights lead to different models. Each simple model acts as an "expert" on some parts of the data.
• The final AdaBoost model is an additive model, where predictions are the sum of base model predictions, and each base model receives a unique weight related to its weighted error rate.
• AdaBoost works by reducing the margin of training examples, especially those with the smallest margins. Larger margins on the training set translate into a better upper bound on generalization error.
• Bias-variance analysis: AdaBoost reduces bias (it targets underfitting problems).
• There is a close connection between AdaBoost and logistic regression: AdaBoost minimizes an exponential loss function, which is an upper bound on the logistic loss minimized in logistic regression.
• AdaBoost has practical advantages: it is fast, simple, and easy to program. It generally has no parameters to tune except the number of rounds, requires no prior knowledge about the base learner, and can be combined flexibly with various methods.
• AdaBoost can identify outliers, i.e., examples that are mislabeled or inherently ambiguous, by focusing weight on the hardest examples. However, it can be susceptible to noise when the number of outliers is large. Variants such as Gentle AdaBoost and BrownBoost exist to handle noisy data and de-emphasize outliers.
• AdaBoost can be extended to multiclass classification using methods such as AdaBoost.M1, AdaBoost.MH, and AdaBoost.M2. Some variants use real-valued outputs, which can speed up boosting.
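To make the reweighting scheme above concrete, here is a minimal sketch of discrete AdaBoost for binary labels in {-1, +1}, using scikit-learn decision stumps as the simple base models. The function names and the 50-round default are illustrative assumptions, not anything prescribed above.

```python
# A minimal sketch of discrete AdaBoost for binary labels y in {-1, +1}.
# Names (adaboost_fit, adaboost_predict) and defaults are illustrative choices.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def adaboost_fit(X, y, n_rounds=50):
    y = np.asarray(y)
    n = len(y)
    w = np.full(n, 1.0 / n)                 # distribution over the training set
    stumps, alphas = [], []
    for _ in range(n_rounds):
        stump = DecisionTreeClassifier(max_depth=1)   # a simple "rule of thumb"
        stump.fit(X, y, sample_weight=w)
        pred = stump.predict(X)
        eps = np.clip(np.sum(w[pred != y]), 1e-10, 1 - 1e-10)  # weighted error
        alpha = 0.5 * np.log((1 - eps) / eps)  # importance of this round's classifier
        w = w * np.exp(-alpha * y * pred)      # raise weights of misclassified examples
        w = w / w.sum()
        stumps.append(stump)
        alphas.append(alpha)
    return stumps, alphas

def adaboost_predict(X, stumps, alphas):
    # weighted majority vote of the base classifiers
    score = sum(alpha * stump.predict(X) for stump, alpha in zip(stumps, alphas))
    return np.sign(score)
```

Each round the misclassified examples gain weight, and the returned (αt) values are exactly the per-round importances used in the final weighted majority vote.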
2. Gradient Boosting

• Gradient Boosting is an ensemble method where models are built sequentially, with each model fixing the remaining mistakes of the previous ones.
• In each iteration, the task of the new model is to predict the residual error of the current ensemble's prediction.
• Pseudo-residuals are computed based on a differentiable loss function, such as least squares loss for regression or log loss for classification.
• The algorithm uses a gradient descent approach: predictions are updated step by step until convergence.
• The final model is an additive model, with predictions being the sum of the base model predictions.
• A learning rate is often used, which scales the contribution of each new model. Small updates with a learning rate typically work better as they reduce variance.
• Base models should generally be low variance and flexible enough to accurately predict the residuals, such as decision trees of depth 2-5.
• For regression using square loss, the pseudo-residuals are simply the prediction errors. A new regression tree is fitted to these errors.
• For classification using log loss, the base models predict the probability of the positive class. The pseudo-residuals are the difference between the true class (0 or 1) and the predicted probability.
• Bias-variance analysis: Gradient Boosting is highly effective at reducing bias error. Like other boosting methods, boosting too much can eventually increase variance.
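As an illustration of the square-loss case described above, the following minimal sketch fits a small regression tree to the current prediction errors in each round and adds its contribution scaled by the learning rate. Names and default settings are illustrative assumptions.

```python
# A minimal gradient boosting sketch for regression with square loss, where the
# pseudo-residuals are simply the current prediction errors.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def gradient_boost_fit(X, y, n_rounds=100, learning_rate=0.1, max_depth=3):
    y = np.asarray(y, dtype=float)
    f0 = y.mean()                          # initial constant prediction
    pred = np.full(len(y), f0)
    trees = []
    for _ in range(n_rounds):
        residuals = y - pred               # pseudo-residuals for square loss
        tree = DecisionTreeRegressor(max_depth=max_depth)
        tree.fit(X, residuals)             # new base model targets the remaining error
        pred = pred + learning_rate * tree.predict(X)   # small, shrunken update
        trees.append(tree)
    return f0, trees

def gradient_boost_predict(X, f0, trees, learning_rate=0.1):
    # additive model: initial guess plus the sum of scaled base model predictions
    return f0 + learning_rate * sum(tree.predict(X) for tree in trees)
```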
3. Extreme Gradient Boosting (XGBoost)

• It is a variant of gradient tree boosting.
• The most significant factor behind XGBoost's success is its scalability in all scenarios. It can run much faster than existing solutions on a single machine and scales to billions of examples in distributed or memory-limited settings. It can process terabyte-size datasets.
• Factors that contribute to its scalability:
  – A sparsity-aware algorithm to handle sparse data efficiently. It adds a default direction to tree nodes for missing values and processes only non-missing entries, making complexity linear in the number of non-missing entries.
  – A theoretically justified weighted quantile sketch for efficient approximate tree learning. This method proposes candidate split points based on percentiles of feature values, weighted by the second-order gradient statistics (hi). This is a novel approach for handling weighted data in quantile computation.
  – An effective cache-aware block structure for parallel and out-of-core learning. Data is stored in blocks in a compressed column (CSC) format, sorted by feature value. This structure reduces sorting costs and optimizes split finding. Cache-aware prefetching is used for the exact greedy algorithm to improve performance on large datasets, and block size is tuned for the approximate algorithm to balance parallelization and cache performance.
  – Support for out-of-core computation by dividing data into blocks stored on disk, using independent pre-fetching threads to overlap computation and disk reading. Techniques like block compression and sharding data onto multiple disks are used to improve disk I/O throughput.
• XGBoost incorporates a regularized learning objective beyond traditional gradient boosting, which penalizes the complexity of the tree models (number of leaves and L2 norm of leaf weights) to help prevent overfitting.
• It also utilizes shrinkage and feature (column) subsampling to further prevent overfitting. Column subsampling can also speed up computation.
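As a usage-level sketch, the snippet below shows how these ideas surface in the xgboost Python package's scikit-learn style interface. The synthetic data and all parameter values are arbitrary placeholders; reg_lambda, colsample_bytree, subsample, and learning_rate map to the regularized objective, column subsampling, row subsampling, and shrinkage discussed above.

```python
# A hedged usage sketch with the xgboost package's scikit-learn interface.
# The synthetic data and every parameter value below are arbitrary placeholders.
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 10))
y = 2.0 * X[:, 0] - X[:, 1] + rng.normal(scale=0.1, size=1000)
X[rng.random(X.shape) < 0.1] = np.nan    # leave missing entries as NaN; the
                                         # sparsity-aware splits learn a default
                                         # direction for them

model = xgb.XGBRegressor(
    n_estimators=200,        # number of boosting rounds
    max_depth=4,             # shallow trees as base models
    learning_rate=0.1,       # shrinkage on each tree's contribution
    reg_lambda=1.0,          # L2 penalty on leaf weights (regularized objective)
    colsample_bytree=0.8,    # feature (column) subsampling
    subsample=0.8,           # row subsampling
)
model.fit(X, y)
print(model.predict(X[:5]))
```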
SUMMARY

In summary, Boosting is a powerful ensemble learning technique that constructs a strong predictive model by sequentially combining predictions from multiple simpler, typically low-performing base models. The fundamental principle involves fitting each subsequent model to correct the errors or residual errors made by the models before it in the sequence, a process that primarily aims to reduce bias. Key boosting algorithms include AdaBoost, which adaptively recomputes weights to focus on previously misclassified examples, and Gradient Boosting, which builds models to predict the negative gradient of a loss function with respect to the current ensemble's prediction. Advanced implementations like XGBoost enhance gradient boosting with optimizations for scalability, sparsity handling, and regularization to achieve high performance efficiently. Ultimately, this iterative error correction process enables boosting methods to yield improved accuracy compared to individual base models.

REFERENCES

[1] T. Chen and C. Guestrin, "XGBoost: A scalable tree boosting system," in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD '16. ACM, Aug. 2016, pp. 785–794. [Online]. Available: http://dx.doi.org/10.1145/2939672.2939785
[2] J. H. Friedman, "Greedy function approximation: A gradient boosting machine," The Annals of Statistics, vol. 29, no. 5, pp. 1189–1232, 2001. [Online]. Available: http://www.jstor.org/stable/2699986
