Gradient Boosting in ML
Gradient Boosting is a popular boosting algorithm in machine learning used for classification and
regression tasks. Boosting is an ensemble learning method that trains models sequentially, with each
new model trying to correct the errors of the previous one, so that several weak learners combine
into a strong learner. The two most popular boosting algorithms are:
1. AdaBoost
2. Gradient Boosting
Gradient Boosting
Gradient Boosting is a powerful boosting algorithm that combines several weak learners into a strong
learner. Each new model is trained to minimize a loss function, such as mean squared error or
cross-entropy, of the current ensemble using gradient descent. In each iteration, the algorithm
computes the gradient of the loss function with respect to the predictions of the current ensemble
and then trains a new weak model to predict this negative gradient. The predictions of the new model
are then added to the ensemble, and the process is repeated until a stopping criterion is met.
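For regression with squared-error loss, this negative gradient is simply the residual y - F(x), so the whole loop fits in a few lines. The sketch below is only an illustration of the idea, not code from this article; it assumes scikit-learn's DecisionTreeRegressor as the weak learner, and the function names and default parameters (gradient_boost_fit, n_stages, learning_rate) are made up for the example.

import numpy as np
from sklearn.tree import DecisionTreeRegressor

def gradient_boost_fit(X, y, n_stages=100, learning_rate=0.1, max_depth=2):
    """Fit a gradient-boosted tree ensemble for squared-error regression."""
    # Initial prediction: the constant that minimizes squared error is the mean.
    f0 = float(np.mean(y))
    pred = np.full(len(y), f0)
    trees = []
    for _ in range(n_stages):
        # For squared error, the negative gradient with respect to the current
        # predictions is just the residual y - F(x).
        residuals = y - pred
        tree = DecisionTreeRegressor(max_depth=max_depth)
        tree.fit(X, residuals)
        # Shrink the new tree's contribution by the learning rate and
        # add it to the ensemble.
        pred = pred + learning_rate * tree.predict(X)
        trees.append(tree)
    return f0, trees

def gradient_boost_predict(X, f0, trees, learning_rate=0.1):
    # Start from the constant prediction and add each shrunken tree.
    pred = np.full(X.shape[0], f0)
    for tree in trees:
        pred = pred + learning_rate * tree.predict(X)
    return pred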
In contrast to AdaBoost, the weights of the training instances are not tweaked; instead, each
predictor is trained using the residual errors of its predecessor as labels. A widely used variant of
this technique is Gradient Boosted Trees, whose base learner is CART (Classification and Regression
Trees). The following walk-through shows how gradient-boosted trees are trained for regression
problems.
The ensemble consists of M trees. Tree1 is trained using the feature matrix X and the labels y. Its
predictions, y1(hat), are used to compute the training-set residual errors r1. Tree2 is then trained
using the feature matrix X and the residual errors r1 of Tree1 as labels. Its predictions, r1(hat),
are then used to determine the residuals r2. The process is repeated until all M trees forming the
ensemble are trained. An important parameter in this technique is shrinkage: the prediction of each
tree in the ensemble is shrunk by multiplying it by the learning rate (eta), which ranges between 0
and 1. There is a trade-off between eta and the number of estimators: a lower learning rate must be
compensated by more estimators to reach a given level of model performance. Once all the trees are
trained, predictions can be made. Each tree predicts a label and the final prediction is given by the
formula

y(pred) = y1(hat) + eta * r1(hat) + eta * r2(hat) + ... + eta * rM-1(hat)
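The trade-off between eta and the number of estimators can be seen directly with scikit-learn's GradientBoostingRegressor. The snippet below is a small illustration on a synthetic dataset; the particular parameter pairs and the make_regression settings are arbitrary choices for the example, not values from the article.

from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=1000, n_features=10, noise=10.0, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# A smaller learning rate (eta) generally needs more trees to reach comparable
# accuracy, while a larger eta needs fewer trees but can overfit sooner.
for eta, n_trees in [(0.5, 50), (0.1, 250), (0.05, 500)]:
    model = GradientBoostingRegressor(learning_rate=eta, n_estimators=n_trees,
                                      max_depth=3, random_state=42)
    model.fit(X_train, y_train)
    mse = mean_squared_error(y_test, model.predict(X_test))
    print(f"eta={eta}, trees={n_trees}, test MSE={mse:.2f}")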
AdaBoost vs Gradient Boosting
AdaBoost: during each iteration, the weights of incorrectly classified samples are increased, so that
the next weak learner focuses more on these samples. AdaBoost is more susceptible to noise and
outliers in the data, as it assigns high weights to misclassified samples.
Gradient Boosting: instead of reweighting samples, each iteration computes the negative gradient of
the loss function with respect to the predicted output and fits the next learner to it. Gradient
Boosting is generally more robust, as these gradient-based updates are less sensitive to outliers.
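Both algorithms are available in scikit-learn with very similar interfaces (AdaBoostClassifier and GradientBoostingClassifier), which makes a quick empirical comparison easy. The snippet below is only a sketch: the synthetic dataset, the amount of label noise (flip_y) and the hyperparameters are arbitrary choices for illustration.

from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

# A noisy toy classification problem (flip_y adds label noise).
X, y = make_classification(n_samples=1000, n_features=20, flip_y=0.1, random_state=0)

# AdaBoost reweights the training samples; Gradient Boosting fits each new
# tree to the negative gradient of the loss instead.
ada = AdaBoostClassifier(n_estimators=200, random_state=0)
gbc = GradientBoostingClassifier(n_estimators=200, learning_rate=0.1, random_state=0)

print("AdaBoost CV accuracy:", cross_val_score(ada, X, y, cv=5).mean())
print("Gradient Boosting CV accuracy:", cross_val_score(gbc, X, y, cv=5).mean())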
Gradient Boosting Algorithm
Step 1:
Let's assume X and Y are the input and target, having N samples. Our goal is to learn a function f(x)
that maps the input features X to the target variable y. The model is a sum of trees, i.e. boosted
trees. For regression, the loss function measures the difference between the actual and the predicted
values, for example the squared error

L(f) = sum over i = 1..N of (y_i - f(x_i))^2

and we want the f that minimizes this loss. If our gradient boosting algorithm runs for M stages,
then to improve the current model F_m the algorithm can add a new estimator h_m, with 1 <= m <= M,
giving

F_{m+1}(x_i) = F_m(x_i) + h_m(x_i)
Step 4: Solution
For the squared-error loss, the negative gradient of the loss with respect to the current prediction
F_m(x_i) is simply the residual y_i - F_m(x_i), so the new estimator h_m is fit to these residuals.
Similarly, applying the update for all M trees gives the final model

F_M(x) = F_{M-1}(x) + h_M(x)
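For reference, the generic procedure for an arbitrary differentiable loss L can be written compactly as follows. This is the standard textbook form of gradient boosting (with eta as the shrinkage factor), given here as a summary rather than taken verbatim from the steps above.

\begin{align*}
F_0(x) &= \arg\min_{\gamma} \sum_{i=1}^{N} L(y_i, \gamma) && \text{initialize with a constant} \\
r_{im} &= -\left[ \frac{\partial L(y_i, F(x_i))}{\partial F(x_i)} \right]_{F = F_{m-1}} && \text{pseudo-residuals, } i = 1, \dots, N \\
h_m &\;\text{is a weak learner fit to } \{(x_i, r_{im})\}_{i=1}^{N} && \\
\gamma_m &= \arg\min_{\gamma} \sum_{i=1}^{N} L\bigl(y_i, F_{m-1}(x_i) + \gamma\, h_m(x_i)\bigr) && \text{line search} \\
F_m(x) &= F_{m-1}(x) + \eta\, \gamma_m\, h_m(x) && m = 1, \dots, M
\end{align*}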