Datagiri: Presented 17 November By: Himanshu Shrivastava
Objective of Machine Learning?
• Find a function from data that can make accurate predictions on future, unseen data
https://www.wired.com/2012/04/netflix-prize-costs/
https://www.risk.net/asset-management/6119616/blackrock-shelves-unexplainable-ai-liquidity-models
Good Visualizations!!!
Tree-based models: http://www.r2d3.us/visual-intro-to-machine-learning-part-1/
Bias-variance tradeoff: http://www.r2d3.us/visual-intro-to-machine-learning-part-2/
Bias-Variance Trade-off (Model Complexity vs. Error)
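A minimal sketch (not part of the original deck) that makes the trade-off concrete with decision trees of growing depth: shallow trees underfit (high bias), very deep trees overfit (high variance), and validation error is lowest somewhere in between.

# Illustrate the bias-variance trade-off: as tree depth (model complexity)
# grows, training error keeps falling while validation error eventually rises.
import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.RandomState(0)
X = rng.uniform(0, 10, size=(500, 1))
y = np.sin(X[:, 0]) + rng.normal(scale=0.3, size=500)   # noisy target

X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

for depth in (1, 3, 6, 12):
    tree = DecisionTreeRegressor(max_depth=depth, random_state=0).fit(X_tr, y_tr)
    err_tr = mean_squared_error(y_tr, tree.predict(X_tr))
    err_val = mean_squared_error(y_val, tree.predict(X_val))
    print(f"depth={depth:2d}  train MSE={err_tr:.3f}  val MSE={err_val:.3f}")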
Additive Modeling
Foundation of bagging and boosting algorithms
Illustration:
Mathematically,
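The equation itself is not reproduced in this text; the standard form of an additive model, which the slide presumably showed, is:

% Additive model: the final predictor is a weighted sum of M simple base learners.
F(x) \;=\; \sum_{m=1}^{M} \beta_m \, f_m(x)

where each f_m is a simple (weak) model, such as a shallow tree, and \beta_m is its weight; bagging and boosting differ in how the f_m and \beta_m are obtained.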
Random Forest Classifier – Uses Bagging
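A minimal sketch (not from the slides), assuming scikit-learn: a random forest bags many decision trees, each grown on a bootstrap sample with a random subset of features considered at each split, and combines their votes.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic classification data for illustration.
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

forest = RandomForestClassifier(
    n_estimators=200,     # number of bagged trees
    max_features="sqrt",  # random feature subset considered at each split
    bootstrap=True,       # each tree is trained on a bootstrap sample
    random_state=0,
).fit(X_tr, y_tr)
print("test accuracy:", forest.score(X_te, y_te))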
AdaBoost
In each stage, introduce a weak learner to compensate for the shortcomings of the existing weak learners (a brief usage sketch follows).
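A minimal sketch (not from the slides) of AdaBoost in scikit-learn: by default the weak learner is a depth-1 decision tree (a "stump"), and after each stage misclassified points receive higher weight so the next stump focuses on them.

from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

# Synthetic binary classification data for illustration.
X, y = make_classification(n_samples=1000, n_features=20, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=1)

# Each of the 100 stages adds one weak learner (a decision stump by default)
# trained on re-weighted data that emphasises previously misclassified points.
ada = AdaBoostClassifier(n_estimators=100, learning_rate=0.5, random_state=1)
ada.fit(X_tr, y_tr)
print("test accuracy:", ada.score(X_te, y_te))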
Building Intuition – Gradient Boosting Machines
• You are given (x1, y1), (x2, y2), ..., (xn, yn), and the task is to fit a model F(x) that minimizes the squared loss.
• Suppose a friend wants to help and gives you a model F. You check the model and find it is good but not perfect.
• There are some mistakes: F(x1) = 0.8 while y1 = 0.9, and F(x2) = 1.4 while y2 = 1.3, ...
• You are not allowed to remove anything from F or change any parameter in F.
• You can, however, add an additional model (a regression tree) h to F, so the new prediction becomes F(x) + h(x) (see the sketch after this list).
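A minimal sketch (not from the slides) of exactly one such correction step: fit a first model F, compute the residuals y - F(x), fit a second tree h to those residuals, and use F(x) + h(x) as the improved prediction.

import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.metrics import mean_squared_error

rng = np.random.RandomState(0)
X = rng.uniform(0, 10, size=(300, 1))
y = np.sin(X[:, 0]) + rng.normal(scale=0.2, size=300)

F = DecisionTreeRegressor(max_depth=2, random_state=0).fit(X, y)    # the friend's model
residuals = y - F.predict(X)                                        # what F gets wrong
h = DecisionTreeRegressor(max_depth=2, random_state=0).fit(X, residuals)

improved = F.predict(X) + h.predict(X)                              # F(x) + h(x)
print("MSE of F alone :", mean_squared_error(y, F.predict(X)))
print("MSE of F + h   :", mean_squared_error(y, improved))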
yi − F(xi) are called residuals.
These are the parts that the existing model F cannot do well. The role of h is to compensate for the shortcomings of the existing model F.
Is this procedure useful for test data as well? Yes! Because we are building a model, and the model can be applied to test data as well.
Gradient Boosting
• Gradient Boosting = Gradient Descent + Boosting
• In each stage, introduce a weak learner to compensate for the shortcomings of the existing weak learners.
• In AdaBoost the "shortcomings" are identified by high-weight data points; in gradient boosting they are identified by gradients. Both high-weight data points and gradients tell us how to improve our model (see the worked loss below).
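For the squared loss this connection can be made explicit (a standard derivation, added here for completeness): the negative gradient of the loss with respect to the current prediction is exactly the residual, so fitting h to the residuals is one step of gradient descent in function space.

% Squared loss and its negative gradient with respect to the prediction F(x_i)
L(y_i, F(x_i)) = \tfrac{1}{2}\bigl(y_i - F(x_i)\bigr)^2
\quad\Longrightarrow\quad
-\,\frac{\partial L(y_i, F(x_i))}{\partial F(x_i)} = y_i - F(x_i) = r_i

% Each boosting stage adds a weak learner h_m fitted to the residuals r_i,
% scaled by a learning rate \nu
F_m(x) = F_{m-1}(x) + \nu\, h_m(x)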
GBM / Residual Modeling?
GBM (Gradient Boosting Machine)
A sequence of simple trees built in succession, each fitted to the prediction residuals of the preceding tree, so as to improve on the prediction.
Algorithm (a code sketch follows the feature list below):
• The first tree is fitted to the data; the mean of y can be used as the initial prediction
• Residuals are computed: r_{mi} = y_i − F_{m−1}(x_i)
• A regression tree is fitted to the residuals from the preceding tree
• The process is repeated through a chain of successive trees
• The final predicted value is the sum of the weighted contributions of each tree
Features:
• They often achieve a degree of accuracy that cannot be obtained using a large, single-tree model.
• They can handle hundreds or thousands of potential predictor variables.
• Irrelevant predictor variables are identified automatically and do not affect the predictive model.
• They are invariant under all (strictly) monotone transformations of the predictor variables, so transformations such as a*x + b (a ≠ 0), log(x) or exp(x) do not affect the model.
• They can handle both continuous and categorical predictor and target variables.
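A minimal sketch (not from the slides) of the loop described above, assuming scikit-learn's DecisionTreeRegressor as the base learner: start from mean(y), repeatedly fit a small tree to the current residuals, and add its shrunken prediction to the running model.

import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.RandomState(0)
X = rng.uniform(0, 10, size=(400, 1))
y = np.sin(X[:, 0]) + rng.normal(scale=0.2, size=400)

n_trees, shrinkage = 100, 0.1          # number of boosting stages and learning rate
F = np.full_like(y, y.mean())          # initial prediction: mean(y)
trees = []

for m in range(n_trees):
    residuals = y - F                                  # r_mi = y_i - F_{m-1}(x_i)
    tree = DecisionTreeRegressor(max_depth=2, random_state=m).fit(X, residuals)
    F += shrinkage * tree.predict(X)                   # add weighted contribution
    trees.append(tree)

def predict(X_new):
    """Final prediction = mean(y) plus the weighted contribution of each tree."""
    pred = np.full(X_new.shape[0], y.mean())
    for tree in trees:
        pred += shrinkage * tree.predict(X_new)
    return pred

print("training MSE:", np.mean((y - predict(X)) ** 2))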
Illustration of iterative boosting
In the case of Gaussian regression (squared-error loss), gradient boosting is equivalent to iteratively re-fitting the residuals of the model.
m_stop, the number of boosting iterations, is chosen using cross-validation such that it maximizes out-of-sample accuracy (see the sketch below).
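A minimal sketch (not from the slides) of choosing m_stop by cross-validation, assuming scikit-learn's GradientBoostingRegressor: grid-search over n_estimators and keep the value with the best cross-validated score.

import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import GridSearchCV

rng = np.random.RandomState(0)
X = rng.uniform(0, 10, size=(500, 1))
y = np.sin(X[:, 0]) + rng.normal(scale=0.2, size=500)

# 5-fold cross-validation over the number of boosting iterations (m_stop).
search = GridSearchCV(
    GradientBoostingRegressor(learning_rate=0.1, max_depth=2, random_state=0),
    param_grid={"n_estimators": [25, 50, 100, 200, 400]},
    scoring="neg_mean_squared_error",
    cv=5,
)
search.fit(X, y)
print("m_stop chosen by CV:", search.best_params_["n_estimators"])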
Important parameters of GBM
• N-trees: the total number of trees to fit. This is equivalent to the number of iterations and to the number of basis functions in the additive expansion.
• Distribution: the loss to optimize. Available options are "gaussian" (squared error), "laplace" (absolute loss), "bernoulli" (logistic regression for 0-1 outcomes), "adaboost" (the AdaBoost exponential loss for 0-1 outcomes), "poisson" (count outcomes), and "coxph" (right-censored observations).
• Shrinkage: retards the learning rate of the series, so that the series is longer and accuracy is better.
• Interaction depth: the depth to which interactions among predictors are considered.
• Bag fraction: the fraction of the training data randomly sampled to grow each tree.
• Cross-validation: used to tune the parameters above, in particular the number of trees.
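The option names above appear to follow the R gbm package; below is a minimal sketch (not from the slides) of roughly equivalent knobs in scikit-learn's GradientBoostingRegressor (a recent scikit-learn version is assumed).

from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor

X, y = make_regression(n_samples=1000, n_features=10, noise=5.0, random_state=0)

gbm = GradientBoostingRegressor(
    n_estimators=500,        # N-trees: number of boosting iterations / basis functions
    loss="squared_error",    # Distribution: analogue of "gaussian" (squared error)
    learning_rate=0.05,      # Shrinkage: slows learning so the series is longer
    max_depth=3,             # Interaction depth: how deep interactions can go
    subsample=0.8,           # Bag fraction: random subsample used for each tree
    random_state=0,
).fit(X, y)
print("R^2 on training data:", gbm.score(X, y))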
Thank You