
ADVANCED MACHINE LEARNING
Module 1: Generalized Regression and HDLSS problems
Instructor: Amit Sethi
Co-developer: Neeraj Kumar
TAs: Gaurav Yadav, Niladri Bhattacharya
Page: AdvancedMachineLearning.weebly.com
IITG Course No: EE 622
Module objectives
• Understand linear and generalized linear regression
• Understand logistic regression as a special GLM
• Appreciate the link between logistic regression and linear discriminants
• Understand the role of various penalties in the HDLSS case
Linear regression

Source: Wikipedia
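The slide's figure is not reproduced here. As a minimal, hypothetical sketch of an ordinary least squares fit (the data and variable names below are assumptions, not taken from the slide):

```python
import numpy as np

# Toy data: an intercept column plus one feature (illustrative only)
rng = np.random.default_rng(0)
X = np.column_stack([np.ones(50), rng.normal(size=50)])
y = 2.0 + 3.0 * X[:, 1] + rng.normal(scale=0.5, size=50)

# Ordinary least squares: beta = argmin ||y - X beta||_2^2,
# solved here with a numerically stable least-squares routine
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
print(beta)  # roughly [2, 3]
```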
Solutions to linear regression

Generalized least squares

Source: Wikipedia
Minimizing the Lp norm of error

Source: Wikipedia
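As a rough numerical sketch of the same idea (hypothetical data; scipy is an assumed tool, and p = 2 recovers least squares while p = 1 gives least absolute deviations):

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(1)
X = np.column_stack([np.ones(100), rng.normal(size=100)])
y = 1.0 + 2.0 * X[:, 1] + rng.standard_t(df=2, size=100)  # heavy-tailed noise

def lp_loss(beta, p):
    # Sum of |residual|^p: the objective whose minimizer is the Lp-norm fit
    return np.sum(np.abs(y - X @ beta) ** p)

for p in (1.0, 1.5, 2.0):
    res = minimize(lp_loss, x0=np.zeros(2), args=(p,), method="Nelder-Mead")
    print(p, res.x)  # more robust fits for small p, least squares at p = 2
```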
Logistic regression as a GLM
• GLM

• Exponential family

Sources: Wikipedia and Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
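The slide's equations are not reproduced; as a standard reminder of the forms involved (the notation is assumed, not copied from the slide):

```latex
% Exponential family in natural-parameter form
p(y;\theta) = h(y)\,\exp\!\big(\theta^{\top} T(y) - A(\theta)\big)

% A GLM couples the mean to a linear predictor through a link function g
\mathbb{E}[y \mid x] = \mu, \qquad g(\mu) = w^{\top} x
```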
Bernoulli distribution → logistic function

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
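One way to see the arrow in this slide's title (a standard derivation, written here in assumed notation): writing the Bernoulli likelihood in exponential-family form exposes the log-odds as the natural parameter, and inverting it gives the logistic function.

```latex
p(y;\mu) = \mu^{y}(1-\mu)^{1-y}
         = \exp\!\Big( y \log\tfrac{\mu}{1-\mu} + \log(1-\mu) \Big)
\quad\Rightarrow\quad
\theta = \log\frac{\mu}{1-\mu},
\qquad
\mu = \frac{1}{1+e^{-\theta}}
```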
Two sides of the same coin
Generative vs. Discriminative (“diagnostic”)

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
Inference in the generative model

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
Logistic function is a natural choice for
Gaussian class conditional densities

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
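A brief sketch of the claim (a standard result, notation assumed): with two Gaussian class-conditional densities sharing a covariance Σ, Bayes' rule yields a posterior that is exactly a logistic function of a linear score.

```latex
P(C_1 \mid x)
= \frac{\pi_1 \,\mathcal{N}(x;\mu_1,\Sigma)}{\pi_1 \,\mathcal{N}(x;\mu_1,\Sigma) + \pi_0 \,\mathcal{N}(x;\mu_0,\Sigma)}
= \frac{1}{1+\exp\!\big(-(w^{\top}x + b)\big)},
\qquad
w = \Sigma^{-1}(\mu_1-\mu_0),
\quad
b = -\tfrac{1}{2}\mu_1^{\top}\Sigma^{-1}\mu_1 + \tfrac{1}{2}\mu_0^{\top}\Sigma^{-1}\mu_0 + \log\frac{\pi_1}{\pi_0}
```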
Looks familiar?

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
Generative vs. discriminative
Generative
• Belief network A is more modular
• Class-conditional densities are likely to be local, characteristic functions of the objects being classified, invariant to the nature and number of the other classes
• More "natural": deciding what kind of object to generate and then generating it from a recipe
• More efficient to estimate the model, if correct

Discriminative
• More robust
• Don't need a precise model specification, so long as it is from the exponential family
• Requires fewer parameters: O(n) as opposed to O(n²)

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
Cross-entropy loss function for maximum-likelihood estimation of θ

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
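For reference (the slide's own equation is not reproduced): with μᵢ = σ(θᵀxᵢ), the negative Bernoulli log-likelihood is the cross-entropy loss, so maximizing the likelihood in θ is the same as minimizing

```latex
\ell(\theta) = -\sum_{i=1}^{N}\Big[\, y_i \log \mu_i + (1-y_i)\log(1-\mu_i) \,\Big],
\qquad
\mu_i = \sigma(\theta^{\top} x_i) = \frac{1}{1+e^{-\theta^{\top} x_i}}
```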
Optimize for the parameters

Source: Why the logistic function? A tutorial discussion on probabilities and neural networks, by Michael I. Jordan ftp://psyche.mit.edu/pub/jordan/uai.ps
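A minimal sketch of one such optimization, assuming plain gradient descent on the cross-entropy loss (the slide may instead use Newton/IRLS; the function and variable names here are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, lr=0.1, n_iter=2000):
    """Gradient descent on the mean cross-entropy loss for logistic regression."""
    theta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = sigmoid(X @ theta)           # predicted probabilities
        grad = X.T @ (mu - y) / len(y)    # gradient of the mean cross entropy
        theta -= lr * grad
    return theta

# Tiny usage example on synthetic data
rng = np.random.default_rng(2)
X = np.column_stack([np.ones(200), rng.normal(size=200)])
y = (X[:, 1] + 0.3 * rng.normal(size=200) > 0).astype(float)
print(fit_logistic(X, y))
```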
Other issues
• What about nonlinear discriminant functions?
• Is logistic nonlinearity required in hidden layers of neural networks?
Regularization in regression
• Why regularize?

• Reduce variance, at the cost of bias

• Increase test (validation) accuracy

• Get interpretable models

• How to regularize?

• Shrink coefficients

• Reduce features
Coefficient shrinkage using ridge

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
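A small, hypothetical sketch of ridge shrinkage (scikit-learn's alpha plays the role of the penalty weight λ; the data below are made up):

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(3)
X = rng.normal(size=(40, 5))
y = X @ np.array([3.0, 0.0, -2.0, 0.0, 1.0]) + rng.normal(scale=0.5, size=40)

lam = 1.0
# Closed form for ridge without an intercept: (X^T X + lam I)^{-1} X^T y
beta_closed = np.linalg.solve(X.T @ X + lam * np.eye(5), X.T @ y)
beta_ridge = Ridge(alpha=lam, fit_intercept=False).fit(X, y).coef_
print(np.allclose(beta_closed, beta_ridge, atol=1e-6))  # same estimator
```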
Subset selection
• Set the coefficients with lowest absolute value to zero
LASSO both selects and shrinks

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
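A short illustrative run (hypothetical data; note that scikit-learn's Lasso scales the squared error by 1/(2n), so its alpha is not numerically identical to the λ in the paper):

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(4)
X = rng.normal(size=(60, 8))
true_beta = np.array([4.0, 0.0, 0.0, -3.0, 0.0, 0.0, 2.0, 0.0])
y = X @ true_beta + rng.normal(scale=0.5, size=60)

lasso = Lasso(alpha=0.2).fit(X, y)
print(lasso.coef_)  # several coefficients are exactly zero, the rest are shrunk
```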
Level sets of Lq norm of coefficients

Which one is ridge? Subset selection? Lasso?

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
Geometry of Lasso and Ridge

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
Coefficient shrinkage in orthonormal case

Subset selection Ridge

Lasso Garotte
Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
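For reference, in the orthonormal-design case (XᵀX = I) each penalized estimate is a simple function of the least-squares coefficient; up to the exact parametrization of the tuning constants these are (as in Tibshirani, 1996):

```latex
\text{Best subset (hard threshold):}\quad \hat\beta_j \,\mathbb{I}\big(|\hat\beta_j| > \lambda\big)
\qquad
\text{Ridge:}\quad \frac{\hat\beta_j}{1+\lambda}
\\[4pt]
\text{Lasso (soft threshold):}\quad \operatorname{sign}(\hat\beta_j)\big(|\hat\beta_j|-\lambda\big)_{+}
\qquad
\text{Garotte:}\quad \Big(1-\frac{\lambda}{\hat\beta_j^{2}}\Big)_{+}\hat\beta_j
```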
Can you argue the case for LASSO's coefficient shrinkage pattern in the orthonormal case?

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
Lasso can flip the signs of least-squares coefficients for d > 2

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
What about non-orthonormal case?

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
Lasso coefficient paths with decreasing λ

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
Compare to the coefficient shrinkage paths of ridge

Source: Sci-kit learn tutorial
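A small sketch comparing the two paths numerically (hypothetical data; scikit-learn's lasso_path and repeated Ridge fits are assumed tools, not the slide's code):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge, lasso_path

X, y = make_regression(n_samples=50, n_features=10, n_informative=3,
                       noise=5.0, random_state=0)

# Lasso path: coefficients hit exactly zero as the penalty grows
alphas_lasso, coefs_lasso, _ = lasso_path(X, y)

# Ridge path: coefficients shrink smoothly toward zero but stay nonzero
alphas_ridge = np.logspace(-2, 4, 50)
coefs_ridge = np.array([Ridge(alpha=a).fit(X, y).coef_ for a in alphas_ridge])

print("lasso zeros:", (coefs_lasso == 0).mean(),
      "ridge zeros:", (coefs_ridge == 0).mean())
```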


Lasso as Bayes estimate

Source: Regression Shrinkage and Selection via the Lasso, by Robert Tibshirani, Journal of Royal Stat. Soc., 1996
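As a one-line reminder of the correspondence (a standard result): with Gaussian noise and independent Laplace (double-exponential) priors on the coefficients, the posterior mode is exactly the lasso estimate,

```latex
p(\beta_j) \propto \exp(-\tau |\beta_j|)
\;\Longrightarrow\;
\arg\max_{\beta}\, p(\beta \mid y)
= \arg\min_{\beta}\, \|y - X\beta\|_2^{2} + \lambda \|\beta\|_1
```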
Smoothly Clipped Absolute Deviation
(SCAD) Penalty

Source: Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties, by Fan and Li, Journal of Am. Stat. Assoc., 2001
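For reference (from Fan and Li, 2001, with a > 2 and a = 3.7 suggested; the exact notation here is an assumption), the SCAD penalty is defined through its derivative, which stays constant near zero and decays to zero for large coefficients:

```latex
p'_{\lambda}(\beta) = \lambda \left\{ \mathbb{I}(\beta \le \lambda)
  + \frac{(a\lambda - \beta)_{+}}{(a-1)\lambda}\, \mathbb{I}(\beta > \lambda) \right\},
  \qquad \beta > 0
```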
Thresholding in three cases: no alteration of large coefficients by SCAD and hard thresholding

Source: Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties, by Fan and Li, Journal of Am. Stat. Assoc., 2001
Motivation for elastic net
• The p >> n problem and grouped selection
• Microarrays: p > 10,000 and n < 100.
• For those genes sharing the same biological “pathway”,
the correlations among them can be high.
• LASSO limitations
• If p > n, the lasso selects at most n variables before it saturates; the number of selected variables is bounded by the sample size.
• Grouped variables: the lasso fails to do grouped selection. It tends to select one variable from a group and ignore the others.

Source: Elastic net, by Zou and Hastie


Elastic net: use both L1 and L2 penalties

Source: Elastic net, by Zou and Hastie
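The penalty itself (not reproduced on the slide above) combines the two norms; in the naive elastic-net form of Zou and Hastie it reads, with λ₁, λ₂ ≥ 0:

```latex
\hat\beta = \arg\min_{\beta}\; \|y - X\beta\|_2^{2}
  + \lambda_1 \|\beta\|_1 + \lambda_2 \|\beta\|_2^{2}
```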


Geometry of elastic net

Source: Elastic net, by Zou and Hastie


Elastic net selects correlated variables as a "group"

Source: Elastic net, by Zou and Hastie


Elastic net selects correlated variables as a "group" and stabilizes the coefficient paths

Source: Elastic net, by Zou and Hastie


Why does the L2 penalty keep the coefficients of a group together?
• Try to think of an example with correlated variables (one such sketch follows below)
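One possible sketch (hypothetical data; the comments describe the usual grouping-effect behaviour, not a guarantee for every run): three nearly identical predictors stand in for genes on one pathway, and the L2 part of the elastic net encourages them to share the weight that the lasso tends to concentrate on a single column.

```python
import numpy as np
from sklearn.linear_model import Lasso, ElasticNet

rng = np.random.default_rng(5)
z = rng.normal(size=200)
# Three highly correlated predictors built from the same latent signal
X = np.column_stack([z + 0.1 * rng.normal(size=200) for _ in range(3)])
y = 3.0 * z + rng.normal(scale=0.5, size=200)

print(Lasso(alpha=0.1).fit(X, y).coef_)                      # weight tends to concentrate
print(ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y).coef_)   # weight tends to spread over the group
```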
Summary
• General
• Linear regression is a model with good mathematical properties
• Using a link function and iterative optimization, it can be extended to models from the exponential family
• Logistic regression is the natural, Bayes-optimal choice when the class-conditional densities come from exponential families
• The dispersion parameters of the two classes have to be the same
• Regularization and variable elimination in HDLSS
problems
• GLMs can be penalized for regularization
• Ridge penalty only shrinks the coefficients
• LASSO penalty selects a subset and produces constant shrinkage
• SCAD penalty selects a subset without altering large coefficients
• Elastic net keeps correlated variables together, while behaving like LASSO
