Naïve Bayes vs Perceptron
• Naïve Bayes predicts the class of an instance based on the probability of the instance belonging to each class
• It learns P(Y = y_k | X = x_i)
• The perceptron does not produce probability estimates
• It estimates θ from the training data
• For a new instance, it computes the sign of θᵀx_i
• Based on the sign, it assigns a class
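A minimal sketch (Python/NumPy, with made-up weights and a made-up instance) of the perceptron-style decision rule described above: no probability estimate, only the sign of θᵀx.

```python
import numpy as np

# Hypothetical learned weights and a new instance (first feature is the bias term).
theta = np.array([0.5, -1.2, 2.0])
x_new = np.array([1.0, 0.3, 0.8])

score = theta @ x_new              # real-valued score theta^T x
y_hat = 1 if score >= 0 else 0     # class assigned purely from the sign
print(score, y_hat)
```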
Logistic Regression
• Logistic regression takes a probabilistic approach to learning a classifier (a function)
• h_θ(x) should give p(y = 1 | x; θ)
• We want 0 ≤ h_θ(x) ≤ 1
• Logistic regression model:
  h_θ(x) = g(θᵀx),   where   g(z) = 1 / (1 + e^(−z))
  h_θ(x) = 1 / (1 + e^(−θᵀx)) = e^(θᵀx) / (1 + e^(θᵀx))
• The sigmoid first computes a real-valued score and then squashes it between 0 and 1 so that it can be interpreted as a probability
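A minimal sketch of the logistic model above, assuming NumPy; the values of θ and x are made up for illustration.

```python
import numpy as np

def sigmoid(z):
    """g(z) = 1 / (1 + e^(-z)): squashes a real score into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def h(theta, x):
    """Hypothesis h_theta(x) = g(theta^T x), read as p(y = 1 | x; theta)."""
    return sigmoid(theta @ x)

theta = np.array([-1.0, 2.0])      # illustrative parameters
x = np.array([1.0, 0.75])          # bias feature plus one input feature
print(h(theta, x))                 # a value strictly between 0 and 1
```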
Interpreting Hypothesis Output
• h_θ(x) = estimated p(y = 1 | x; θ) = 1 / (1 + e^(−θᵀx)) = e^(θᵀx) / (1 + e^(θᵀx))
• Note: p(y = 0 | x; θ) + p(y = 1 | x; θ) = 1
• So, p(y = 0 | x; θ) = 1 − p(y = 1 | x; θ) = 1 − 1 / (1 + e^(−θᵀx)) = 1 / (1 + e^(θᵀx))
• The log-odds (logit) of the model:
  log [ p(y = 1 | x; θ) / p(y = 0 | x; θ) ] = log e^(θᵀx) = θᵀx
• Thus, if θᵀx > 0, the positive class is more probable
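A small numeric check of the two identities above (θ and x are made-up values): the class probabilities sum to 1 and the log-odds recover θᵀx.

```python
import numpy as np

theta = np.array([0.2, -0.7, 1.5])
x = np.array([1.0, 2.0, 1.0])

z = theta @ x
p1 = 1.0 / (1.0 + np.exp(-z))      # p(y = 1 | x; theta)
p0 = 1.0 / (1.0 + np.exp(z))       # p(y = 0 | x; theta)

print(p0 + p1)                     # 1.0
print(np.log(p1 / p0), z)          # log-odds equals theta^T x
```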
Logistic Regression
• h_θ(x) = g(θᵀx),   g(z) = 1 / (1 + e^(−z))
• θᵀx should be a large negative value for negative instances
• θᵀx should be a large positive value for positive instances
• Assume a threshold and predict
  • y = 1 if h_θ(x) ≥ 0.5  (equivalently θᵀx ≥ 0)
  • y = 0 if h_θ(x) < 0.5  (equivalently θᵀx < 0)
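A sketch (illustrative θ and instances) showing that thresholding h_θ(x) at 0.5 gives exactly the same predictions as thresholding θᵀx at 0.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

theta = np.array([-0.5, 1.0])
X = np.array([[1.0, -2.0],
              [1.0,  0.4],
              [1.0,  3.0]])                          # rows are instances (bias feature first)

scores = X @ theta
pred_via_h = (sigmoid(scores) >= 0.5).astype(int)    # h_theta(x) >= 0.5
pred_via_z = (scores >= 0).astype(int)               # theta^T x >= 0
print(pred_via_h, pred_via_z)                        # identical class assignments
```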
Non-linear Decision Boundary
• Can apply basis function expansion to features
x = (1, x₁, x₂)ᵀ  →  (1, x₁, x₂, x₁x₂, x₁², x₂², x₁²x₂, x₁x₂², …)ᵀ
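A sketch of this basis expansion as a hand-rolled feature map; the function name `expand` and the test values are illustrative.

```python
import numpy as np

def expand(x1, x2):
    """Map (1, x1, x2) to the higher-order features listed on the slide."""
    return np.array([1.0, x1, x2,
                     x1 * x2,
                     x1**2, x2**2,
                     x1**2 * x2, x1 * x2**2])

print(expand(2.0, -1.0))
# Running logistic regression on these expanded features can produce a
# non-linear decision boundary in the original (x1, x2) space.
```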
Logistic Regression Cost Function
• We should not use the squared loss as in linear regression:
  J(θ) = (1 / 2n) Σᵢ₌₁ⁿ (h_θ(x⁽ⁱ⁾) − y⁽ⁱ⁾)²
• With the logistic regression model h_θ(x) = 1 / (1 + e^(−θᵀx)), this leads to a non-convex cost function
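A numeric sketch of the non-convexity claim, using a single made-up training point: along a 1-D slice of θ, the second finite difference of the squared loss changes sign.

```python
import numpy as np

x, y = 1.0, 1.0                               # one toy training instance
thetas = np.linspace(-6, 6, 241)              # a 1-D grid of parameter values
h = 1.0 / (1.0 + np.exp(-thetas * x))         # sigmoid predictions
J = 0.5 * (h - y) ** 2                        # squared loss at each theta

curv = np.diff(J, 2)                          # discrete second derivative of J
print(curv.min() < 0 < curv.max())            # True: curvature changes sign, so J is not convex
```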
Finding the Cost Function via MLE
• The likelihood of the data is given by L(θ) = ∏ᵢ₌₁ⁿ p(y⁽ⁱ⁾ | x⁽ⁱ⁾; θ)
• We want the θ that maximizes the likelihood:
  θ_MLE = argmax_θ L(θ) = argmax_θ ∏ᵢ₌₁ⁿ p(y⁽ⁱ⁾ | x⁽ⁱ⁾; θ)
  θ_MLE = argmax_θ log L(θ) = argmax_θ log ∏ᵢ₌₁ⁿ p(y⁽ⁱ⁾ | x⁽ⁱ⁾; θ) = argmax_θ Σᵢ₌₁ⁿ log p(y⁽ⁱ⁾ | x⁽ⁱ⁾; θ)
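A small aside on why working with the log is convenient (the per-instance probabilities below are random stand-ins for p(y⁽ⁱ⁾ | x⁽ⁱ⁾; θ)): the raw product underflows in floating point, while the sum of logs stays finite, and both are maximized by the same θ.

```python
import numpy as np

# Pretend these are p(y_i | x_i; theta) for 5000 training instances.
p = np.random.default_rng(0).uniform(0.6, 0.9, size=5000)

print(np.prod(p))          # 0.0 -- the product underflows
print(np.sum(np.log(p)))   # a finite, usable log-likelihood value
```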
Finding the Cost Function via MLE
• Each label y⁽ⁱ⁾ is binary, taking the value 1 with probability h_θ(x⁽ⁱ⁾)
• Assume Bernoulli likelihood
p(y | X, θ) = ∏ᵢ₌₁ⁿ p(y⁽ⁱ⁾ | x⁽ⁱ⁾; θ) = ∏ᵢ₌₁ⁿ [h_θ(x⁽ⁱ⁾)]^(y⁽ⁱ⁾) [1 − h_θ(x⁽ⁱ⁾)]^(1 − y⁽ⁱ⁾)
• The log-likelihood:
  ℓ(θ) = Σᵢ₌₁ⁿ [ y⁽ⁱ⁾ log h_θ(x⁽ⁱ⁾) + (1 − y⁽ⁱ⁾) log(1 − h_θ(x⁽ⁱ⁾)) ]
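A sketch of this Bernoulli log-likelihood in NumPy; the design matrix X, labels y, and θ below are made-up illustrative values.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def log_likelihood(theta, X, y):
    """l(theta) = sum_i [ y_i log h_i + (1 - y_i) log(1 - h_i) ]."""
    h = sigmoid(X @ theta)
    return np.sum(y * np.log(h) + (1 - y) * np.log(1 - h))

X = np.array([[1.0, 0.5], [1.0, -1.5], [1.0, 2.0]])   # rows: instances with a bias feature
y = np.array([1, 0, 1])
theta = np.array([0.1, 1.2])
print(log_likelihood(theta, X, y))
```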
The Cost Function
• Maximizing ℓ(θ) is equivalent to minimizing the negative log-likelihood (NLL):
  J(θ) = NLL(θ) = − Σᵢ₌₁ⁿ [ y⁽ⁱ⁾ log h_θ(x⁽ⁱ⁾) + (1 − y⁽ⁱ⁾) log(1 − h_θ(x⁽ⁱ⁾)) ]
• Cost of a single instance:
  cost(h_θ(x), y) = −log h_θ(x)        if y = 1
  cost(h_θ(x), y) = −log(1 − h_θ(x))   if y = 0
• The objective function:
  J(θ) = Σᵢ₌₁ⁿ cost(h_θ(x⁽ⁱ⁾), y⁽ⁱ⁾)
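A sketch of the objective J(θ) built from the per-instance cost above (same kind of illustrative toy data as before).

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost_one(h, y):
    """-log(h) if y = 1, -log(1 - h) if y = 0."""
    return -np.log(h) if y == 1 else -np.log(1.0 - h)

def J(theta, X, y):
    h = sigmoid(X @ theta)
    return sum(cost_one(h_i, y_i) for h_i, y_i in zip(h, y))

X = np.array([[1.0, 0.5], [1.0, -1.5], [1.0, 2.0]])
y = np.array([1, 0, 1])
print(J(np.array([0.1, 1.2]), X, y))
```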
Intuition
• cost(h_θ(x), y) = −log h_θ(x) if y = 1;   −log(1 − h_θ(x)) if y = 0
• If y = 1
  • cost = 0 for a confident correct prediction (h_θ(x) = 1)
  • As h_θ(x) → 0, cost → ∞
  • Mistakes should get large penalties
  • e.g., predicting h_θ(x) = 0 when y = 1
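A tiny numeric illustration of this behaviour (hand-picked probabilities): when y = 1, the cost −log h_θ(x) is near 0 for confident correct predictions and grows without bound as h_θ(x) → 0.

```python
import numpy as np

for h in [0.99, 0.9, 0.5, 0.1, 1e-6]:
    print(h, -np.log(h))    # cost when the true label is y = 1
```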
Intuition
• cost(h_θ(x), y) = −log h_θ(x) if y = 1;   −log(1 − h_θ(x)) if y = 0
• If y = 0
  • cost = 0 for a confident correct prediction (h_θ(x) = 0)
  • As (1 − h_θ(x)) → 0, cost → ∞
  • Mistakes should get large penalties
  • e.g., predicting h_θ(x) = 1 when y = 0
MAP formulation
Regularized Logistic Regression
• J(θ) = − Σᵢ₌₁ⁿ [ y⁽ⁱ⁾ log h_θ(x⁽ⁱ⁾) + (1 − y⁽ⁱ⁾) log(1 − h_θ(x⁽ⁱ⁾)) ]
• We can regularize logistic regression as:
  J_reg(θ) = J(θ) + λ Σⱼ₌₁ᵈ θⱼ² = J(θ) + λ ‖θ‖₂²
  J_reg(θ) = − Σᵢ₌₁ⁿ [ y⁽ⁱ⁾ log h_θ(x⁽ⁱ⁾) + (1 − y⁽ⁱ⁾) log(1 − h_θ(x⁽ⁱ⁾)) ] + λ Σⱼ₌₁ᵈ θⱼ²
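A sketch of the regularized objective; the toy data and λ are illustrative, and leaving the bias θ₀ unpenalized is an assumption consistent with the sum running over j = 1..d.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def J_reg(theta, X, y, lam):
    h = sigmoid(X @ theta)
    nll = -np.sum(y * np.log(h) + (1 - y) * np.log(1 - h))
    return nll + lam * np.sum(theta[1:] ** 2)   # L2 penalty on theta_1 .. theta_d only

X = np.array([[1.0, 0.5], [1.0, -1.5], [1.0, 2.0]])
y = np.array([1, 0, 1])
print(J_reg(np.array([0.1, 1.2]), X, y, lam=0.1))
```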
Estimating the Parameter