Introduction To Machine Learning: 2 Linear Classifiers

Varun Chandola

Contents

1 Classification

Linear Classifiers

Logistic Regression

[Figure: a linear classifier unit. Inputs x_1, x_2, ..., x_d with weights w_1, w_2, ..., w_d and a bias w_0 are summed and passed through an activation function that outputs a label in {-1, +1} according to whether w_0 + Σ_{j=1}^{d} w_j x_j ≥ 0.]

Decision Rule

y_i = −1 if w_0 + w^T x_i < 0
y_i = +1 if w_0 + w^T x_i ≥ 0

The unit vector along w is ŵ = w / |w|.

1 Classification
• A possible problem formulation: Learn f such that y = f(x)

2.1 Linear Classification via Hyperplanes

• Separates a D-dimensional space into two half-spaces
• Defined by w ∈ ℝ^D
[Figure: a hyperplane w · x + w_0 = 0 separating the space into the half-spaces w · x + w_0 > 0 and w · x + w_0 < 0, with the normal vector w shown and the hyperplane at offset w_0/‖w‖ from the origin along w.]
  – Orthogonal to the hyperplane
  – This w goes through the origin
  – How do you check if a point lies “above” or “below” w?
  – What happens for points on w?

For a hyperplane that passes through the origin, a point x will lie above the hyperplane if w^T x > 0 and below it if w^T x < 0. This can be understood by noting that w^T x is equal to |w||x| cos θ, where θ is the angle between w and x.
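To make the check concrete, the following is a minimal NumPy sketch of the side test, both for a hyperplane through the origin and, anticipating the bias term w_0 from the decision rule, for the shifted hyperplane. The weight vector, bias, and points are made up for illustration.

```python
import numpy as np

# Illustrative values only: a 2-D weight vector, a bias, and a few points.
w = np.array([2.0, 1.0])           # normal vector of the hyperplane
w0 = -1.0                          # bias (used once the hyperplane is shifted off the origin)
X = np.array([[1.0, 1.0],
              [-1.0, 0.5],
              [-0.5, 1.0]])        # each row is one point x

# Hyperplane through the origin: the side is the sign of w^T x
# (equivalently the sign of |w||x| cos(theta)); 0 means x lies on the hyperplane.
side = np.sign(X @ w)

# With a bias w0, the decision rule becomes the sign of w^T x + w0.
y_pred = np.where(X @ w + w0 >= 0, +1, -1)

print(side)    # e.g. [ 1. -1.  0.]
print(y_pred)  # labels in {-1, +1}
```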
• Add a bias w_0
  – w_0 > 0: move along w
  – w_0 < 0: move opposite to w
• How to check if a point lies above or below w?
  – If w^T x + w_0 > 0 then x is above
  – Else, below
• For binary classification, w points towards the positive class
  – w^T x + w_0 ≥ 0 ⇒ y = +1
  – w^T x + w_0 < 0 ⇒ y = −1

• Find a hyperplane that separates the data
  – . . . if the data is linearly separable
• But there can be many choices!
• Find the one with the lowest error

Learning w

0-1 Loss
• Hard to optimize
• Solution: replace it with a mathematically manageable loss

Different Loss Functions

Note
From now on, it is assumed that the intercept term and a constant feature are included in w and x_i, respectively.

• Squared Loss - Perceptron

J(w) = (1/2) Σ_{i=1}^{N} (y_i − w^T x_i)^2        (1)

• Logistic Loss - Logistic Regression

J(w) = (1/n) Σ_{i=1}^{n} log(1 + exp(−y_i w^T x_i))        (2)

Logistic Loss Function

• For one training observation:
  – if y_i = +1, the probability of the predicted value being +1 is

    p_i = 1 / (1 + exp(−w^T x_i))

  – if y_i = −1, the probability of the predicted value being −1 is

    p_i = 1 − 1 / (1 + exp(−w^T x_i)) = 1 / (1 + exp(w^T x_i))

  – in general,

    p_i = 1 / (1 + exp(−y_i w^T x_i))

• For logistic regression, the objective is to minimize the negative of the log probability:

J(w) = − Σ_{i=1}^{n} log(p_i) = Σ_{i=1}^{n} log(1 + exp(−y_i w^T x_i))
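As a concrete reference, the squared loss (1), the logistic loss (2), and the probability p_i can be written in a few lines of NumPy. This is only a sketch under the convention above that a constant feature is appended to each x_i so the intercept is part of w; the function names are illustrative, not from the notes.

```python
import numpy as np

def squared_loss(w, X, y):
    """Squared loss (1): 0.5 * sum_i (y_i - w^T x_i)^2, with the x_i as rows of X."""
    return 0.5 * np.sum((y - X @ w) ** 2)

def logistic_loss(w, X, y):
    """Logistic loss (2): (1/n) * sum_i log(1 + exp(-y_i w^T x_i)), labels in {-1, +1}."""
    return np.mean(np.log1p(np.exp(-y * (X @ w))))

def prob_of_label(w, X, y):
    """p_i = 1 / (1 + exp(-y_i w^T x_i)): probability assigned to the observed label y_i."""
    return 1.0 / (1.0 + np.exp(-y * (X @ w)))
```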
Geometric Interpretation

• Use regression to predict discrete values
• Squash the output to [0, 1] using the sigmoid function
• Output less than 0.5 is one class and greater than 0.5 is the other

Probabilistic Interpretation

• Probability of x belonging to class +1:

  P(y = +1 | x) = 1 / (1 + exp(−w^T x))

To understand why there is no closed form solution for maximizing the log-likelihood, we first differentiate J(w), as defined in (2), with respect to w:

∇J(w) = (d/dw) (1/n) Σ_{i=1}^{n} log(1 + exp(−y_i w^T x_i))
      = −(1/n) Σ_{i=1}^{n} [y_i / (1 + exp(y_i w^T x_i))] x_i

Given that ∇J(w) is a non-linear function of w, a closed form solution is not possible.
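The gradient above translates directly into code. The sketch below computes ∇J(w) for labels y_i ∈ {−1, +1}; the finite-difference checker is an addition of this sketch, not part of the notes, and logistic_loss is reused from the earlier sketch.

```python
import numpy as np

def grad_logistic_loss(w, X, y):
    """Gradient of (2): -(1/n) * sum_i [y_i / (1 + exp(y_i w^T x_i))] x_i."""
    n = X.shape[0]
    coef = -y / (1.0 + np.exp(y * (X @ w)))    # one scalar coefficient per observation
    return (X.T @ coef) / n                     # weighted sum of the x_i, shape (d,)

def numerical_grad(f, w, eps=1e-6):
    """Central finite-difference gradient, handy for checking the formula above."""
    g = np.zeros_like(w)
    for j in range(w.size):
        e = np.zeros_like(w)
        e[j] = eps
        g[j] = (f(w + e) - f(w - e)) / (2.0 * eps)
    return g
```

Comparing grad_logistic_loss(w, X, y) against numerical_grad(lambda v: logistic_loss(v, X, y), w) is a quick way to confirm the derivation.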
3.1 Using Gradient Descent for Learning Weights

• Compute the gradient of J(w) with respect to w
• A convex function of w with a unique global minimum

∇J(w) = −(1/n) Σ_{i=1}^{n} [y_i / (1 + exp(y_i w^T x_i))] x_i

• Update rule:

w_{k+1} = w_k − η ∇J(w_k)
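A minimal sketch of the resulting gradient-descent loop is shown below. The step size η, the iteration count, and the toy data are assumptions chosen only for illustration, and grad_logistic_loss comes from the earlier sketch.

```python
import numpy as np

def fit_logistic_gd(X, y, eta=0.1, n_iters=1000):
    """Minimize J(w) by repeating w_{k+1} = w_k - eta * grad J(w_k)."""
    w = np.zeros(X.shape[1])                   # start from w = 0
    for _ in range(n_iters):
        w = w - eta * grad_logistic_loss(w, X, y)
    return w

# Tiny illustrative example: two Gaussian blobs labelled {-1, +1}.
rng = np.random.default_rng(0)
X_pos = rng.normal(loc=+2.0, size=(50, 2))
X_neg = rng.normal(loc=-2.0, size=(50, 2))
X = np.vstack([X_pos, X_neg])
X = np.hstack([X, np.ones((X.shape[0], 1))])   # append a constant feature for the intercept
y = np.concatenate([np.ones(50), -np.ones(50)])

w_hat = fit_logistic_gd(X, y)
train_acc = np.mean(np.where(X @ w_hat >= 0, +1, -1) == y)
```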
Newton's Method

w_{k+1} = w_k − η H_k^{-1} ∇J(w_k)

Hessian

H(w) = (1/n) Σ_{i=1}^{n} [exp(y_i w^T x_i) / (1 + exp(y_i w^T x_i))^2] x_i x_i^T
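For comparison, here is a sketch of the Newton update using the Hessian above. It reuses grad_logistic_loss from the earlier sketch; the damping factor η, the iteration count, and the small ridge term added before solving the linear system are choices made here for numerical stability, not part of the notes.

```python
import numpy as np

def hessian_logistic_loss(w, X, y):
    """H(w) = (1/n) * sum_i [exp(y_i w^T x_i) / (1 + exp(y_i w^T x_i))^2] x_i x_i^T."""
    n = X.shape[0]
    m = y * (X @ w)
    p = 1.0 / (1.0 + np.exp(-m))
    s = p * (1.0 - p)                          # equals exp(m) / (1 + exp(m))^2
    return (X.T * s) @ X / n                   # sum_i s_i x_i x_i^T, divided by n

def fit_logistic_newton(X, y, eta=1.0, n_iters=20, ridge=1e-8):
    """Repeat w_{k+1} = w_k - eta * H_k^{-1} grad J(w_k)."""
    w = np.zeros(X.shape[1])
    for _ in range(n_iters):
        H = hessian_logistic_loss(w, X, y) + ridge * np.eye(X.shape[1])
        w = w - eta * np.linalg.solve(H, grad_logistic_loss(w, X, y))
    return w
```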