Logistic Regression
TOC
01 Introduction
02 Hypothesis
03 Decision Boundary
04 Cost Function
05 Gradient Descent
06 Multi-Class Classification
07 Pros & Cons
1. Introduction
Classification is a supervised learning problem: it tries to learn a mapping function $h(\mathbf{x})$ from the input variables $\mathbf{x}$ to an output variable $y$ that takes discrete or categorical values.

$$\mathbf{x} \;\longrightarrow\; h(\mathbf{x}) \;\longrightarrow\; y \in \mathbb{N}$$
Examples: digit classification.
2. Logistic Regression
● Logistic Regression is a popular statistical model used in classification problems to identify the class of a given observation by predicting the value of a categorical output variable.
● It mostly deals with binary/binomial classification, whose output is a binary value, either 0 or 1 (true or false). However, it can also be extended to multiclass/multinomial classification.
● In this lecture, we will focus mainly on binary classification to simplify the problem.
2. How does it work?
● Logistic Regression measures the relationship between the dependent variable (the label, what we want to predict) and one or more independent variables (the input features) by estimating a probability with the underlying logistic function.
● This probability is then transformed into a binary value, in order to actually make a prediction, by applying a threshold.
[Diagram: input features $x_1, x_2, \ldots, x_n$ → Logistic Regression → probability $y$ → threshold 0.5 → output 1 or 0]
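As a minimal sketch of this pipeline (the weight vector and the observation below are made-up values, purely for illustration):

```python
import numpy as np

def sigmoid(z):
    """Logistic function mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical, already-trained weights (bias first) for 3 input features.
w = np.array([-1.0, 0.8, 0.5, -0.3])

# One observation, with a leading 1 for the bias term.
x = np.array([1.0, 2.0, 1.5, 0.4])

probability = sigmoid(w @ x)                  # estimated P(y = 1 | x)
prediction = 1 if probability >= 0.5 else 0   # apply the 0.5 threshold

print(probability, prediction)
```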
2. Sigmoid Function
Sigmoid Function (or Logistic Function) is an S-shaped curve that takes any real-valued number and maps it to a value between 0 and 1, but never exactly at those limits.
$$g(x) = \frac{1}{1 + e^{-x}}$$

[Figure: the sigmoid function]
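A quick numerical check of the definition above (a small sketch, not tied to any particular library beyond NumPy) shows the S-shaped behaviour: $g(0) = 0.5$, and the values approach 0 and 1 at the extremes without ever reaching them.

```python
import numpy as np

def sigmoid(x):
    """g(x) = 1 / (1 + exp(-x))"""
    return 1.0 / (1.0 + np.exp(-x))

for x in [-10.0, -2.0, 0.0, 2.0, 10.0]:
    print(f"g({x:+.1f}) = {sigmoid(x):.5f}")
# g(-10.0) ~ 0.00005, g(0.0) = 0.50000, g(+10.0) ~ 0.99995:
# close to, but never exactly, 0 or 1.
```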
2. Why not Linear Regression?
Example: human obesity prediction.
[Figure: training observations labelled Not Obese/0 or Obese/1 plotted against weight]
● To deal with classification, we can fit $h_\mathbf{w}(\mathbf{x})$ and set a threshold on its output value, e.g. 0.5:

$$h_\mathbf{w}(\mathbf{x}) \ge 0.5 \;\Rightarrow\; \text{Obese}/1, \qquad h_\mathbf{w}(\mathbf{x}) < 0.5 \;\Rightarrow\; \text{Not Obese}/0$$

[Figure: linear regression line $h_\mathbf{w}(\mathbf{x})$ fitted to the obesity data, with the 0.5 threshold marked along the weight axis $x_1$]
2. Why not Linear Regression?
Linear Regression:
● Input variables can have any measurement level
● Requires a linear relationship between the dependent and independent variables
● Independent variables can be correlated with each other
● Predicted values are the mean of the target variable at the given values of the input variables
Logistic Regression:
● Input variables can have any measurement level
● Does not require a linear relationship between the dependent and independent variables
● Independent variables must not be correlated with each other
● Predicted values are the probability of a particular level of the target variable at the given values of the input variables
2. Linear Regression vs. Logistic Regression
In the human obesity prediction example, logistic regression is more robust than linear regression, since its output is always bounded between 0 and 1.
$$h_\mathbf{w}(\mathbf{x}) = \frac{1}{1 + e^{-\mathbf{w}^T\mathbf{x}}}$$

$$h_\mathbf{w}(\mathbf{x}) \begin{cases} \ge 0.5, & \text{if } \mathbf{w}^T\mathbf{x} \ge 0 \\ < 0.5, & \text{if } \mathbf{w}^T\mathbf{x} < 0 \end{cases}$$

[Figure: sigmoid curve of $h_\mathbf{w}(\mathbf{x})$ plotted against $\mathbf{w}^T\mathbf{x}$]
3. Decision Boundary
● In order to map $h_\mathbf{w}(\mathbf{x})$ to a discrete value of either 0 or 1, a threshold value of 0.5 is defined as a tipping point: observations at or above it are classified into class 1 (the positive class), and observations below it into class 0 (the negative class).

$$h_\mathbf{w}(\mathbf{x}) \ge 0.5 \;\rightarrow\; \text{class } 1, \qquad h_\mathbf{w}(\mathbf{x}) < 0.5 \;\rightarrow\; \text{class } 0$$

[Figure: sigmoid curve of $h_\mathbf{w}(\mathbf{x})$ against $\mathbf{w}^T\mathbf{x}$, with the class 0 and class 1 regions on either side of the 0.5 threshold]
● This figure shows the decision boundary of Logistic Regression with two input variables $\mathbf{x} = (x_1, x_2)$.
● All observations classified into class 1, i.e. those with $h_\mathbf{w}(\mathbf{x}) \ge 0.5$, lie above the decision line, while the observations of class 0, with $h_\mathbf{w}(\mathbf{x}) < 0.5$, lie below it.

[Figure: linear decision boundary in the $(x_1, x_2)$ plane, with class 1 above the line and class 0 below it]
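Since $h_\mathbf{w}(\mathbf{x}) \ge 0.5$ exactly when $\mathbf{w}^T\mathbf{x} \ge 0$, the decision line in the figure is the set of points where $w_0 + w_1 x_1 + w_2 x_2 = 0$. A small sketch (the weights below are illustrative assumptions, not learned from data):

```python
# Hypothetical weights: bias w0, and coefficients w1, w2 for x1 and x2.
w0, w1, w2 = -4.0, 1.0, 2.0

def predict_class(x1, x2):
    """Class 1 if w^T x >= 0 (i.e. h_w(x) >= 0.5), otherwise class 0."""
    return 1 if w0 + w1 * x1 + w2 * x2 >= 0 else 0

def boundary_x2(x1):
    """The decision line solved for x2: w0 + w1*x1 + w2*x2 = 0."""
    return -(w0 + w1 * x1) / w2

print(predict_class(5.0, 3.0))   # above the line -> class 1
print(predict_class(0.5, 0.5))   # below the line -> class 0
print(boundary_x2(2.0))          # x2 on the boundary at x1 = 2.0 -> 1.0
```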
4. Cost Function
In logistic regression, we cannot use the Mean Squared Error (MSE) because the sigmoid function makes the cost function $J(\mathbf{w})$ non-linear in the parameters. As a result, MSE becomes a non-convex function with many local minima, and gradient descent cannot reliably learn the parameters.
$$J(\mathbf{w}) = \frac{1}{2n}\sum_{i=1}^{n}\big(h_\mathbf{w}(\mathbf{x}_i) - y_i\big)^2, \qquad h_\mathbf{w}(\mathbf{x}_i) = \frac{1}{1 + e^{-\mathbf{w}^T\mathbf{x}_i}}$$

[Figure: non-convex MSE cost $J(\mathbf{w})$ with several local minima plotted against $\mathbf{w}$]
● Instead of MSE, we use a cost function called Cross-Entropy Loss, also known as Negative Log-Likelihood, which is convex and can be minimized by gradient descent.
● For the whole training set of $n$ instances, the cost is the average of the individual costs:
$$J(\mathbf{w}) = -\frac{1}{n}\sum_{i=1}^{n}\Big[ y_i \log h_\mathbf{w}(\mathbf{x}_i) + (1 - y_i)\log\big(1 - h_\mathbf{w}(\mathbf{x}_i)\big) \Big]$$
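A minimal sketch of this cost, computed with NumPy over a few made-up labels and predicted probabilities (the small epsilon guards against log(0) and is an implementation detail, not part of the formula):

```python
import numpy as np

def cross_entropy_loss(y, p, eps=1e-12):
    """Average negative log-likelihood for binary labels y and
    predicted probabilities p = h_w(x)."""
    p = np.clip(p, eps, 1.0 - eps)        # avoid log(0)
    return -np.mean(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))

y = np.array([1, 0, 1, 0])                # true labels
p = np.array([0.9, 0.2, 0.7, 0.4])        # hypothetical model outputs

print(cross_entropy_loss(y, p))           # ~0.30; lower is better
```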
4. Cost Function – Intuition
● Cross-Entropy Loss aims to find parameters $\hat{\mathbf{w}}$ such that the model estimates high probabilities for the positive instances ($y = 1$) and low probabilities for the negative instances ($y = 0$).
● Cross-Entropy Loss is composed of two convex functions, one for each class:

$$J(\mathbf{w}) = \begin{cases} -\log h_\mathbf{w}(\mathbf{x}) & \text{if } y = 1 \\ -\log\big(1 - h_\mathbf{w}(\mathbf{x})\big) & \text{if } y = 0 \end{cases}$$

[Figure: the two loss curves plotted against $h_\mathbf{w}(\mathbf{x})$]
5. Gradient Descent
● Given $n$ examples $\{(\mathbf{x}_1, y_1), (\mathbf{x}_2, y_2), \ldots, (\mathbf{x}_n, y_n)\}$ such that $\mathbf{x}_i \in \mathbb{R}^d$, the cost function is:

$$J(\mathbf{w}) = -\frac{1}{n}\sum_{i=1}^{n}\Big[ y_i \log h_\mathbf{w}(\mathbf{x}_i) + (1 - y_i)\log\big(1 - h_\mathbf{w}(\mathbf{x}_i)\big) \Big]$$

● The gradient, or partial derivative of the cost function with respect to each parameter $w_j$, is:

$$\frac{\partial}{\partial w_j} J(\mathbf{w}) = \frac{1}{n}\sum_{i=1}^{n}\big(h_\mathbf{w}(\mathbf{x}_i) - y_i\big)\, x_{i,j}$$
● The batch gradient descent algorithm of logistic regression computes the predicted probability $\hat{p} = h_\mathbf{w}(\mathbf{x})$ for every training example, then updates each parameter against the gradient above; a sketch is given below.
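A compact sketch of that loop, assuming the design matrix X already contains a leading column of ones for the bias term (the learning rate, iteration count, and toy data are arbitrary choices for illustration):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic_regression(X, y, lr=0.1, n_iters=1000):
    """Batch gradient descent for logistic regression.
    X: (n, d) design matrix (first column = 1 for the bias),
    y: (n,) labels in {0, 1}."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iters):
        p_hat = sigmoid(X @ w)              # predicted probability h_w(x_i)
        gradient = X.T @ (p_hat - y) / n    # (1/n) sum (h_w(x_i) - y_i) x_ij
        w -= lr * gradient                  # move against the gradient
    return w

# Tiny toy data set: one feature (plus bias); larger values tend to be class 1.
X = np.array([[1.0, 0.5], [1.0, 1.0], [1.0, 3.0], [1.0, 4.0]])
y = np.array([0, 0, 1, 1])
w = fit_logistic_regression(X, y)
print(sigmoid(X @ w))   # probabilities should roughly follow the labels
```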
6. Multi-Class Classification
● What is the intuition behind multi-class classification in Logistic Regression?

[Figure: scatter plot in the $(x_1, x_2)$ plane with three classes (Class 1: +, Class 2: ×, Class 3: ∆)]
6. One vs. Rest Classification
● Logistic Regression mainly focuses on binary classification. However, it can be used to solve multi-class classification problems with a technique called "One vs. Rest Classification".
● The "One vs. Rest Classification" strategy involves training a single classifier per class, with the samples of that class as positive samples and all the other samples as negatives. Predictions are then made using the model that is the most confident, as sketched below.
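A sketch of that strategy, where fit_binary can be the fit_logistic_regression function from the gradient descent section (all names and data here are illustrative):

```python
import numpy as np

def fit_one_vs_rest(X, y, classes, fit_binary):
    """Train one binary classifier per class: samples of that class are the
    positives (1) and all other samples are the negatives (0)."""
    return {k: fit_binary(X, (y == k).astype(float)) for k in classes}

def predict_one_vs_rest(X, weights):
    """Pick, for each sample, the class whose classifier is most confident."""
    classes = list(weights)
    # One column of probabilities h_w^(k)(x) per class.
    probs = np.column_stack(
        [1.0 / (1.0 + np.exp(-(X @ weights[k]))) for k in classes]
    )
    return np.array([classes[i] for i in probs.argmax(axis=1)])
```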
For each class $k$, a separate binary classifier estimates

$$h^{(k)}_\mathbf{w}(\mathbf{x}) = P(y = k \mid \mathbf{x}, \mathbf{w})$$

[Figure: the three-class data set (Class 1: +, Class 2: ×, Class 3: ∆) in the $(x_1, x_2)$ plane, and the three binary classifiers $h^{(1)}_\mathbf{w}(\mathbf{x})$, $h^{(2)}_\mathbf{w}(\mathbf{x})$, $h^{(3)}_\mathbf{w}(\mathbf{x})$, each separating one class from the rest]
● To classify a new instance $\mathbf{x}$ (here a "+" point), the most confident classifier wins:

$$h^{(1)}_\mathbf{w}(\mathbf{x}) = P(y = 1 \mid \mathbf{x}) = 0.89, \quad h^{(2)}_\mathbf{w}(\mathbf{x}) = P(y = 2 \mid \mathbf{x}) = 0.10, \quad h^{(3)}_\mathbf{w}(\mathbf{x}) = P(y = 3 \mid \mathbf{x}) = 0.01 \;\Rightarrow\; y = 1$$
6. Softmax Regression
● Logistic Regression can be generalized to support multiple classes directly, without
having to train and combine multiple binary classifiers. This is called Softmax
Regression or Multinomial Logistic Regression.
● The idea is quite simple: when given an instance x, Softmax Regression first computes
a score 𝑠𝑘 (x) for each class k, then estimates the probability of each class by applying
the softmax function (also called the normalized exponential) to the scores.
$$\sigma(\mathbf{z})_i = \frac{e^{z_i}}{\sum_{j=1}^{n} e^{z_j}}$$

Example: $\mathbf{z} = (1.3,\ 5.1,\ 2.2,\ 0.7,\ 1.1) \;\xrightarrow{\ \text{softmax}\ \sigma\ }\; \sigma(\mathbf{z}) \approx (0.02,\ 0.90,\ 0.05,\ 0.01,\ 0.02)$
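A short sketch of the softmax function applied to the example scores above (subtracting the maximum score is a standard numerical-stability trick, not part of the definition):

```python
import numpy as np

def softmax(z):
    """Normalized exponential: sigma(z)_i = exp(z_i) / sum_j exp(z_j)."""
    z = np.asarray(z, dtype=float)
    e = np.exp(z - z.max())        # shift for numerical stability
    return e / e.sum()

scores = [1.3, 5.1, 2.2, 0.7, 1.1]
print(softmax(scores).round(2))    # ~[0.02, 0.90, 0.05, 0.01, 0.02], sums to 1
```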
[Figure: softmax regression network with inputs $x_1, \ldots, x_d$, class scores $s_k = \mathbf{w}_k^T\mathbf{x}$ for $k = 1, 2, 3$ (with bias weights $w_{k0}$), a softmax layer, and outputs $P(y = k \mid \mathbf{x})$]
$$P(y = k \mid \mathbf{x}) = \sigma\big(\mathbf{s}(\mathbf{x})\big)_k = \frac{e^{s_k(\mathbf{x})}}{\sum_{j=1}^{K} e^{s_j(\mathbf{x})}}, \qquad J(\mathbf{w}) = -\frac{1}{n}\sum_{i=1}^{n}\sum_{k=1}^{K} y_k^{(i)} \log\big(P^{(i)}(y = k)\big)$$
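A minimal sketch of the full forward pass and cost for $K = 3$ classes. The weight matrix, design matrix (with a leading bias column), and one-hot labels below are illustrative assumptions, not real data:

```python
import numpy as np

def softmax(Z):
    """Row-wise softmax over class scores."""
    E = np.exp(Z - Z.max(axis=1, keepdims=True))
    return E / E.sum(axis=1, keepdims=True)

# Illustrative weights W (K classes x d features) and design matrix X (n x d).
W = np.array([[ 0.2,  1.0, -0.5],
              [-0.1, -0.4,  0.9],
              [ 0.0, -0.6, -0.4]])
X = np.array([[1.0, 2.0, 0.5],
              [1.0, 0.3, 1.8]])
Y = np.array([[1, 0, 0],          # one-hot labels y_k^(i)
              [0, 1, 0]])

S = X @ W.T                        # class scores s_k(x) = w_k^T x
P = softmax(S)                     # P(y = k | x) for every instance
J = -np.mean(np.sum(Y * np.log(P), axis=1))   # cross-entropy cost
print(P.round(3), J)
```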
7. Pros & Cons
● Pros:
○ Easy to implement and interpret results
○ Inexpensive computation
○ Does not require feature scaling
● Cons:
○ Not able to handle a large number of categories in the output variable
○ Vulnerable to overfitting
○ Cannot model non-linearly separable data
Q&A