CH3 Logistic Regression 2020

The document discusses supervised learning with a focus on logistic regression for classification problems, particularly binary classification. It covers key concepts such as hypothesis representation, decision boundaries, cost functions, and gradient descent, while highlighting the advantages of logistic regression over linear regression. Additionally, it explains the sigmoid function, interpretation of hypothesis outputs, and the formulation of a convex cost function necessary for effective model training.


Supervised learning

Logistic regression

Classification problem
Hypothesis representation
Decision boundary
Cost function for logistic regression
Gradient descent for logistic regression
Multiclass classification problems

Binary classification
• Classification problems
  o Email → spam / not spam?
  o Online transactions → fraudulent?
  o Tumor → malignant / benign
• The variable in these problems is Y
  o Y is either 0 or 1
    - 0 = negative class (absence of something)
    - 1 = positive class (presence of something)
• Start with binary class problems
• Later look at multiclass classification problems, although these are just an extension of binary classification

How do we develop a classification algorithm?

o Example: tumour size vs malignancy (0 or 1)
o We could use linear regression, H_w(x) = w^T x
  - Then threshold the classifier output (i.e. anything over some value is yes, else no)
  - In our example, linear regression with thresholding seems to work


Issues with linear regression in classification problems


• We can see that this does a reasonable job of stratifying the data points into one of two classes
  o But what if we had a single Yes example with a very large tumour?
  o Fitting the line to that outlier would shift it and lead to classifying existing Yes examples as No
• Another issue with linear regression
  o We know Y is 0 or 1
  o The hypothesis can give values larger than 1 or less than 0
• Logistic regression, in contrast, generates a value that is always between 0 and 1
  o Logistic regression is a classification algorithm, despite its name: don't be confused

What function is used to represent our hypothesis in classification?


• We want our classifier to output values between 0 and 1
  o When using linear regression we had H_w(x) = w^T x
  o For classification, the hypothesis representation becomes H_w(x) = g(w^T x), where

        g(z) = 1 / (1 + e^{-z}),   z ∈ ℝ

  o This is the sigmoid function, or the logistic function

• If we combine these equations, we can write out the hypothesis as

        H_w(x) = 1 / (1 + e^{-w^T x})
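As an illustration, this hypothesis can be written in a few lines of Python/NumPy (a minimal sketch; the weight and feature values below are made up):

import numpy as np

def sigmoid(z):
    # logistic function g(z) = 1 / (1 + e^(-z))
    return 1.0 / (1.0 + np.exp(-z))

def hypothesis(w, x):
    # H_w(x) = g(w^T x); w and x are 1-D arrays of the same length
    return sigmoid(np.dot(w, x))

w = np.array([-3.0, 1.0])       # illustrative weights
x = np.array([1.0, 4.0])        # x[0] = 1 is the bias term
print(hypothesis(w, x))         # always strictly between 0 and 1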


What does the sigmoid function look like? (2/2)


• Crosses 0.5 at z = 0, then flattens out
• Horizontal asymptotes at 0 and 1
  (figure: plot of the sigmoid g(z) against z)


Useful property of the derivative of the sigmoid function

    g'(z) = d/dz [ 1 / (1 + e^{-z}) ]
          = e^{-z} / (1 + e^{-z})^2
          = (1 / (1 + e^{-z})) * (1 - 1 / (1 + e^{-z}))
          = g(z) (1 - g(z))
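This identity is easy to sanity-check numerically (a small sketch; the test point z = 0.7 is arbitrary):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z, eps = 0.7, 1e-6
analytic = sigmoid(z) * (1.0 - sigmoid(z))                      # g(z)(1 - g(z))
numeric = (sigmoid(z + eps) - sigmoid(z - eps)) / (2.0 * eps)   # finite-difference g'(z)
print(analytic, numeric)        # the two values should agree to many decimal places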


Interpreting hypothesis output (1/2)


• When H_w(x) outputs a number, we treat that value as the estimated probability that y = 1 on input x
  o Example
    - If x is a feature vector with x0 = 1 (as always) and x1 = tumourSize
    - H_w(x) = 0.7 tells a patient they have a 70% chance of the tumor being malignant
  o We can write this using the following notation: H_w(x) = P(y=1|x ; w)
  o P(y=1|x ; w): the probability that y = 1, given x, parameterized by w
• Since this is a binary classification task we know y = 0 or 1
  o So the following must be true:
    P(y=1|x ; w) + P(y=0|x ; w) = 1, i.e. P(y=0|x ; w) = 1 - P(y=1|x ; w)
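The probabilistic reading of the output can be sketched as follows (the weights and the tumour size are invented for illustration):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

w = np.array([-6.0, 0.1])               # hypothetical parameters
x = np.array([1.0, 65.0])               # x0 = 1, x1 = tumourSize

p_malignant = sigmoid(np.dot(w, x))     # P(y=1 | x; w)
p_benign = 1.0 - p_malignant            # P(y=0 | x; w) = 1 - P(y=1 | x; w)
print(p_malignant, p_benign)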

Interpreting hypothesis output (2/2)


o We can predict y = 1 when H_w(x) >= 0.5 (else we predict y = 0)
o When exactly is H_w(x) greater than or equal to 0.5?
  - Look at the sigmoid function: g(z) >= 0.5 when z >= 0
  - So if z is positive, g(z) is greater than 0.5
o Here z = w^T x, so when w^T x >= 0 we have H_w(x) >= 0.5 (see the sketch below)
  (figure: plot of the sigmoid g(z) against z)
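In code, thresholding the probability at 0.5 is therefore the same as testing the sign of w^T x (a sketch with made-up weights and inputs):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def predict(w, x):
    # predict y = 1 when H_w(x) >= 0.5, i.e. exactly when w^T x >= 0
    return 1 if np.dot(w, x) >= 0 else 0

w = np.array([0.5, -1.0])                       # illustrative weights
for x in (np.array([1.0, 0.2]), np.array([1.0, 0.9])):
    print(x, predict(w, x), sigmoid(np.dot(w, x)))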

Decision boundary
• So what we've shown is that the hypothesis predicts y = 1 when w^T x >= 0
  o The corollary is that when w^T x < 0 the hypothesis predicts y = 0
  o Let's use this to better understand how the hypothesis makes its predictions
• Consider H_w(x) = g(w0 + w1*x1 + w2*x2)


Decision boundary
• Example: w0 = -3, w1 = 1 and w2 = 1
• Our parameter vector is a column vector with the above values, so w^T is the row vector [-3, 1, 1]
• The z here becomes w^T x
  o We predict "y = 1" if
    -3*x0 + 1*x1 + 1*x2 >= 0, i.e. -3 + x1 + x2 >= 0
  o We can also rewrite this as: if (x1 + x2 >= 3) then we predict y = 1
  o If we plot x1 + x2 = 3 we graphically obtain our decision boundary (checked in the sketch below)
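A quick check of this boundary (the two test points are arbitrary):

import numpy as np

w = np.array([-3.0, 1.0, 1.0])          # w0, w1, w2 from the example above

for x1, x2 in [(1.0, 1.0),              # x1 + x2 = 2 < 3  -> predict 0
               (2.0, 2.0)]:             # x1 + x2 = 4 >= 3 -> predict 1
    z = w[0] + w[1] * x1 + w[2] * x2
    print((x1, x2), int(z >= 0))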

Non-linear decision boundaries


• Get logistic regression to fit a complex non-linear data set
  o As in polynomial regression, add higher-order terms
  o So say we have
    H_w(x) = g(w0 + w1*x1 + w2*x2 + w3*x1^2 + w4*x2^2)
  o We take the transpose of the w vector times the feature vector
  o Say w^T was [-1, 0, 0, 1, 1]; then we predict "y = 1" if
    -1 + x1^2 + x2^2 >= 0, i.e. x1^2 + x2^2 >= 1
  o If we plot x1^2 + x2^2 = 1,
    this gives us a circle of radius 1 centred at the origin (see the sketch below)
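The circular boundary can be checked the same way (test points chosen for illustration):

import numpy as np

w = np.array([-1.0, 0.0, 0.0, 1.0, 1.0])        # w0..w4 from the example above

def predict(x1, x2):
    features = np.array([1.0, x1, x2, x1**2, x2**2])
    return int(np.dot(w, features) >= 0)        # y = 1 when x1^2 + x2^2 >= 1

print(predict(0.5, 0.5))    # inside the unit circle  -> 0
print(predict(1.5, 0.0))    # outside the unit circle -> 1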

Non-linear decision boundaries


• This means we can build more complex decision boundaries by fitting complex parameters to this (relatively) simple hypothesis
• More complex decision boundaries?
  o By using higher-order polynomial terms, we can get even more complex decision boundaries


Cost function of the linear regression in ‘Logistic Regression’ is non-convex


    C(w) = (1 / 2m) * Σ_{i=1}^{m} ( y^(i) - H_w(x^(i)) )^2

• If we try to use the cost function of linear regression in logistic regression, it is of no use: because H_w(x) is now non-linear, the cost ends up being a non-convex function with many local minima, in which it is very difficult to minimize the cost value and find the global minimum.

  (figure: a non-convex cost function)

A convex logistic regression cost function


• We need a different, convex cost function, which means we can apply gradient descent

• We want a classifier that produces a very high H_w(x) when y = 1 and, conversely, a very low H_w(x) when y = 0. We hope that H_w(x) is very close to y for each sample

• In other words,
  (1) If y = 1 we want to maximize H_w(x)
  (2) If y = 0 we want to maximize 1 - H_w(x)
• If we combine (1) and (2), we want to maximize:

      H_w(x)^y * (1 - H_w(x))^(1 - y)

• Maximizing the above is equivalent to maximizing its logarithm:

      log[ H_w(x)^y * (1 - H_w(x))^(1 - y) ] = y log( H_w(x) ) + (1 - y) log( 1 - H_w(x) )


A convex logistic regression cost function


• Or, equivalently, we want to minimize:

      -[ y log( H_w(x) ) + (1 - y) log( 1 - H_w(x) ) ]

• Generally, in machine learning we like to minimize the loss. That is why we changed the sign. This is by convention.

• The above formula defines a cost (or loss) for only one sample. We also need a cost function over multiple samples, which we will call C(w):

      C(w) = -(1/m) * Σ_{i=1}^{m} [ y^(i) log( H_w(x^(i)) ) + (1 - y^(i)) log( 1 - H_w(x^(i)) ) ]


Gradient of C(w) (1/3)

    C(w) = -(1/m) * Σ_{i=1}^{m} [ y log( H_w(x) ) + (1 - y) log( 1 - H_w(x) ) ]

    ∂C(w)/∂w_j = -(1/m) * Σ_{i=1}^{m} [ y * ∂log( H_w(x) )/∂w_j + (1 - y) * ∂log( 1 - H_w(x) )/∂w_j ]

               = -(1/m) * Σ_{i=1}^{m} [ y * (∂H_w(x)/∂w_j) / H_w(x) + (1 - y) * (∂(1 - H_w(x))/∂w_j) / (1 - H_w(x)) ]


Gradient of C(w) (2/3)

Write H_w(x) as a composition (f ∘ g)(w), with

    f(u) = 1 / (1 + e^{-u})   and   g(w) = w_0 x_0 + ... + w_j x_j + ... + w_n x_n = Σ_{k=0}^{n} w_k x_k

so that H_w(x) = f(g(w)) = 1 / (1 + e^{-Σ_k w_k x_k}).

By the chain rule, (f ∘ g)' = (f' ∘ g) g', with f' = f (1 - f) and ∂g/∂w_j = x_j. Therefore

    ∂H_w(x)/∂w_j = f(g) (1 - f(g)) x_j = H_w(x) (1 - H_w(x)) x_j

    ∂(1 - H_w(x))/∂w_j = -∂H_w(x)/∂w_j = -H_w(x) (1 - H_w(x)) x_j

Gradient of C(w) (3/3)

    ∂C(w)/∂w_j = -(1/m) * Σ_{i=1}^{m} [ y * H_w(x)(1 - H_w(x)) x_j / H_w(x)  -  (1 - y) * H_w(x)(1 - H_w(x)) x_j / (1 - H_w(x)) ]

               = -(1/m) * Σ_{i=1}^{m} [ y (1 - H_w(x)) x_j - (1 - y) H_w(x) x_j ]

               = -(1/m) * Σ_{i=1}^{m} x_j ( y - y H_w(x) - H_w(x) + y H_w(x) )

               = -(1/m) * Σ_{i=1}^{m} ( y - H_w(x) ) x_j


Gradient descent algorithm for logistic regression


Start from any initial value of the w_j, then repeat until convergence:

{
    w_j := w_j + (α/m) * Σ_{i=1}^{m} ( y^(i) - H_w(x^(i)) ) x_j^(i)     (update all w_j simultaneously)
}

• This equation is the same as the linear regression rule

• The only difference is that our definition of the hypothesis H_w(x) has changed
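Put together, batch gradient descent for logistic regression can be sketched as follows (the data set and hyper-parameters below are illustrative):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_descent(X, y, alpha=0.1, n_iters=1000):
    # w_j := w_j + (alpha/m) * sum_i (y_i - H_w(x_i)) * x_ij, all j updated at once
    m, n = X.shape
    w = np.zeros(n)                       # any initial value works
    for _ in range(n_iters):
        h = sigmoid(X @ w)                # the only change from linear regression
        w = w + (alpha / m) * (X.T @ (y - h))
    return w

X = np.array([[1.0, 0.2], [1.0, 0.9], [1.0, 2.1], [1.0, 3.0]])   # illustrative data
y = np.array([0.0, 0.0, 1.0, 1.0])
print(gradient_descent(X, y))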

Reminder of the update rule:

    w_j := w_j + (α/m) * Σ_{i=1}^{m} ( y^(i) - H_w(x^(i)) ) x_j^(i)

1st iteration:

Training set:

    x0     x1     x2     Y
    1     -0.1    1.4    0
    1     -0.5   -0.1    0
    1      1.3    0.9    1
    1     -0.6    0.4    1

Initial values of the w_i: w0 = 0.1, w1 = 0.2, w2 = 0.3
Learning rate: alpha = 0.01
g(z) = 1 / (1 + exp(-z))

• w0 = 0.1 + 0.01/4 * ( (0 - g(w0*x0(1) + w1*x1(1) + w2*x2(1))) * x0(1)
                      + (0 - g(w0*x0(2) + w1*x1(2) + w2*x2(2))) * x0(2)
                      + (1 - g(w0*x0(3) + w1*x1(3) + w2*x2(3))) * x0(3)
                      + (1 - g(w0*x0(4) + w1*x1(4) + w2*x2(4))) * x0(4) )
     = 0.1 + 0.01/4 * ( (0 - g(0.1 - 0.2*0.1 + 0.3*1.4)) + (0 - g(0.1 - 0.2*0.5 - 0.3*0.1))
                      + (1 - g(0.1 + 0.2*1.3 + 0.3*0.9)) + (1 - g(0.1 - 0.2*0.6 + 0.3*0.4)) )
     = 0.1 + 0.01/4 * ( (0 - g(0.5)) + (0 - g(-0.03)) + (1 - g(0.63)) + (1 - g(0.1)) )
     = 0.1 + 0.01/4 * (2.78442238235 - 0.92460027612 + 2.40347419836 - 0.37330225703)
     = 0.1 + 0.0025 * 3.88999404756
     = 0.10972498511

• w1 = 0.2 + 0.01/4 * ( (0 - g(w0*x0(1) + w1*x1(1) + w2*x2(1))) * x1(1)
                      + (0 - g(w0*x0(2) + w1*x1(2) + w2*x2(2))) * x1(2)
                      + (1 - g(w0*x0(3) + w1*x1(3) + w2*x2(3))) * x1(3)
                      + (1 - g(w0*x0(4) + w1*x1(4) + w2*x2(4))) * x1(4) )
     = 0.2 + 0.01/4 * ( (0 - g(0.1 - 0.2*0.1 + 0.3*1.4))*(-0.1) + (0 - g(0.1 - 0.2*0.5 - 0.3*0.1))*(-0.5)
                      + (1 - g(0.1 + 0.2*1.3 + 0.3*0.9))*(1.3) + (1 - g(0.1 - 0.2*0.6 + 0.3*0.4))*(-0.6) )
     = 0.2 + 0.01/4 * ( (0 - g(0.5))*(-0.1) + (0 - g(-0.03))*(-0.5) + (1 - g(0.63))*(1.3) + (1 - g(0.1))*(-0.6) )
     = 0.2 + 0.01/4 * (2.78442238235*(-0.1) - 0.92460027612*(0.5) + 2.40347419836*(1.3) - 0.37330225703*(-0.6))
     = 0.2 + 0.01/4 * (-0.27844223823 - 0.46230013806 - 3.12451645787 + 0.22398135421)
     = 0.2 + 0.0025 * (-3.64127747995)
     = 0.1908968063

(Same training set, initial values, learning rate and update rule as on the previous slide.)

• w2 = 0.3 + 0.01/4 * ( (0 - g(w0*x0(1) + w1*x1(1) + w2*x2(1))) * x2(1)
                      + (0 - g(w0*x0(2) + w1*x1(2) + w2*x2(2))) * x2(2)
                      + (1 - g(w0*x0(3) + w1*x1(3) + w2*x2(3))) * x2(3)
                      + (1 - g(w0*x0(4) + w1*x1(4) + w2*x2(4))) * x2(4) )
     = 0.2 + 0.01/4 * ( (0 - g(0.1 - 0.2*0.1 + 0.3*1.4))*(1.4) + (0 - g(0.1 - 0.2*0.5 - 0.3*0.1))*(-0.1)
                      + (1 - g(0.1 + 0.2*1.3 + 0.3*0.9))*(0.9) + (1 - g(0.1 - 0.2*0.6 + 0.3*0.4))*(0.4) )
     = 0.2 + 0.01/4 * ( (0 - g(0.5))*(1.4) + (0 - g(-0.03))*(-0.1) + (1 - g(0.63))*(0.9) + (1 - g(0.1))*(0.4) )
     = 0.2 + 0.01/4 * (2.78442238235*(1.4) - 0.92460027612*(-0.1) + 2.40347419836*(0.9) - 0.37330225703*(0.4))
     = 0.2 + 0.01/4 * (3.89819133529 + 0.09246002761 + 2.16312677852 - 0.14932090281)
     = 0.2 + 0.0025 * (6.00445723861)
     = 0.21501114309

2nd iteration (using the values of w obtained after the 1st iteration):

• w0 = 0.10972498511 + 0.01/4 * ( (0 - g(w0*x0(1) + w1*x1(1) + w2*x2(1))) * x0(1)
                                + (0 - g(w0*x0(2) + w1*x1(2) + w2*x2(2))) * x0(2)
                                + (1 - g(w0*x0(3) + w1*x1(3) + w2*x2(3))) * x0(3)
                                + (1 - g(w0*x0(4) + w1*x1(4) + w2*x2(4))) * x0(4) )

• w1 = 0.1908968063 + 0.01/4 * ( (0 - g(w0*x0(1) + w1*x1(1) + w2*x2(1))) * x1(1)
                               + (0 - g(w0*x0(2) + w1*x1(2) + w2*x2(2))) * x1(2)
                               + (1 - g(w0*x0(3) + w1*x1(3) + w2*x2(3))) * x1(3)
                               + (1 - g(w0*x0(4) + w1*x1(4) + w2*x2(4))) * x1(4) )

• w2 = 0.21501114309 + 0.01/4 * ( (0 - g(w0*x0(1) + w1*x1(1) + w2*x2(1))) * x2(1)
                                + (0 - g(w0*x0(2) + w1*x1(2) + w2*x2(2))) * x2(2)
                                + (1 - g(w0*x0(3) + w1*x1(3) + w2*x2(3))) * x2(3)
                                + (1 - g(w0*x0(4) + w1*x1(4) + w2*x2(4))) * x2(4) )
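The hand computation above can also be replayed in code. The following sketch applies the same batch update rule to the same training set, initial weights and learning rate, printing w after each pass so the values obtained by hand can be compared against it:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

X = np.array([[1.0, -0.1,  1.4],        # training set (columns x0, x1, x2)
              [1.0, -0.5, -0.1],
              [1.0,  1.3,  0.9],
              [1.0, -0.6,  0.4]])
y = np.array([0.0, 0.0, 1.0, 1.0])

w = np.array([0.1, 0.2, 0.3])           # initial w0, w1, w2
alpha = 0.01
m = X.shape[0]

for it in range(2):                     # the two iterations worked through above
    h = sigmoid(X @ w)
    w = w + (alpha / m) * (X.T @ (y - h))
    print("iteration", it + 1, "w =", w)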


Gradient descent algorithm for logistic regression simulation


One vs. all technique 1/3


• Getting logistic regression to work for multiclass classification using one vs. all
• Multiclass means more than yes or no (1 or 0)
  o Classification where each example is assigned to one of several classes


One vs. all technique 2/3


• Given a dataset with three classes, how do we get a learning algorithm to work?
  o Use one vs. all classification to make binary classification work for multiclass classification

• One vs. all classification
  o Split the training set into three separate binary classification problems
    - i.e. create new "fake" training sets:

    - Triangles (1) vs crosses and squares (0): h_w1(x) = P(y=1 | x; w1)
    - Crosses (1) vs triangles and squares (0): h_w2(x) = P(y=1 | x; w2)
    - Squares (1) vs triangles and crosses (0): h_w3(x) = P(y=1 | x; w3)

One vs. all technique 3/3


• Overall
  o Train a logistic regression classifier h_w^(i)(x) for each class i to predict the probability that y = i
  o On a new input x, to make a prediction, pick the class i that maximizes h_w^(i)(x)
    (a sketch of this procedure follows below)
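A compact sketch of one vs. all (the three-class data set and training settings below are made up for illustration):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logistic(X, y, alpha=0.1, n_iters=2000):
    # binary logistic regression trained with batch gradient descent
    m, n = X.shape
    w = np.zeros(n)
    for _ in range(n_iters):
        w += (alpha / m) * (X.T @ (y - sigmoid(X @ w)))
    return w

def one_vs_all(X, y, n_classes):
    # one classifier per class i: class i relabelled 1, all other classes 0
    return np.array([train_logistic(X, (y == i).astype(float)) for i in range(n_classes)])

def predict(W, x):
    # pick the class whose classifier reports the highest probability
    return int(np.argmax(sigmoid(W @ x)))

X = np.array([[1.0, 0.1], [1.0, 0.3], [1.0, 1.0],
              [1.0, 1.2], [1.0, 2.0], [1.0, 2.2]])      # x0 = 1 plus one feature
y = np.array([0, 0, 1, 1, 2, 2])
W = one_vs_all(X, y, 3)
print(predict(W, np.array([1.0, 2.1])))                 # the class-2 classifier should win here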
