Lecture Note #9 - PEC-CS701E
Logistic Regression is a Machine Learning algorithm used for classification problems; it is a predictive-analysis algorithm based on the concept of probability.
•Logistic regression predicts the output of a categorical dependent variable, so the outcome must be a categorical or discrete value: Yes or No, 0 or 1, True or False, etc. However, instead of returning exactly 0 or 1, it returns probabilistic values that lie between 0 and 1.
•In logistic regression the predicted value $y$ is a probability, so it can only lie between 0 and 1. Dividing $y$ by $(1 - y)$ gives the odds, $y/(1 - y)$, whose range is $0$ to $+\infty$.
•But we need a range from $-\infty$ to $+\infty$, so we take the logarithm, which gives the log-odds (logit), $\log\frac{y}{1-y}$; this quantity can then be modeled as a linear function of the input.
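Written out explicitly (using the same $\theta$ notation as the hypothesis introduced later), modeling the log-odds as a linear function of the input and solving for $y$ recovers the sigmoid form:
$\log\dfrac{y}{1 - y} = \theta^\top x \;\Rightarrow\; \dfrac{y}{1 - y} = e^{\theta^\top x} \;\Rightarrow\; y = \dfrac{1}{1 + e^{-\theta^\top x}}$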
Logistic Regression
• Hypothesis representation
• Cost function
• Regularization
• Multi-class classification
[Figure: classification example – Malignant? (1 = Yes, 0 = No) plotted against Tumor Size]
$h_\theta(x) = \theta^\top x$ (from linear regression) can be $> 1$ or $< 0$
Logistic regression: $0 \le h_\theta(x) \le 1$
• Sigmoid function (logistic function): $g(z) = \dfrac{1}{1 + e^{-z}}$, so $h_\theta(x) = g(\theta^\top x)$
[Figure: the sigmoid function $g(z)$ plotted against $z$]
Interpretation of hypothesis output
• $h_\theta(x)$ = estimated probability that $y = 1$ on input $x$
• Example: if $x = \begin{bmatrix} x_0 \\ x_1 \end{bmatrix} = \begin{bmatrix} 1 \\ \text{tumorSize} \end{bmatrix}$ and $h_\theta(x) = 0.7$, the model estimates a 70% chance that the tumor is malignant.
[Figure: training data plotted with Tumor Size on the horizontal axis and Age on the vertical axis; the line $x_1 + x_2 = 3$ separates the two classes]
• E.g., $\theta_0 = -3$, $\theta_1 = 1$, $\theta_2 = 1$
• Predict “$y = 1$” if $-3 + x_1 + x_2 \ge 0$
Hypothesis representation
• Logistic regression hypothesis representation:
$h_\theta(x) = \dfrac{1}{1 + e^{-\theta^\top x}} = \dfrac{1}{1 + e^{-(\theta_0 + \theta_1 x_1 + \theta_2 x_2 + \cdots + \theta_n x_n)}}$
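As a concrete illustration, here is a minimal NumPy sketch of this hypothesis (function and variable names are mine, not from the lecture); it also reproduces the earlier $\theta_0 = -3$, $\theta_1 = 1$, $\theta_2 = 1$ decision-boundary example with illustrative feature values:

```python
import numpy as np

def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^{-z})."""
    return 1.0 / (1.0 + np.exp(-z))

def h(theta, x):
    """Hypothesis h_theta(x) = g(theta^T x); x must include x0 = 1."""
    return sigmoid(theta @ x)

theta = np.array([-3.0, 1.0, 1.0])   # theta0, theta1, theta2 from the example
x = np.array([1.0, 2.0, 2.0])        # x0 = 1, x1 = 2, x2 = 2 (illustrative values)
print(h(theta, x))                   # ~0.73 > 0.5, i.e. predict y = 1 since -3 + x1 + x2 >= 0
```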
• Consider learning $f: X \to Y$, where
  • $X$ is a vector of real-valued features $(X_1, \cdots, X_n)^\top$
  • $Y$ is Boolean
• Assume all $X_i$ are conditionally independent given $Y$
• Model $P(X_i \mid Y = y_k)$ as Gaussian $N(\mu_{ik}, \sigma_i)$
• Model $P(Y)$ as Bernoulli($\pi$)
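Under these assumptions the posterior takes exactly the logistic form used above; this is the standard Gaussian-Naive-Bayes-to-logistic-regression derivation, sketched here for completeness rather than quoted from the lecture:
$P(Y = 1 \mid X) = \dfrac{1}{1 + \frac{P(Y=0)\,P(X \mid Y=0)}{P(Y=1)\,P(X \mid Y=1)}} = \dfrac{1}{1 + \exp\left(w_0 + \sum_i w_i X_i\right)}$, with $w_i = \dfrac{\mu_{i0} - \mu_{i1}}{\sigma_i^2}$ and $w_0 = \ln\dfrac{1 - \pi}{\pi} + \sum_i \dfrac{\mu_{i1}^2 - \mu_{i0}^2}{2\sigma_i^2}$.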
Cost function
Training set with $m$ examples
$\{(x^{(1)}, y^{(1)}), (x^{(2)}, y^{(2)}), \cdots, (x^{(m)}, y^{(m)})\}$
$x = \begin{bmatrix} x_0 \\ x_1 \\ \vdots \\ x_n \end{bmatrix}$, $x_0 = 1$, $y \in \{0, 1\}$
$h_\theta(x) = \dfrac{1}{1 + e^{-\theta^\top x}}$
How to choose parameters 𝜃?
Cost function for Linear Regression
$J(\theta) = \dfrac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 = \dfrac{1}{m}\sum_{i=1}^{m} \mathrm{Cost}(h_\theta(x^{(i)}), y^{(i)})$
$\mathrm{Cost}(h_\theta(x), y) = \dfrac{1}{2}\left(h_\theta(x) - y\right)^2$
Cost function for Logistic Regression
$\mathrm{Cost}(h_\theta(x), y) = \begin{cases} -\log(h_\theta(x)) & \text{if } y = 1 \\ -\log(1 - h_\theta(x)) & \text{if } y = 0 \end{cases}$
[Figure: $-\log(h_\theta(x))$ plotted against $h_\theta(x) \in [0, 1]$ for $y = 1$, and $-\log(1 - h_\theta(x))$ for $y = 0$]
Logistic regression cost function
• $\mathrm{Cost}(h_\theta(x), y) = \begin{cases} -\log(h_\theta(x)) & \text{if } y = 1 \\ -\log(1 - h_\theta(x)) & \text{if } y = 0 \end{cases}$
• If $y = 1$: $\mathrm{Cost}(h_\theta(x), y) = -\log(h_\theta(x))$, which is 0 when $h_\theta(x) = 1$ and grows without bound as $h_\theta(x) \to 0$
• If $y = 0$: $\mathrm{Cost}(h_\theta(x), y) = -\log(1 - h_\theta(x))$, which is 0 when $h_\theta(x) = 0$ and grows without bound as $h_\theta(x) \to 1$
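A small sketch of this per-example cost (names are illustrative, not from the lecture); note how the penalty blows up when the model is confidently wrong:

```python
import numpy as np

def cost(h_x, y):
    """Per-example logistic cost: -log(h) if y = 1, -log(1 - h) if y = 0."""
    return -np.log(h_x) if y == 1 else -np.log(1.0 - h_x)

print(cost(0.9, 1))   # ~0.105: confident and correct -> small cost
print(cost(0.1, 1))   # ~2.303: confident and wrong   -> large cost
```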
Logistic regression
$J(\theta) = \dfrac{1}{m}\sum_{i=1}^{m} \mathrm{Cost}(h_\theta(x^{(i)}), y^{(i)}) = -\dfrac{1}{m}\sum_{i=1}^{m}\left[ y^{(i)}\log(h_\theta(x^{(i)})) + (1 - y^{(i)})\log(1 - h_\theta(x^{(i)})) \right]$
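The same averaged cost over a training set, written as a vectorized NumPy sketch (assuming the design matrix X already contains the $x_0 = 1$ column; names are illustrative):

```python
import numpy as np

def J(theta, X, y):
    """J(theta) = -(1/m) * sum[ y*log(h) + (1-y)*log(1-h) ], with h = sigmoid(X @ theta)."""
    m = len(y)
    h = 1.0 / (1.0 + np.exp(-X @ theta))
    return -(y @ np.log(h) + (1 - y) @ np.log(1 - h)) / m

X = np.array([[1.0, 0.5], [1.0, 2.0]])   # two examples: x0 = 1 plus one feature
y = np.array([0.0, 1.0])
print(J(np.zeros(2), X, y))              # log(2) ~ 0.693 when theta = 0 (h = 0.5 everywhere)
```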
Maximum conditional likelihood estimate for parameter $\theta$
• Goal: choose $\theta$ to maximize the conditional likelihood of the training data
• $P_\theta(Y = 1 \mid X = x) = h_\theta(x) = \dfrac{1}{1 + e^{-\theta^\top x}}$
• $P_\theta(Y = 0 \mid X = x) = 1 - h_\theta(x) = \dfrac{e^{-\theta^\top x}}{1 + e^{-\theta^\top x}}$
• Training data $D = \{(x^{(1)}, y^{(1)}), (x^{(2)}, y^{(2)}), \cdots, (x^{(m)}, y^{(m)})\}$
• Data likelihood $= \prod_{i=1}^{m} P_\theta(x^{(i)}, y^{(i)})$, so $\theta_{\mathrm{MLE}} = \operatorname{argmax}_\theta \prod_{i=1}^{m} P_\theta(x^{(i)}, y^{(i)})$
• Conditional data likelihood: $\theta_{\mathrm{MCLE}} = \operatorname{argmax}_\theta \prod_{i=1}^{m} P_\theta(y^{(i)} \mid x^{(i)})$
Expressing conditional log-likelihood
$\ell(\theta) = \log \prod_{i=1}^{m} P_\theta(y^{(i)} \mid x^{(i)}) = \sum_{i=1}^{m} y^{(i)}\log(h_\theta(x^{(i)})) + (1 - y^{(i)})\log(1 - h_\theta(x^{(i)}))$
Maximizing this conditional log-likelihood is therefore equivalent to minimizing $J(\theta)$, i.e. the average of
$\mathrm{Cost}(h_\theta(x), y) = \begin{cases} -\log(h_\theta(x)) & \text{if } y = 1 \\ -\log(1 - h_\theta(x)) & \text{if } y = 0 \end{cases}$
Gradient descent
$J(\theta) = -\dfrac{1}{m}\sum_{i=1}^{m}\left[ y^{(i)}\log(h_\theta(x^{(i)})) + (1 - y^{(i)})\log(1 - h_\theta(x^{(i)})) \right]$
Goal: $\min_\theta J(\theta)$. Good news: convex function! Bad news: no analytical solution.
Regularization
How about MAP?
• Maximum conditional likelihood estimate (MCLE)
$\theta_{\mathrm{MCLE}} = \operatorname{argmax}_\theta \prod_{i=1}^{m} P_\theta(y^{(i)} \mid x^{(i)})$
• Maximum conditional a posteriori (MCAP) estimate
$\theta_{\mathrm{MCAP}} = \operatorname{argmax}_\theta \left[\prod_{i=1}^{m} P_\theta(y^{(i)} \mid x^{(i)})\right] P(\theta)$
Prior 𝑃(𝜃)
• Common choice of 𝑃(𝜃):
• Normal distribution, zero mean, identity covariance
• “Pushes” parameters towards zeros
• Corresponds to Regularization
• Helps avoid very large weights and overfitting
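To make "corresponds to regularization" concrete (a sketch of the standard algebra, not spelled out in the lecture): with a zero-mean Gaussian prior $P(\theta) \propto \exp(-\|\theta\|^2 / 2\sigma^2)$, taking logarithms in the MCAP objective gives
$\theta_{\mathrm{MCAP}} = \operatorname{argmax}_\theta \sum_{i=1}^{m} \log P_\theta(y^{(i)} \mid x^{(i)}) - \dfrac{1}{2\sigma^2}\|\theta\|^2 = \operatorname{argmin}_\theta \; J(\theta) + \dfrac{\lambda}{2m}\|\theta\|^2, \quad \lambda = \dfrac{1}{\sigma^2}$,
so a stronger prior (smaller $\sigma^2$) means a larger penalty pushing the weights towards zero.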
MLE vs. MAP
• Maximum conditional likelihood estimate (MCLE): gradient descent update on $J(\theta)$
$\theta_j := \theta_j - \alpha \dfrac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right) x_j^{(i)}$
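A compact batch gradient descent sketch for this update (illustrative names; the optional λ-term is the standard L2 regularization implied by the Gaussian prior above, not an update written out in the lecture):

```python
import numpy as np

def gradient_descent(X, y, alpha=0.1, lam=0.0, iters=1000):
    """Repeat: theta_j := theta_j - alpha * [ (1/m) * sum_i (h(x_i) - y_i) * x_ij + (lam/m) * theta_j ]."""
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(iters):
        h = 1.0 / (1.0 + np.exp(-X @ theta))   # h_theta(x) for all m examples
        grad = X.T @ (h - y) / m               # gradient of J(theta)
        grad[1:] += (lam / m) * theta[1:]      # L2 penalty (skip the bias theta_0)
        theta -= alpha * grad
    return theta
```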
Multi-class classification
• Email foldering/tagging: Work, Friends, Family, Hobby
[Figure: binary classification data vs. multi-class classification data plotted in the $(x_1, x_2)$ plane]
One-vs-all (one-vs-rest)
[Figure: the three-class training set is split into three binary problems, one per class; classifier $h_\theta^{(1)}(x)$ separates class 1 from the rest, $h_\theta^{(2)}(x)$ separates class 2, and $h_\theta^{(3)}(x)$ separates class 3, each in the $(x_1, x_2)$ plane]
$h_\theta^{(i)}(x) = P(y = i \mid x; \theta) \quad (i = 1, 2, 3)$
One-vs-all
• Train a logistic regression classifier $h_\theta^{(i)}(x)$ for each class $i$ to predict the probability that $y = i$
• On a new input $x$, predict the class $i$ whose classifier $h_\theta^{(i)}(x)$ is largest
Prediction with a generative model (e.g. Naïve Bayes): $\hat{y} = \operatorname{argmax}_y P(Y = y)\, P(X = x \mid Y = y)$
Prediction with a discriminative model (e.g. logistic regression): $\hat{y} = \operatorname{argmax}_y P(Y = y \mid X = x)$
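A one-vs-all sketch in the same spirit (illustrative names; `fit_binary` stands for any binary logistic regression trainer, e.g. the gradient_descent sketch above):

```python
import numpy as np

def train_one_vs_all(X, y, classes, fit_binary):
    """Fit one binary logistic regression classifier per class i (class i vs. the rest)."""
    return {c: fit_binary(X, (y == c).astype(float)) for c in classes}

def predict_one_vs_all(models, x):
    """Predict the class i whose classifier h^(i)(x) gives the highest probability."""
    probs = {c: 1.0 / (1.0 + np.exp(-(theta @ x))) for c, theta in models.items()}
    return max(probs, key=probs.get)
```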
Things to remember
• Hypothesis representation: $h_\theta(x) = \dfrac{1}{1 + e^{-\theta^\top x}}$
• Cost function: $\mathrm{Cost}(h_\theta(x), y) = \begin{cases} -\log(h_\theta(x)) & \text{if } y = 1 \\ -\log(1 - h_\theta(x)) & \text{if } y = 0 \end{cases}$
Disadvantages:
– Linear decision boundary