
CSCI417

Machine Learning

Lecture # 6
Spring 2024

1
Tentative Course Topics

1. Machine Learning Basics
2. Classifying with k-Nearest Neighbors
3. Splitting datasets one feature at a time: decision trees
4. Classifying with probability theory: naïve Bayes
5. Linear/Logistic regression
6. Support vector machines
7. Model Evaluation and Improvement: cross-validation, grid search, evaluation metrics, and scoring
8. Ensemble learning and improving classification with the AdaBoost meta-algorithm
9. Introduction to Neural Networks: building NNs for classification (binary/multiclass)
10. Convolutional Neural Networks (CNN)
11. Pretrained models (VGG, AlexNet, ...)
12. Machine learning pipeline and use cases

2
Agenda
• Classification Problem: Logistic Regression
• Regularization
  - Regularization for Linear Regression
  - Regularization for Logistic Regression

3
Classification Problem:

Logistic Regression

4
Classification
- Discrete outcomes.
- Binary: y ∈ {0, 1}, where 0 is the negative class and 1 is the positive class
  (e.g., normal / abnormal).
- Multi-class: e.g., a telescope that identifies whether an object in the night
  sky is a galaxy, a star, or a planet.

5
Hypothesis Representation
• Logistic regression model:

  h_θ(x) = g(θᵀx),  where  g(z) = 1 / (1 + e⁻ᶻ)

  g is the sigmoid (logistic) function.

[Figure: sigmoid curve g(θᵀx) plotted against θᵀx; it crosses 0.5 at θᵀx = 0 and asymptotes to 0 and 1]

• We want our classifier to output values between 0 and 1.
• With linear regression we used h_θ(x) = θᵀx.
• For the classification hypothesis we use h_θ(x) = g(θᵀx).
7
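A minimal sketch of the hypothesis above, assuming NumPy; the names `sigmoid` and `hypothesis` are illustrative, not from the slides.

```python
import numpy as np

def sigmoid(z):
    """Logistic (sigmoid) function g(z) = 1 / (1 + e^{-z}); outputs lie in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def hypothesis(theta, x):
    """Logistic regression hypothesis h_theta(x) = g(theta^T x)."""
    return sigmoid(np.dot(theta, x))

# The sigmoid crosses 0.5 exactly when theta^T x = 0.
print(sigmoid(0.0))   # 0.5
print(sigmoid(5.0))   # close to 1
print(sigmoid(-5.0))  # close to 0
```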
Interpretation of Hypothesis Output
• h_θ(x) gives us the probability that the output is 1:
  h_θ(x) = P(y = 1 | x; θ)
For example, h_θ(x) = 0.7 gives us a probability of 70% that the output is 1.

• The probability that the prediction is 0 is just the complement of the probability that it is 1.
For example, if the probability that y = 1 is 0.7, then
the probability that y = 0 is 1 − 0.7 = 0.3.

8
Interpretation of Hypothesis Output
• h_θ(x) gives us the probability that the output is 1.
For example, with features x = [1, tumourSize]ᵀ and h_θ(x) = 0.7,
there is a 70% chance of the tumor being malignant.

• P(y = 0 | x; θ) = 1 − P(y = 1 | x; θ)
  h_θ(x) = P(y = 1 | x; θ): the probability that y = 1, given x, parameterized by θ.

9
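A small sketch of this interpretation; the parameter values and tumour size below are made-up placeholders chosen so that h_θ(x) comes out near 0.7, matching the slide's example.

```python
import numpy as np

# Hypothetical parameters and input x = [1, tumourSize]; the numbers are illustrative only.
theta = np.array([-6.0, 0.12])
x = np.array([1.0, 57.0])            # x_0 = 1 (intercept term), x_1 = tumour size

p_malignant = 1.0 / (1.0 + np.exp(-theta @ x))   # h_theta(x) = P(y = 1 | x; theta), roughly 0.7
p_benign = 1.0 - p_malignant                     # P(y = 0 | x; theta), the complement

# A common decision rule: predict "malignant" (y = 1) when h_theta(x) >= 0.5.
prediction = int(p_malignant >= 0.5)
print(p_malignant, p_benign, prediction)
```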
Binary logistic regression
• We have a set of feature vectors X with corresponding binary outputs y ∈ {0, 1}.

• We want to model p(y|x).

• By definition, a probability is restricted to the range [0, 1].
• We want to transform the probability to remove this range restriction, so that the
transformed quantity can take any real value.

10
Using ODDS
• We have a set of feature vectors X with corresponding binary outputs y ∈ {0, 1},
and we want to model p(y|x).

• By definition, the probability p is restricted to [0, 1].
• For p ∈ (0, 1), the odds p / (1 − p) remove the upper bound (they range over (0, ∞)),
and the log-odds, log(p / (1 − p)), can take any real value.

11
Hypothesis function (proof)
• We want to model p(y|x) with a linear function of the features, but a probability is
restricted to [0, 1] while θᵀx can take any real value.
• The transformation above (probability → odds → log-odds) removes the range
restriction, so we can set the log-odds equal to θᵀx and solve back for p
(see the derivation sketched below).

12
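The derivation sketched on this slide can be written out as follows; this is the standard odds/log-odds argument for the logistic hypothesis, reconstructed here since the slide's equations are not reproduced in the text.

```latex
\begin{align*}
p &= P(y = 1 \mid x) \in (0, 1) \\
\text{odds:}\quad \frac{p}{1-p} &\in (0, \infty) \\
\text{log-odds (logit):}\quad \log\frac{p}{1-p} &\in (-\infty, \infty) \\
\text{model the logit linearly:}\quad \log\frac{p}{1-p} &= \theta^{T}x \\
\frac{p}{1-p} &= e^{\theta^{T}x} \\
p &= \frac{e^{\theta^{T}x}}{1+e^{\theta^{T}x}}
   = \frac{1}{1+e^{-\theta^{T}x}} = h_{\theta}(x)
\end{align*}
```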
Hypothesis function

14
Maximum Likelihood Estimation (MLE)

15
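The slide's equations are not reproduced in the text, but the standard MLE formulation for logistic regression maximizes the log-likelihood, or equivalently minimizes the average negative log-likelihood (cross-entropy). A minimal sketch, with all names being illustrative:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def negative_log_likelihood(theta, X, y):
    """Cost J(theta) = -(1/m) * sum[ y*log(h) + (1-y)*log(1-h) ].

    X: (m, n) design matrix, y: (m,) vector of 0/1 labels.
    """
    m = X.shape[0]
    h = sigmoid(X @ theta)
    # Clip to avoid log(0) for numerically extreme predictions.
    h = np.clip(h, 1e-12, 1.0 - 1e-12)
    return -(1.0 / m) * np.sum(y * np.log(h) + (1.0 - y) * np.log(1.0 - h))
```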
Gradient Descent for Logistic Regression

16
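A minimal sketch of batch gradient descent for the cost above; the learning rate, iteration count, and function names are assumptions for illustration, not taken from the slides.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_descent(X, y, alpha=0.1, num_iters=1000):
    """Batch gradient descent for (unregularized) logistic regression.

    Update rule: theta_j := theta_j - (alpha/m) * sum_i (h_theta(x_i) - y_i) * x_ij
    """
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(num_iters):
        h = sigmoid(X @ theta)             # predictions, shape (m,)
        gradient = (X.T @ (h - y)) / m     # shape (n,)
        theta -= alpha * gradient
    return theta
```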
Multiclass Classification (one-vs-all)
• Train a separate logistic regression classifier h_θ^(i)(x) for each class i,
treating class i as the positive class and everything else as "not class i":
  h_θ^(i)(x) = P(y = i | x; θ)
• For a new input x, pick the class i that maximizes h_θ^(i)(x).

Suppose you have a multi-class classification problem with k classes
(so y ∈ {1, 2, ⋯, k}). Using the one-vs-all method, how many different
logistic regression classifiers will you end up training?
23
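A sketch of one-vs-all built on top of a binary trainer such as the `gradient_descent` helper above (one classifier per class, i.e. k classifiers in total). Classes are assumed encoded as 0, …, k−1 here, and the function names are illustrative.

```python
import numpy as np

def one_vs_all(X, y, num_classes, train_binary):
    """Train one binary logistic regression classifier per class (k classifiers total).

    train_binary(X, y_binary) should return a parameter vector theta.
    """
    all_theta = []
    for i in range(num_classes):
        y_binary = (y == i).astype(float)    # class i vs. "not class i"
        all_theta.append(train_binary(X, y_binary))
    return np.array(all_theta)               # shape (k, n)

def predict_one_vs_all(all_theta, X):
    """Pick the class i whose classifier output h_theta^(i)(x) is largest."""
    scores = 1.0 / (1.0 + np.exp(-(X @ all_theta.T)))   # (m, k) probabilities
    return np.argmax(scores, axis=1)
```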
The Problem of Overfitting
• Underfitting, or high bias, is when the
form of our hypothesis function
maps poorly to the trend of the data.
It is usually caused by a function that is
too simple or uses too few features.
• At the other extreme, overfitting, or
high variance, is caused by a
hypothesis function that fits the
available data but does not generalize
well to predict new data. It is usually
caused by a complicated function that
creates a lot of unnecessary curves and
angles unrelated to the data.

24
Addressing overfitting
• There are two main options to address the issue of overfitting:
1) Reduce the number of features:
– Manually select which features to keep.
– Use a model selection algorithm.
2) Regularization
– Keep all the features, but reduce the magnitude of the parameters θⱼ.
– Regularization works well when we have a lot of slightly useful features.

25
Regularization (To avoid overfitting)
Regularization for Linear Regression

26
Regularization Intuition

[Figure: two fits of housing Price vs. Size of house, one smooth and one overfit by extra higher-order terms]

• Suppose we penalize some of the parameters (the higher-order ones in the fit above) and make them really small.

• Small values for the parameters give
  - a simpler hypothesis
  - that is less prone to overfitting.
Regularization for Linear Regression
• Simpler hypothesis ⇒ small values of θ₁, θ₂, …, θₙ

  J(θ) = (1 / 2m) [ Σᵢ₌₁ᵐ (h_θ(x⁽ⁱ⁾) − y⁽ⁱ⁾)² + λ Σⱼ₌₁ⁿ θⱼ² ]

  Minimizing J(θ) now favours choosing small θⱼ.

• λ (lambda) is the regularization parameter.
  It determines how much the costs of our theta parameters are inflated.
• Example: we have two candidate sets of parameters, θ = [1.35, 3.5] and θ = [45.2, 75.6].
  - If λ is chosen to be 0, the cost function acts as usual, with no penalty on θ
    ⇒ the large values θ = [45.2, 75.6] may be chosen.
  - If λ is chosen to be large, small values of θ are chosen instead of large ones
    ⇒ the small values θ = [1.35, 3.5] are chosen.
28
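A minimal sketch of the regularized cost above, assuming (as the sum from j = 1 suggests, and as is conventional) that the bias term θ₀ is not penalized; the function and variable names are illustrative.

```python
import numpy as np

def regularized_linear_cost(theta, X, y, lam):
    """J(theta) = (1/2m) * [ sum_i (h(x_i) - y_i)^2 + lam * sum_{j>=1} theta_j^2 ]."""
    m = X.shape[0]
    residuals = X @ theta - y                  # h_theta(x_i) - y_i for each example
    penalty = lam * np.sum(theta[1:] ** 2)     # theta_0 is conventionally not regularized
    return (np.sum(residuals ** 2) + penalty) / (2 * m)
```

With lam = 0 this reduces to the ordinary least-squares cost; larger lam makes large parameter values increasingly expensive.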
Regularization for Linear Regression

  J(θ) = (1 / 2m) [ Σᵢ₌₁ᵐ (h_θ(x⁽ⁱ⁾) − y⁽ⁱ⁾)² + λ Σⱼ₌₁ⁿ θⱼ² ]

• What if λ is set to an extremely large value (perhaps too large for our problem,
say λ = 10¹⁰)?
⇒ It may smooth out the function too much and cause underfitting. Why?
With such a huge penalty, all parameters θ₁, …, θₙ are driven towards 0,
leaving h_θ(x) ≈ θ₀: a flat line that underfits the data.

[Figure: Price vs. Size of house with the training points and a nearly flat fitted line, labelled "underfit"]
29
Regularization for Linear Regression
• Gradient descent (repeat until convergence):

  θ₀ := θ₀ − α (1/m) Σᵢ₌₁ᵐ (h_θ(x⁽ⁱ⁾) − y⁽ⁱ⁾) x₀⁽ⁱ⁾
  θⱼ := θⱼ (1 − α λ/m) − α (1/m) Σᵢ₌₁ᵐ (h_θ(x⁽ⁱ⁾) − y⁽ⁱ⁾) xⱼ⁽ⁱ⁾     (j = 1, …, n)

  The factor (1 − α λ/m) is slightly less than 1, so each update shrinks θⱼ a little
  before applying the usual gradient step.
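A sketch of this regularized update rule; note the (1 − αλ/m) shrinkage factor, which is the "< 1" quantity referred to above. The default hyperparameter values are placeholders.

```python
import numpy as np

def regularized_gradient_descent(X, y, lam, alpha=0.01, num_iters=1000):
    """Gradient descent for regularized linear regression.

    theta_0 := theta_0 - (alpha/m) * sum_i (h(x_i) - y_i) * x_i0
    theta_j := theta_j * (1 - alpha*lam/m) - (alpha/m) * sum_i (h(x_i) - y_i) * x_ij   (j >= 1)
    """
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(num_iters):
        error = X @ theta - y                   # (m,) residuals
        grad = (X.T @ error) / m                # (n,) unregularized gradient
        theta[0] -= alpha * grad[0]             # bias term: no shrinkage
        theta[1:] = theta[1:] * (1 - alpha * lam / m) - alpha * grad[1:]
    return theta
```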
Regularization for Logistic Regression

[Figure: a two-feature (x₁, x₂) dataset with a decision boundary; regularization keeps the boundary from becoming overly complex]

Cost function:

  J(θ) = −(1/m) Σᵢ₌₁ᵐ [ y⁽ⁱ⁾ log h_θ(x⁽ⁱ⁾) + (1 − y⁽ⁱ⁾) log(1 − h_θ(x⁽ⁱ⁾)) ] + (λ / 2m) Σⱼ₌₁ⁿ θⱼ²

The added penalty term again encourages small values of the parameters.
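Finally, a minimal sketch of the regularized logistic regression cost above; θ₀ is again left out of the penalty, and all names are placeholders.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def regularized_logistic_cost(theta, X, y, lam):
    """J(theta) = -(1/m) sum[ y*log(h) + (1-y)*log(1-h) ] + (lam/2m) sum_{j>=1} theta_j^2."""
    m = X.shape[0]
    h = np.clip(sigmoid(X @ theta), 1e-12, 1.0 - 1e-12)   # clip to avoid log(0)
    cross_entropy = -(y * np.log(h) + (1.0 - y) * np.log(1.0 - h)).mean()
    penalty = (lam / (2 * m)) * np.sum(theta[1:] ** 2)    # theta_0 not penalized
    return cross_entropy + penalty
```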
