Unit 2
Logistic regression
What is logistic regression?
Logistic regression is a statistical method used for binary classification. It predicts the probability that a given input belongs to a particular category. The output is a probability value between 0 and 1, which is then mapped to one of two classes (e.g., 0 or 1).
Logistic regression uses the logistic function (also known as the sigmoid function) to model the
probability of the default class. The logistic function is defined as:
P(X) = 1 / (1 + e^(−(β0 + β1·X)))
where:
( P(X) ) is the probability of the outcome being 1 (or the positive class).
( \beta_0 ) is the intercept and ( \beta_1 ) is the coefficient of the input ( X ).
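As a quick sketch of the formula, the following Python snippet evaluates the logistic function; the coefficient values are the illustrative β0 = -4, β1 = 1 used in the worked example below:

import math

def sigmoid(z):
    # Logistic (sigmoid) function: maps any real number into (0, 1)
    return 1 / (1 + math.exp(-z))

# Illustrative coefficients taken from the worked example below
beta0, beta1 = -4, 1
x = 4  # hours studied
print(sigmoid(beta0 + beta1 * x))  # 0.5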
Scenario: We want to predict whether a student will pass an exam based on the number of hours
they studied.
Data: We have data from 20 students, including the number of hours they studied and whether they
passed the exam (Pass = 1, Fail = 0).
Hours Studied   Pass
0               0
1               0
2               0
3               0
4               1
Steps:
1. Model Specification:
We model the probability of passing as
P(Pass) = 1 / (1 + e^(−(β0 + β1·Hours)))
2. Parameter Estimation:
Using the data, we estimate the parameters ( \beta_0 ) and ( \beta_1 ). Let’s assume we get:
( \beta_0 = -4 )
( \beta_1 = 1 )
3. Prediction:
For a student who studied 4 hours:
P(Pass) = 1 / (1 + e^(−(−4 + 1·4))) = 1 / (1 + e^0) = 1/2 = 0.5
Interpretation:
Coefficients: ( \beta_0 ) is the intercept, and ( \beta_1 ) is the coefficient for the number of hours
studied.
Probability: The logistic function transforms the linear combination of inputs into a probability
between 0 and 1.
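A minimal sketch of fitting this model with scikit-learn, assuming only the five example rows shown above as training data (the fitted coefficients will not exactly match the illustrative β0 = -4, β1 = 1):

import numpy as np
from sklearn.linear_model import LogisticRegression

# Hours studied (feature) and pass/fail labels from the table above
X = np.array([[0], [1], [2], [3], [4]])
y = np.array([0, 0, 0, 0, 1])

model = LogisticRegression()
model.fit(X, y)

# Predicted probability of passing for a student who studied 4 hours
print(model.predict_proba([[4]])[0, 1])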
Ridge regression
Ridge regression is a regularization technique used in linear regression. Here are the key points:
1. Objective:
o Ridge regression aims to prevent overfitting by adding a penalty term to the
ordinary least squares (OLS) cost function.
o The penalty term encourages the model to have smaller coefficient values.
2. Cost Function:
o The ridge cost function is:
Cost = RSS(w) + λ Σ_{j=1}^{D} w_j²
where RSS(w) is the residual sum of squares and λ ≥ 0 controls the strength of the penalty.
Implementation Example:
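A minimal sketch using scikit-learn’s Ridge, where the alpha parameter plays the role of λ in the cost function above (the data here is made up for illustration):

import numpy as np
from sklearn.linear_model import Ridge

# Made-up data: y ≈ 3·x plus noise
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(50, 1))
y = 3 * X.ravel() + rng.normal(0, 1, size=50)

# alpha corresponds to the penalty strength λ; larger alpha shrinks coefficients more
model = Ridge(alpha=1.0)
model.fit(X, y)
print(model.coef_, model.intercept_)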
Lasso regression
1. Objective:
o Lasso aims to find a balance between model simplicity and accuracy.
o It adds a penalty term to the traditional linear regression model.
2. How It Works:
o The linear regression equation is:
y = w_0 + w_1·x_1 + … + w_D·x_D
o Lasso regression adds a penalty term to the linear regression cost function:
Cost = RSS(w) + λ Σ_{j=1}^{D} |w_j|
Ridge vs. Lasso:

Ridge regression                             Lasso regression
Adds a penalty term proportional to the      Adds a penalty term proportional to the
sum of squared coefficients                  sum of absolute values of coefficients
Suitable when all features are important     Suitable when some features are irrelevant
                                             or redundant
Performs better when there are many small    Performs better when there are a few large
to medium-sized coefficients                 coefficients
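A minimal Lasso sketch in the same style (again, scikit-learn’s alpha plays the role of λ; the data is made up, with one irrelevant feature that the L1 penalty should drive toward zero):

import numpy as np
from sklearn.linear_model import Lasso

# Made-up data: y depends only on the first feature; the second is irrelevant
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))
y = 3 * X[:, 0] + rng.normal(0, 0.5, size=100)

model = Lasso(alpha=0.1)
model.fit(X, y)
print(model.coef_)  # the coefficient of the irrelevant feature is shrunk toward 0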
Overfitting
Overfitting occurs when our machine learning model tries to cover all the data points, or more than the required data points, present in the given dataset. Because of this, the model starts capturing the noise and inaccurate values present in the dataset, and all these factors reduce the efficiency and accuracy of the model.
Example: the concept of overfitting can be understood from the graph of the linear regression output below:
[Figure: overfitted linear regression output]
Underfitting
Underfitting occurs when our machine learning model is not able to capture the underlying trend of the data. To avoid overfitting in the model, the feeding of training data can be stopped at an early stage, due to which the model may not learn enough from the training data. As a result, it may fail to find the best fit for the dominant trend in the data.
In the case of underfitting, the model is not able to learn enough from the training data, and hence it reduces the accuracy and produces unreliable predictions.
Example: we can understand underfitting using the output of the linear regression model below:
[Figure: underfitted linear regression output, in which the fitted line fails to capture the data points present in the plot]
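A minimal sketch contrasting underfitting and overfitting with polynomial fits of different degrees (plain NumPy; the quadratic data and the degree choices are made up for illustration):

import numpy as np

# Made-up data: a quadratic trend plus noise
rng = np.random.default_rng(0)
x = np.linspace(0, 1, 20)
y = 2 * x**2 + rng.normal(0, 0.05, size=20)

for degree in (1, 2, 15):  # underfit, reasonable fit, overfit
    coeffs = np.polyfit(x, y, degree)
    mse = np.mean((y - np.polyval(coeffs, x)) ** 2)
    # The degree-15 fit drives training error near zero by chasing the noise,
    # while the degree-1 fit cannot capture the underlying trend at all
    print(degree, mse)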
Standardization
Data collection
Our data can come in various formats, i.e., numbers (integers) and words (strings); for now, we’ll consider only the numbers in our dataset.
Assume our dataset has random numeric values in the range of 1 to 95,000, in random order. Just for our understanding, consider a small dataset of barely 10 values with numbers in the given range and randomized order.
1) 99
2) 789
3) 1
4) 541
5) 5
6) 6589
7) 94142
8) 7
9) 50826
10) 35464
If we just look at these values, their range is so wide that training the model with 10,000 such values will take a lot of time. That’s where the problem arises.
Understanding standardization
We have a solution to the problem just described: standardization. It helps us by
downscaling the values to a common scale, with most values falling in the range -1 to +1.
So, how do we do that? Well, there’s a mathematical formula for it:
Z-Score = (Current_value − Mean) / Standard Deviation
Using this formula, we replace every input value with its Z-Score.
Hence we get values mostly ranging from -1 to +1, while keeping the relative differences between values intact.
The transformed values always have Mean = 0 and S.D. = 1: subtracting the mean centres the data at 0, and dividing by the standard deviation rescales the spread to 1.
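A minimal sketch computing the Z-Scores for the 10 sample values above (plain NumPy; scikit-learn’s StandardScaler performs the same transformation):

import numpy as np

# The 10 sample values from the dataset above
values = np.array([99, 789, 1, 541, 5, 6589, 94142, 7, 50826, 35464], dtype=float)

# Z-Score = (Current_value − Mean) / Standard Deviation
z_scores = (values - values.mean()) / values.std()

print(z_scores)         # standardized values
print(z_scores.mean())  # ≈ 0
print(z_scores.std())   # 1.0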