Machine Learning with Python
Linear Regression in Machine Learning
Linear regression is one of the simplest and most popular Machine Learning
algorithms. It is a statistical method used for predictive analysis.
The linear regression algorithm models a linear relationship between a dependent
variable (y) and one or more independent variables (x), hence the name linear regression.
The linear regression model provides a sloped straight line representing the
relationship between the variables.
y = a0 + a1x + ε

Here,
y  = dependent (target) variable
x  = independent (predictor) variable
a0 = intercept of the line
a1 = linear regression coefficient (slope of the line)
ε  = random error
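As a quick sketch, this model can be fitted with scikit-learn's LinearRegression; the sample data and values below are illustrative, not from the text:

# Minimal sketch: fitting y = a0 + a1*x with scikit-learn
# (the data points here are made-up illustrative values).
import numpy as np
from sklearn.linear_model import LinearRegression

x = np.array([[1.0], [2.0], [3.0], [4.0], [5.0]])  # independent variable
y = np.array([2.1, 4.3, 6.2, 8.1, 9.9])            # dependent variable

model = LinearRegression().fit(x, y)
print("a0 (intercept):", model.intercept_)
print("a1 (slope):", model.coef_[0])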
When working with linear regression, our main goal is to find the best fit line,
which means the error between the predicted values and the actual values should be
minimized. The best fit line will have the least error.
Different values for the weights or coefficients of the line (a0, a1) give
different regression lines, so we need to calculate the best values for a0 and
a1 to find the best fit line. To calculate this, we use the cost function.
Cost function:
Different values for the weights or coefficients of the line (a0, a1) give different
regression lines, and the cost function is used to estimate the values of the
coefficients for the best fit line.
The cost function optimizes the regression coefficients or weights. It measures how
well a linear regression model is performing.
We can use the cost function to find the accuracy of the mapping function,
which maps the input variable to the output variable. This mapping function is
also known as the Hypothesis function.
For linear regression, we use the Mean Squared Error (MSE) cost function, which is the average
of the squared errors between the predicted values and the actual values. For the above linear
equation, MSE can be calculated as:

MSE = (1/N) Σi=1..N (yi − (a1xi + a0))²

Where,
N = total number of observations
yi = actual value
(a1xi + a0) = predicted value
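Translating the formula directly into NumPy (the data and the candidate a0, a1 values are illustrative):

# MSE = (1/N) * sum((y_i - (a1*x_i + a0))**2), computed with NumPy.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 4.3, 6.2, 8.1, 9.9])
a0, a1 = 0.2, 1.96                  # candidate intercept and slope

y_pred = a1 * x + a0                # predicted values
mse = np.mean((y - y_pred) ** 2)    # average of squared errors
print("MSE:", mse)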
Cost function-
Residuals: The distance between the actual values and the predicted values is
called the residual.
If the observed points are far from the regression line, the residuals will be
high, and so the cost function will be high.
If the scatter points are close to the regression line, the residuals will be
small, and hence the cost function will be small.
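A tiny numerical illustration (the arrays are made-up values): the residuals are just the element-wise gaps, and squaring and averaging them gives the MSE cost.

# Residuals are the gaps between actual and predicted values;
# larger residuals mean a larger MSE cost.
import numpy as np

y_actual = np.array([2.1, 4.3, 6.2, 8.1, 9.9])
y_pred = np.array([2.16, 4.12, 6.08, 8.04, 10.0])

residuals = y_actual - y_pred
print("residuals:", residuals)
print("cost (MSE):", np.mean(residuals ** 2))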
Model Performance:
The goodness of fit determines how well the regression line fits the set of
observations. The process of finding the best model out of various candidate
models is called optimization.
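The text does not name a particular optimizer; gradient descent is one common choice for minimizing the MSE cost. A minimal sketch, with an illustrative learning rate and iteration count:

# Gradient descent on the MSE cost to find a0 and a1.
# Learning rate and iteration count are illustrative choices.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 4.3, 6.2, 8.1, 9.9])

a0, a1 = 0.0, 0.0          # start from an arbitrary line
lr = 0.01                  # learning rate
for _ in range(5000):
    error = (a1 * x + a0) - y
    # Partial derivatives of MSE with respect to a0 and a1
    grad_a0 = 2 * np.mean(error)
    grad_a1 = 2 * np.mean(error * x)
    a0 -= lr * grad_a0
    a1 -= lr * grad_a1

print("a0:", a0, "a1:", a1)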
Assumptions of Linear Regression
These are some formal checks to perform while building a linear regression model,
which ensure we get the best possible result from the given dataset.
Linear relationship between the features and target:
Linear regression assumes a linear relationship between the dependent and
independent variables.
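One simple way to eyeball this assumption is a scatter plot of each feature against the target; a minimal matplotlib sketch with illustrative data:

# Visual linearity check: look for a roughly straight trend.
import numpy as np
import matplotlib.pyplot as plt

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 4.3, 6.2, 8.1, 9.9])

plt.scatter(x, y)
plt.xlabel("feature (x)")
plt.ylabel("target (y)")
plt.title("Linearity check")
plt.show()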
Small or no multicollinearity between the features:
Multicollinearity means high correlation between the independent variables. Due to
multicollinearity, it may be difficult to find the true relationship between the predictors
and the target variable; in other words, it is difficult to determine which predictor
variable is affecting the target variable and which is not. So, the model assumes
either little or no multicollinearity between the features or independent variables.
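A common way to check this (not mentioned in the text, so treat it as one option) is the variance inflation factor (VIF) from statsmodels; values well above roughly 5-10 are usually read as a warning sign. The DataFrame below is illustrative:

# VIF check for multicollinearity; x2 is nearly 2 * x1,
# so it should produce very high VIF values.
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor
from statsmodels.tools.tools import add_constant

X = pd.DataFrame({
    "x1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "x2": [2.0, 4.1, 5.9, 8.2, 9.8],
})
X_const = add_constant(X)
for i, col in enumerate(X_const.columns):
    if col != "const":
        print(col, variance_inflation_factor(X_const.values, i))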
Homoscedasticity Assumption:
Homoscedasticity is the situation in which the variance of the error term is the same for
all values of the independent variables. With homoscedasticity, there should be no clear
pattern in the distribution of data in the scatter plot.
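A standard visual check is to plot the residuals against the predicted values and look for a patternless horizontal band; a minimal sketch with made-up numbers:

# Residuals-vs-fitted plot: a flat, patternless band suggests
# constant error variance (homoscedasticity).
import numpy as np
import matplotlib.pyplot as plt

y_pred = np.array([2.16, 4.12, 6.08, 8.04, 10.0])
residuals = np.array([-0.06, 0.18, 0.12, 0.06, -0.1])

plt.scatter(y_pred, residuals)
plt.axhline(0, linestyle="--")
plt.xlabel("predicted values")
plt.ylabel("residuals")
plt.title("Residuals vs. fitted")
plt.show()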
Normal distribution of error terms:
Linear regression assumes that the error terms follow a normal distribution. If the
error terms are not normally distributed, confidence intervals will become either too wide
or too narrow, which may cause difficulties in estimating the coefficients.
This can be checked using a q-q plot. If the plot shows a straight line without major
deviations, the errors are normally distributed.
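The q-q plot mentioned above can be drawn with scipy; a minimal sketch using illustrative residuals:

# Q-Q plot of residuals against the normal distribution:
# points close to the straight line suggest normal errors.
import numpy as np
import scipy.stats as stats
import matplotlib.pyplot as plt

residuals = np.array([-0.06, 0.18, 0.12, 0.06, -0.1])
stats.probplot(residuals, dist="norm", plot=plt)
plt.title("Q-Q plot of residuals")
plt.show()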
No autocorrelation:
The linear regression model assumes no autocorrelation in the error terms. If there is any
correlation in the error terms, it will drastically reduce the accuracy of the model.
Autocorrelation usually occurs when there is a dependency between residual errors.
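One widely used numerical check for this (an assumption here, since the text names none) is the Durbin-Watson statistic from statsmodels; values near 2 suggest no autocorrelation, while values near 0 or 4 suggest positive or negative autocorrelation:

# Durbin-Watson test on the residuals (illustrative values).
import numpy as np
from statsmodels.stats.stattools import durbin_watson

residuals = np.array([-0.06, 0.18, 0.12, 0.06, -0.1])
print("Durbin-Watson:", durbin_watson(residuals))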
Logistic Regression in Machine Learning
On the basis of the categories, Logistic Regression can be classified into three
types:
Binomial: In binomial logistic regression, there can be only two possible types
of the dependent variable, such as 0 or 1, Pass or Fail, etc.
Multinomial: In multinomial logistic regression, there can be 3 or more
possible unordered types of the dependent variable, such as "cat", "dog", or
"sheep".
Ordinal: In ordinal logistic regression, there can be 3 or more possible ordered
types of the dependent variable, such as "Low", "Medium", or "High".
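As a minimal sketch, the binomial and multinomial cases can both be fitted with scikit-learn's LogisticRegression (the tiny datasets below are illustrative; ordinal logistic regression is not covered by scikit-learn directly and would need a library such as statsmodels):

# Binomial vs. multinomial logistic regression with scikit-learn.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Binomial: two classes (0 = Fail, 1 = Pass)
X_bin = np.array([[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]])
y_bin = np.array([0, 0, 0, 1, 1, 1])
clf_bin = LogisticRegression().fit(X_bin, y_bin)
print(clf_bin.predict([[3.5]]))

# Multinomial: three unordered classes ("cat", "dog", "sheep")
X_multi = np.array([[1.0], [2.0], [5.0], [6.0], [9.0], [10.0]])
y_multi = np.array(["cat", "cat", "dog", "dog", "sheep", "sheep"])
clf_multi = LogisticRegression().fit(X_multi, y_multi)
print(clf_multi.predict([[5.5]]))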
Thank You!!