Linear Regression

What is Linear Regression?

Linear Regression is a supervised learning algorithm in machine learning, which is widely used for solving regression problems. Regression is a type of machine learning problem where the goal is to predict a continuous output variable based on one or more input variables.

In Linear Regression, the goal is to find the best-fitting linear equation to describe the relationship between the input variables (also known as predictors or features) and the output variable (also known as the response variable).
The equation for a simple linear regression model can be written as follows:

y = b0 + b1 * x

Here, y is the dependent variable (the variable we are trying to predict), x is the independent variable (the predictor or feature), b0 is the intercept term (the value of y when x is zero), and b1 is the slope coefficient (the change in y for a unit change in x).

The goal of Linear Regression is to find the best values for b0 and b1 such that the line best fits the data points, minimizing the errors, i.e. the difference between the predicted values and the actual values.
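For instance, here is a minimal sketch in Python (the data points are made up for illustration) that estimates b0 and b1 from a handful of points with numpy's polyfit and then predicts y for a new x:

import numpy as np

# Made-up data points for illustration
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([3.1, 4.9, 7.2, 8.8, 11.1])

# Fit a degree-1 polynomial; coefficients come back highest power first: [b1, b0]
b1, b0 = np.polyfit(x, y, 1)
print(b0, b1)

# Predict y for a new input x = 6 using y = b0 + b1 * x
print(b0 + b1 * 6.0)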

Types of Linear Regression

There are two main types of Linear Regression models: Simple Linear Regression and Multiple Linear Regression.

Simple Linear Regression: In simple linear regression, there is only one independent variable (also known as the predictor or feature) and one dependent variable (also known as the response variable). The goal of simple linear regression is to find the best-fitting line to describe the relationship between the independent and dependent variables. The equation for a simple linear regression model can be written as:

Y = b0 + b1 * X

Here, Y is the dependent variable, X is the independent variable, b0 is the intercept term, and b1 is the slope coefficient.

Multiple Linear Regression: In multiple linear regression, there are multiple independent variables and one dependent variable. The goal of multiple linear regression is to find the best-fitting line to describe the relationship between the independent variables and the dependent variable. The equation for a multiple linear regression model can be written as:

Y = b0 + b1 * X1 + b2 * X2 + … + bn * Xn

Here, Y is the dependent variable, X1, X2, …, Xn are the independent variables, b0 is the intercept term, and b1, b2, …, bn are the slope coefficients.
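To make the multiple-variable case concrete, here is a minimal sketch (the data is made up for illustration) that estimates b0, b1 and b2 with numpy's least-squares solver:

import numpy as np

# Made-up data: two independent variables, one dependent variable
X1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
X2 = np.array([2.0, 1.0, 4.0, 3.0, 5.0])
Y = np.array([6.1, 7.9, 13.2, 14.8, 20.1])

# Design matrix with a leading column of ones for the intercept b0
A = np.column_stack([np.ones_like(X1), X1, X2])

# Solve for [b0, b1, b2] in the least-squares sense
coeffs, _, _, _ = np.linalg.lstsq(A, Y, rcond=None)
b0, b1, b2 = coeffs
print(b0, b1, b2)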
In both types of linear regression, the goal is to find the best values for the intercept and slope coefficients that minimize the difference between the predicted values and the actual values. Linear regression is widely used in many real-world applications, such as finance, marketing, and healthcare, for predicting outcomes such as stock prices, customer behavior, and patient outcomes.

Linear Regression Line

In machine learning, a regression line can show two types of relationships between the input variables (also known as predictors or features) and the output variable (also known as the response variable) in a linear regression model.

● Positive Relationship: A positive relationship exists between the input variables and the output variable when the slope of the regression line is positive. In other words, as the values of the input variables increase, the value of the output variable also increases. This can be seen as an upward slope on a scatter plot of the data.

● Negative Relationship: A negative relationship exists between the input variables and the output variable when the slope of the regression line is negative. In other words, as the values of the input variables increase, the value of the output variable decreases. This can be seen as a downward slope on a scatter plot of the data.
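As a quick visual check, here is a minimal sketch (with made-up data) that plots one positively and one negatively sloped relationship side by side:

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 10, 30)
rng = np.random.default_rng(0)
y_pos = 2 * x + rng.normal(0, 1, x.size)  # upward slope: positive relationship
y_neg = -2 * x + rng.normal(0, 1, x.size)  # downward slope: negative relationship

plt.scatter(x, y_pos, label='positive relationship')
plt.scatter(x, y_neg, label='negative relationship')
plt.legend()
plt.show()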


Finding the best fit line

In machine learning, finding the best-fitting line is crucial in linear regression, as it determines the accuracy of the predictions made by the model. The best-fitting line is the line that has the smallest difference between the predicted values and the actual values.

To find the best-fitting line in a linear regression model, we use a process called “ordinary least squares (OLS) regression”. This process involves calculating the sum of the squared differences between the predicted values and the actual values for each data point, and then finding the line that minimizes this sum of squared errors.


The best-fitting line is found by minimizing the residual sum of squares (RSS), which is the sum of the squared differences between the predicted values and the actual values. This is achieved by adjusting the values of the intercept and slope coefficients, also known as c and m, respectively.

Once the values of c and m are determined, we can use the linear regression equation to make predictions for new data points. The equation for a simple linear regression model can be written as:

y = c + m * x

Here, y is the dependent variable (the variable we are trying to predict), x is the independent variable (the predictor or feature), c is the intercept term (the value of y when x is zero), and m is the slope coefficient (the change in y for a unit change in x).

In multiple linear regression, the equation would have more independent variables, and the slope coefficients for each variable would be included in the equation.

Overall, finding the best-fitting line in a linear regression model is critical for accurate predictions and is achieved by minimizing the residual sum of squares using the OLS regression method.
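For simple linear regression, the OLS solution even has a closed form: m = Σ(xᵢ - x̄)(yᵢ - ȳ) / Σ(xᵢ - x̄)² and c = ȳ - m·x̄, where x̄ and ȳ are the sample means. Here is a minimal sketch of these formulas (the data is made up for illustration):

import numpy as np

# Made-up data for illustration
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 4.0, 6.2, 7.9, 10.1])

# Closed-form OLS estimates for y = c + m * x
m = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
c = y.mean() - m * x.mean()
print(m, c)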

Gradient Descent: Linear Regression

In this tutorial you can learn how the gradient descent algorithm works and implement it from scratch in Python. First we look at what linear regression is, then we define the loss function. We learn how the gradient descent algorithm works, and finally we will implement it on a given data set and make predictions.


Linear Regression

In statistics, linear regression is a linear approach to modeling the relationship between a dependent variable and one or more independent variables. Let X be the independent variable and Y be the dependent variable. We will define a linear relationship between these two variables as follows:

Y = m * X + c

This is the equation for a line that you studied in high school. m is the slope of the line and c is the y-intercept. Today we will use this equation to train our model with a given dataset and predict the value of Y for any given value of X. Our challenge today is to determine the values of m and c such that the line corresponding to those values is the best-fitting line, or gives the minimum error.

Loss Function

The loss is the error in our predicted values of m and c. Our goal is to minimize this error to obtain the most accurate values of m and c.

We will use the Mean Squared Error function to calculate the loss. There are three steps in this function:

1. Find the difference between the actual y and the predicted y value (ȳ = mx + c) for a given x.
2. Square this difference.
3. Find the mean of the squares for every value in X.

The Mean Squared Error equation is:

E = (1/n) Σ (yᵢ - ȳᵢ)²

Here yᵢ is the actual value, ȳᵢ is the predicted value, and the sum runs over all n data points. Let's substitute the value of ȳᵢ = mxᵢ + c:

E = (1/n) Σ (yᵢ - (mxᵢ + c))²

So we square the error and find the mean, hence the name Mean Squared Error. Now that we have defined the loss function, let's get into the interesting part: minimizing it and finding m and c.
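As a minimal sketch (assuming numpy arrays of equal length), the loss can be computed like this:

import numpy as np

def mse(y_actual, y_pred):
    # Mean of the squared differences between actual and predicted values
    return np.mean((y_actual - y_pred) ** 2)

# Example with made-up data and a made-up line ȳ = 2x + 1
x = np.array([1.0, 2.0, 3.0])
y = np.array([3.2, 4.8, 7.1])
print(mse(y, 2 * x + 1))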
The Gradient Descent Algorithm

Gradient descent is an iterative optimization algorithm to find the minimum of a function. Here that function is our loss function.

Understanding Gradient Descent

Imagine a valley and a person with no sense of direction who wants to get to the bottom of the valley. He goes down the slope and takes large steps when the slope is steep and small steps when the slope is less steep. He decides his next position based on his current position and stops when he gets to the bottom of the valley, which was his goal.

Let's try applying gradient descent to m and c and approach it step by step:

1. Initially let m = 0 and c = 0. Let L be our learning rate. This controls how much the value of m changes with each step. L could be a small value like 0.0001 for good accuracy.

2. Calculate the partial derivative of the loss function with respect to m, and plug in the current values of x, y, m and c to obtain the derivative value Dₘ:

Dₘ = (-2/n) Σ xᵢ (yᵢ - ȳᵢ)

Dₘ is the value of the partial derivative with respect to m. Similarly, let's find the partial derivative with respect to c, Dc:

Dc = (-2/n) Σ (yᵢ - ȳᵢ)

3. Now we update the current values of m and c using the following equations:

m = m - L × Dₘ
c = c - L × Dc

4. We repeat this process until our loss function is a very small value or ideally 0 (which means 0 error or 100% accuracy). The values of m and c that we are left with now will be the optimum values.

Now going back to our analogy, m can be considered the current position of the person. D is equivalent to the steepness of the slope and L can be the speed with which he moves. The new value of m that we calculate using the above equation will be his next position, and L×D will be the size of the steps he takes. When the slope is steeper (D is larger) he takes longer steps, and when it is less steep (D is smaller) he takes smaller steps. Finally he arrives at the bottom of the valley, which corresponds to our loss = 0.

Now with the optimum values of m and c, our model is ready to make predictions!

Implementing the Model

Now let's convert everything above into code and see our model in action!

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

plt.rcParams['figure.figsize'] = (12.0, 9.0)

# Preprocessing input data
data = pd.read_csv('data.csv')
X = data.iloc[:, 0]
Y = data.iloc[:, 1]
plt.scatter(X, Y)
plt.show()
# Building the model
m = 0
c = 0

L = 0.0001  # The learning rate
epochs = 1000  # The number of gradient descent iterations to perform

n = float(len(X))  # Number of elements in X

# Performing gradient descent
for i in range(epochs):
    Y_pred = m * X + c  # The current predicted value of Y
    D_m = (-2/n) * sum(X * (Y - Y_pred))  # Partial derivative with respect to m
    D_c = (-2/n) * sum(Y - Y_pred)  # Partial derivative with respect to c
    m = m - L * D_m  # Update m
    c = c - L * D_c  # Update c

print(m, c)

1.4796491688889395 0.10148121494753726

# Making predictions
Y_pred = m * X + c

plt.scatter(X, Y)
plt.plot([min(X), max(X)], [min(Y_pred), max(Y_pred)], color='red')  # regression line
plt.show()
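With m and c learned, a prediction for any new input follows directly from the line equation (the input value below is made up for illustration):

# Predicting Y for a single new input
x_new = 55.0
y_new = m * x_new + c
print(y_new)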
Gradient descent is one of the simplest and most widely used algorithms in machine learning, mainly because it can be applied to optimize almost any differentiable function. Learning it lays the foundation for mastering machine learning.
