0% found this document useful (0 votes)
13 views2 pages

What Is Linear Regression

Uploaded by

Shazia Siddiqui
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views2 pages

What Is Linear Regression

Uploaded by

Shazia Siddiqui
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

What is Linear Regression?

Linear Regression is a statistical supervised learning technique to predict the quantitative


variable by forming a linear relationship with one or more independent features.
It helps determine:
→ If a independent variable does a good job in predicting the dependent variable.
→ Which independent variable plays a significant role in predicting the dependent variable.

Now,As you know most of the algorithm works with some kind of assumptions.So, Before
moving on here is the list of assumptions of the Linear Regression.
These assumptions should be kept in mind when performing Linear Regression analysis so that
the model performs it’s best.

Assumptions of Linear Regression:


 The Independent variables should be linearly related to the dependent variables.
This can be examined with the help of several visualization techniques like: Scatter plot
or maybe you can use Heatmap or pairplot(to visualize every features in the data in one
particular plot).
 Every feature in the data is Normally Distributed.
This again can be checked with the help of different visualization Techniques,such as Q-
Q plot,histogram and much more.
 There should be little or no multi-collinearity in the data.
The best way to check the prescence of multi-collinearity is to perform VIF(Variance
Inflation Factor).
 The mean of the residual is zero.
A residual is the difference between the observed y-value and the predicted y-
value.However, Having residuals closer to zero means the model is doing great.
 Residuals obatined should be normally distributed.
This can be verified using the Q-Q Plot on the residuals.
 Variance of the residual throughout the data should be same.This is known as
homoscedasticity.
This can be checked with the help of residual vs fitted plot.
 There should be little or no Auto-Correlation is the data.
Auto-Correlation Occurs when the residuals are not independent of each other.This
usally takes place in time series analysis.
You can perform Durbin-Watson test or plot ACF plot to check for the autocorrelation.If
the value of Durbin-Watson test is 2 then that means no autocorrelation,If value < 2 then
there is positive correlation and if the value is between >2 to 4 then there is negative
autocorrelation.

If the features in the dataset are not normally distributed try out different transformation
techniques to transform the distribution of the features present in the data.
Types of Linear Regression
→ Simple Linear Regression:
Simple Linear Regression helps to find the linear relationship between two continuous
variables,One independent and one dependent feature.
Formula can be represented as y=mx+b

→ Multiple Linear Regression:


Multiple linear Regression is the most common form of linear regression analysis. As a
predictive analysis, the multiple linear regression is used to explain the relationship between one
continuous dependent variable and two or more independent variables.
The independent variables can be continuous or categorical (dummy coded as appropriate).
We Often use Multiple Linear Regression to do any kind of predictive analysis as the data we get
has more than 1 independent features to it.
Formula can be represented as Y=mX1+mX2+mX3…+b,Or

You might also like