PA Notes

The document outlines the steps for developing a simple linear regression (SLR) model, emphasizing that the dependent variable (Y) must be numeric. It includes the regression equation, examples of interpreting coefficients, and discusses the importance of data correlation and homoscedasticity. Additionally, it briefly mentions logistic regression and the need for dummy coding when dealing with categorical variables.

Uploaded by

Vencel Patrick


Formula for regression coefficient (y on x)
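The formula itself does not appear in these notes. For reference, the standard least-squares expressions for the regression of y on x, written in the β0, β1 notation used later in the notes, are sketched below. The symbols x̄ and Ȳ (sample means), r (correlation coefficient), and s_x, s_Y (sample standard deviations) are assumptions introduced here, not taken from the original.

```latex
\beta_1 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(Y_i - \bar{Y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}
        = r \, \frac{s_Y}{s_x},
\qquad
\beta_0 = \bar{Y} - \beta_1 \bar{x}
```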

In a linear regression model, Y must be numeric (float or integer).

You cannot build a linear regression model if Y has an object (non-numeric) type, such as text labels.

Framework for SLR model development

Step 1 - Collect/extract data
Step 2 - Preprocess the data
Step 3 - Divide the data into training and validation data
Step 4 - Perform descriptive analysis
Step 5 - Define the functional form of regression
Step 6 - Estimate regression parameters
Step 7 - Perform regression model diagnostics
Step 8 - Validate the model using validation data
Step 9 - Decide on the model deployment
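The nine steps above can be sketched end to end in a few lines. This is a minimal illustration in plain Python, not the method used in the course: the data are synthetic (generated from the marks example later in the notes), and the names `fit_slr`, `rmse`, `correlation` and the deployment threshold `MAX_ACCEPTABLE_RMSE` are hypothetical.

```python
import random

def fit_slr(xs, ys):
    """Step 6: estimate beta0 and beta1 by ordinary least squares."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    beta1 = sxy / sxx
    beta0 = y_mean - beta1 * x_mean
    return beta0, beta1

def rmse(xs, ys, beta0, beta1):
    """Root mean squared error of the fitted line on (xs, ys)."""
    squared = [(y - (beta0 + beta1 * x)) ** 2 for x, y in zip(xs, ys)]
    return (sum(squared) / len(squared)) ** 0.5

def correlation(xs, ys):
    """Pearson correlation coefficient between xs and ys."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    syy = sum((y - y_mean) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

# Steps 1-2: collect and preprocess (synthetic data, assumed for illustration).
rng = random.Random(42)
xs = [float(h) for h in range(1, 41)]
ys = [20 + 0.76 * x + rng.gauss(0, 2) for x in xs]
data = list(zip(xs, ys))

# Step 3: split into training (80%) and validation (20%) data.
rng.shuffle(data)
split = int(0.8 * len(data))
train, valid = data[:split], data[split:]
train_x, train_y = zip(*train)
valid_x, valid_y = zip(*valid)

# Step 4: descriptive analysis (here just the correlation).
r = correlation(train_x, train_y)

# Step 5: functional form is y_hat = beta0 + beta1 * x.
# Step 6: estimate the parameters on the training data.
beta0, beta1 = fit_slr(train_x, train_y)

# Step 7: diagnostics (training residuals should average to about zero).
residuals = [y - (beta0 + beta1 * x) for x, y in train]
mean_residual = sum(residuals) / len(residuals)

# Step 8: validate on the held-out data.
valid_rmse = rmse(valid_x, valid_y, beta0, beta1)

# Step 9: deployment decision against a hypothetical error threshold.
MAX_ACCEPTABLE_RMSE = 5.0
deploy = valid_rmse < MAX_ACCEPTABLE_RMSE
print(f"beta0={beta0:.3f} beta1={beta1:.3f} r={r:.3f} "
      f"valid_rmse={valid_rmse:.3f} deploy={deploy}")
```

Fitting only on the training split and scoring on the validation split is what makes Step 8 a real check: the validation error is measured on rows the estimates never saw.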
E = Yi - ŷi (the error, or residual)
Yi - actual value
ŷi - predicted value

ŷ=β0+β1x

The equation can be interpreted as follows: for every one percentage point increase in grade 10
marks, the salary of the MBA students increases by 3076.1774 on average.

E.g.
ŷ(marks) = 20 + 0.76(study hours)
For every 1-hour increase in study time, marks increase by 0.76 on average.

ŷ(yield) = 20 - 0.76(rainfall)
For every 1-unit increase in rainfall, yield decreases by 0.76 on average.
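The "per unit" reading of the slope can be checked numerically: the difference between two predictions one unit apart equals β1. A small sketch using the two example equations (the function names `predict_marks` and `predict_yield` are hypothetical):

```python
def predict_marks(study_hours):
    """y_hat(marks) = 20 + 0.76 * study hours."""
    return 20 + 0.76 * study_hours

def predict_yield(rainfall):
    """y_hat(yield) = 20 - 0.76 * rainfall."""
    return 20 - 0.76 * rainfall

# Change in the prediction when x goes up by exactly one unit.
marks_change = predict_marks(6) - predict_marks(5)
yield_change = predict_yield(6) - predict_yield(5)

print(round(marks_change, 2))  # slope of the marks equation
print(round(yield_change, 2))  # slope of the yield equation (negative)
```

The intercept 20 cancels in the difference, which is why only the slope appears in the interpretation.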
SST = SSR + SSE
SST - total sum of squares (total variation in Y)
SSR - sum of squares due to regression (explained variation)
SSE - sum of squares of errors (unexplained variation)
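The decomposition can be verified on a tiny data set. The sketch below uses five made-up points (purely illustrative), fits the least-squares line with an intercept, and computes the three sums of squares along with R-squared as SSR / SST. The identity SST = SSR + SSE holds for a least-squares fit that includes an intercept.

```python
# Five made-up observations, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

n = len(xs)
x_mean = sum(xs) / n
y_mean = sum(ys) / n

# Ordinary least-squares estimates of the slope and intercept.
beta1 = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
         / sum((x - x_mean) ** 2 for x in xs))
beta0 = y_mean - beta1 * x_mean
preds = [beta0 + beta1 * x for x in xs]

sst = sum((y - y_mean) ** 2 for y in ys)             # total variation
ssr = sum((p - y_mean) ** 2 for p in preds)          # explained by the line
sse = sum((y - p) ** 2 for y, p in zip(ys, preds))   # left in the residuals

r_squared = ssr / sst
print(f"SST={sst:.2f} SSR={ssr:.2f} SSE={sse:.2f} R^2={r_squared:.2f}")
```

Here R-squared comes out as the share of the total variation captured by the fitted line, which is the quantity the notes interpret as "variation in y explained by x".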
If the correlation comes out as 0.99, be suspicious: the model is most probably wrong, or the data set
was created artificially (fake data).
As a rule of thumb, a proper real-world data set will have a correlation of around 0.5-0.75.

R-squared = 0.99
This means that 99% of the variation in y is explained by x (the explanatory variable used in the
model).
Homoscedasticity (a constant spread of errors) is preferred when plotting predicted against actual values.
t-test - used to test whether the relationship between two variables is statistically significant
The right graph is homoscedastic.
In the case of the left graph, you need to treat (transform) the data so that the pattern in the plot is
eliminated.
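Without the graphs, the idea can still be illustrated numerically. The sketch below is a rough check, not a formal test: it sorts the residuals by predicted value, splits them in half, and compares the mean squared residual of the two halves. A ratio near 1 suggests constant spread; a ratio far from 1 suggests a pattern. The function name `spread_ratio` and both residual series are made up for illustration.

```python
def spread_ratio(preds, residuals):
    """Mean squared residual of the upper half of predictions
    divided by that of the lower half."""
    pairs = sorted(zip(preds, residuals))
    half = len(pairs) // 2
    low = [r ** 2 for _, r in pairs[:half]]
    high = [r ** 2 for _, r in pairs[half:]]
    return (sum(high) / len(high)) / (sum(low) / len(low))

preds = [float(p) for p in range(1, 11)]

# Constant spread: residuals alternate between +1 and -1.
homo_residuals = [1.0 if i % 2 == 0 else -1.0 for i in range(10)]

# Growing spread: residual size is proportional to the prediction.
hetero_residuals = [0.2 * p * (1 if i % 2 == 0 else -1)
                    for i, p in enumerate(preds)]

ratio_homo = spread_ratio(preds, homo_residuals)
ratio_hetero = spread_ratio(preds, hetero_residuals)
print(f"homoscedastic ratio={ratio_homo:.2f}, "
      f"heteroscedastic ratio={ratio_hetero:.2f}")
```

When the ratio is far from 1, a common treatment is to transform the variables (for example, taking logs) and refit, then check the spread again.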
When x is categorical, it must be dummy coded (converted to 0/1 indicator variables) before it is used in the model.
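A minimal sketch of dummy coding in plain Python, assuming a hypothetical `city` variable. One level is dropped as the reference category so that the indicators are not perfectly collinear with the intercept; the helper name `dummy_code` is made up for this example.

```python
def dummy_code(values, drop_first=True):
    """Turn a list of category labels into rows of 0/1 indicators.

    With drop_first=True the alphabetically first level becomes the
    reference category and gets no column of its own.
    """
    levels = sorted(set(values))
    kept = levels[1:] if drop_first else levels
    return [{f"is_{level}": int(v == level) for level in kept}
            for v in values]

# Hypothetical categorical predictor.
cities = ["Delhi", "Mumbai", "Chennai", "Mumbai"]
encoded = dummy_code(cities)

for city, row in zip(cities, encoded):
    print(city, row)
```

A row with all indicators equal to 0 represents the reference category (here "Chennai"), and each coefficient on an indicator is then read as the difference from that reference.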
Logistic Regression
1. Classification
2. Discrete choices
3. Class probability
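The link between class probability and classification can be sketched as follows: logistic regression passes the linear score β0 + β1x through the sigmoid function to get a probability, and a threshold turns that probability into a class. The coefficients (-4 and 1) and the 0.5 threshold below are assumed values for illustration, not estimates from any data in these notes.

```python
import math

def class_probability(x, beta0, beta1):
    """P(Y = 1 | x) under a logistic model with one predictor."""
    z = beta0 + beta1 * x
    return 1.0 / (1.0 + math.exp(-z))

def classify(x, beta0, beta1, threshold=0.5):
    """Label x as class 1 when its probability reaches the threshold."""
    return 1 if class_probability(x, beta0, beta1) >= threshold else 0

# Assumed coefficients, for illustration only.
BETA0, BETA1 = -4.0, 1.0

for x in (2.0, 4.0, 6.0):
    p = class_probability(x, BETA0, BETA1)
    print(f"x={x}: P(Y=1)={p:.3f}, class={classify(x, BETA0, BETA1)}")
```

Unlike the linear model, the output here always lies between 0 and 1, which is what makes it usable as a class probability.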
Covered up - ‘the model’
Covered up - ‘as positive’
Techniques to be used:
1. Regression analysis
2. Time series analysis
