Linear Regression

Linear Regression estimates the coefficients of the linear equation, involving one or more independent variables, that best predict the value of the dependent variable. For example, you can predict a salesperson's total yearly sales (the dependent variable) from independent variables such as age, education, and years of experience.

Example. Is the number of games won by a basketball team in a season related to the average number of points the team scores per game? A scatterplot indicates that these variables are linearly related. The number of games won and the average number of points scored by the opponent are also linearly related. These variables have a negative relationship: as the number of games won increases, the average number of points scored by the opponent decreases. With linear regression, you can model the relationship of these variables. A good model can be used to predict how many games teams will win.
The linear regression model assumes that there is a linear, or "straight line," relationship between the dependent variable and each predictor. This relationship is described in the following formula:

y_i = b_0 + b_1*x_i1 + ... + b_p*x_ip + e_i

where
y_i is the value of the ith case of the dependent scale variable
p is the number of predictors
b_j is the value of the jth coefficient, j = 0, ..., p
x_ij is the value of the ith case of the jth predictor
e_i is the error in the observed value for the ith case

The model is linear because increasing the value of the jth predictor by 1 unit increases the value of the dependent variable by b_j units. Note that b_0 is the intercept, the model-predicted value of the dependent variable when the value of every predictor is equal to 0.
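The formula above can be sketched numerically. The following is a minimal illustration, assuming NumPy is available; the data are synthetic (not the Nambe Mills data), with a known intercept and slope so the fit can be checked against the truth.

```python
import numpy as np

# Synthetic data for illustration: one predictor,
# true intercept b0 = 2.0 and true slope b1 = 3.0.
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=200)
y = 2.0 + 3.0 * x + rng.normal(0, 0.5, size=200)

# Fit y = b0 + b1*x by ordinary least squares.
X = np.column_stack([np.ones_like(x), x])  # design matrix with intercept column
b, *_ = np.linalg.lstsq(X, y, rcond=None)
b0, b1 = b

# Increasing x by 1 unit changes the model-predicted value by b1 units.
print(b0, b1)
```

The estimated coefficients should land close to the true values 2.0 and 3.0, which is the sense in which the fitted line "best predicts" the dependent variable.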
For the purpose of testing hypotheses about the values of model parameters, the linear regression model also assumes the following:
The error term has a normal distribution with a mean of 0.
The variance of the error term is constant across cases and independent of the variables in the model. An error term with non-constant variance is said to be heteroscedastic.
The value of the error term for a given case is independent of the values of the variables in the model and of the values of the error term for other cases.
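The constant-variance assumption can be probed with a rough numeric check: correlate the absolute residuals with the fitted values. This is a simplified sketch on synthetic data, assuming NumPy; it is not a formal heteroscedasticity test such as Breusch-Pagan.

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.uniform(1, 10, 500)

def abs_resid_corr(y):
    # Fit a line, then correlate |residual| with the fitted value.
    X = np.column_stack([np.ones_like(x), x])
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    fitted = X @ b
    resid = y - fitted
    return np.corrcoef(fitted, np.abs(resid))[0, 1]

# Constant error variance (homoscedastic): correlation near 0.
y_const = 1.0 + 2.0 * x + rng.normal(0, 1.0, x.size)

# Error variance growing with x (heteroscedastic): clear positive correlation.
y_grow = 1.0 + 2.0 * x + rng.normal(0, 0.3 * x)

print(abs_resid_corr(y_const), abs_resid_corr(y_grow))
```

A correlation well away from 0 suggests the error variance depends on the predicted value, violating the assumption.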
Example The Nambe Mills company has a line of metal tableware products that require a polishing step in the manufacturing process. To help plan the production schedule, the polishing times for 59 products were recorded, along with the product type and the relative sizes of these products, measured in terms of their diameters. We can use linear regression to determine whether the polishing time can be predicted by product size. Before running the regression, we should examine a scatterplot of polishing time by product size to determine whether a linear model is reasonable for these variables.
To produce a scatterplot of time by diam, from the menus choose: Graphs > Scatter/Dot...
Click Define.
Select time as the y variable and diam as the x variable. Click OK. These selections produce the scatterplot.
To see a best-fit line overlaid on the points in the scatterplot, activate the graph by double-clicking on it. Select a point in the Chart Editor. Click the Add fit line tool, then close the Chart Editor.
The resulting scatterplot appears to be suitable for linear regression, with two possible causes for concern.
To run a linear regression analysis, from the menus choose: Analyze > Regression > Linear...
Select time as the dependent variable. Select diam as the independent variable. Select type as the case labeling variable. Click Plots.
Select *SDRESID as the y variable and *ZPRED as the x variable. Select Histogram and Normal probability plot. Click Continue. Click Save in the Linear Regression dialog box.
Select Standardized in the Predicted Values group. Select Standardized in the Residuals group. Click Continue. Click OK in the Linear Regression dialog box.
These selections produce a linear regression model for polishing time based on diameter. Diagnostic plots of the Studentized residuals by the model-predicted values are requested, and various values are saved for further diagnostic testing.
Coefficients
This table shows the coefficients of the regression line.
It states that the expected polishing time is equal to 3.457 * DIAM - 1.955. If Nambe Mills plans to manufacture a 15-inch casserole, the predicted polishing time would be 3.457 * 15 - 1.955 = 49.9, or about 50 minutes.
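The arithmetic above can be reproduced directly. The coefficients here are the ones reported in the text; the helper function name is just for illustration.

```python
# Coefficients from the coefficients table: slope 3.457, intercept -1.955.
slope, intercept = 3.457, -1.955

def predicted_polishing_time(diam):
    """Model-predicted polishing time (minutes) for a product of the given diameter (inches)."""
    return slope * diam + intercept

# A 15-inch casserole: 3.457 * 15 - 1.955 = 49.9, or about 50 minutes.
print(round(predicted_polishing_time(15), 1))
```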
The regression and residual sums of squares are approximately equal, which indicates that about half of the variation in polishing time is explained by the model. The significance value of the F statistic is less than 0.05, which means that the variation explained by the model is not due to chance. While the ANOVA table is a useful test of the model's ability to explain any variation in the dependent variable, it does not directly address the strength of that relationship.
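The quantities in the ANOVA table fit together as follows: the total sum of squares splits into a regression part and a residual part, and the F statistic is the ratio of their mean squares. This is a sketch on synthetic data assuming NumPy, mimicking the example's 59 cases; the actual table values come from the SPSS output.

```python
import numpy as np

# Synthetic stand-in for the polishing-time data: 59 cases, one predictor.
rng = np.random.default_rng(2)
x = rng.uniform(5, 20, 59)
y = 3.5 * x - 2.0 + rng.normal(0, 13, x.size)

X = np.column_stack([np.ones_like(x), x])
b, *_ = np.linalg.lstsq(X, y, rcond=None)
fitted = X @ b
n, p = len(y), 1  # cases, predictors

ss_total = np.sum((y - y.mean()) ** 2)   # total variation in y
ss_resid = np.sum((y - fitted) ** 2)     # variation left unexplained
ss_regr = ss_total - ss_resid            # variation explained by the model

# F = (regression mean square) / (residual mean square)
F = (ss_regr / p) / (ss_resid / (n - p - 1))
print(ss_regr, ss_resid, F)
```

A large F (with a small significance value) says the explained variation is unlikely to be due to chance, which is exactly what the ANOVA table tests.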
The model summary table reports the strength of the relationship between the model and the dependent variable. R, the multiple correlation coefficient, is the linear correlation between the observed and model-predicted values of the dependent variable. Its large value indicates a strong relationship.
R Square, the coefficient of determination, is the squared value of the multiple correlation coefficient. It shows that about half the variation in time is explained by the model.
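The relationship between R and R Square can be verified numerically: for a least-squares fit with an intercept, the squared correlation between observed and predicted values equals the proportion of variance explained. A small sketch on synthetic data, assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(3)
x = rng.uniform(0, 10, 100)
y = 1.0 + 2.0 * x + rng.normal(0, 3, x.size)

X = np.column_stack([np.ones_like(x), x])
b, *_ = np.linalg.lstsq(X, y, rcond=None)
fitted = X @ b

# R: linear correlation between observed and model-predicted values.
R = np.corrcoef(y, fitted)[0, 1]

# R Square: proportion of the variation in y explained by the model.
r2 = 1 - np.sum((y - fitted) ** 2) / np.sum((y - y.mean()) ** 2)

print(R, r2)
```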
As a further measure of the strength of the model fit, compare the standard error of the estimate in the model summary table to the standard deviation of time reported in the descriptive statistics table. Without prior knowledge of the diameter of a new product, our best guess for the polishing time would be about 35.8 minutes, with a standard deviation of 19.0. With the linear regression model, the error of your estimate is considerably lower, about 13.7.
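The two quantities being compared can be computed side by side: the standard deviation of the dependent variable (the typical error when guessing without the model) and the standard error of the estimate (the typical error when using the model). A sketch on synthetic data, assuming NumPy; the 35.8/19.0/13.7 figures come from the SPSS tables.

```python
import numpy as np

# Synthetic stand-in for the polishing-time data.
rng = np.random.default_rng(4)
x = rng.uniform(5, 20, 59)
y = 3.5 * x - 2.0 + rng.normal(0, 13, x.size)

X = np.column_stack([np.ones_like(x), x])
b, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ b
n, p = len(y), 1

sd_y = y.std(ddof=1)                             # typical error without the model
see = np.sqrt(np.sum(resid ** 2) / (n - p - 1))  # standard error of the estimate
print(sd_y, see)
```

Whenever the predictor carries real information, the standard error of the estimate comes out smaller than the raw standard deviation, which is the point of the comparison.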
A residual is the difference between the observed and model-predicted values of the dependent variable. The residual for a given product is the observed value of the error term for that product. A histogram or P-P plot of the residuals will help you to check the assumption of normality of the error term. The shape of the histogram should approximately follow the shape of the normal curve. This histogram is acceptably close to the normal curve.
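Alongside the visual check, a crude numeric check of residual normality is possible: standardize the residuals and compare the share falling within two standard deviations to the roughly 95% expected under normality. A sketch on synthetic data with normal errors by construction, assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(5)
x = rng.uniform(0, 10, 1000)
y = 1.0 + 2.0 * x + rng.normal(0, 1.0, x.size)  # normal errors by construction

X = np.column_stack([np.ones_like(x), x])
b, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ b

# Standardize the residuals.
z = (resid - resid.mean()) / resid.std(ddof=1)

# Under normality, about 95% of standardized residuals lie within +/- 2.
share_within_2 = np.mean(np.abs(z) < 2)
print(share_within_2)
```

A share far from 0.95 would be a warning sign, just as a histogram that departs visibly from the normal curve would be.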