Practical 6 Multiple Linear Regression Using SPSS

This document outlines the process of developing a multiple linear regression model in SPSS to predict monthly revenue from several independent variables. It discusses key metrics for evaluating regression performance, including R-squared, RMSE, and MAE, and highlights issues such as multicollinearity and non-normality of residuals. Ultimately, a simplified model using only the average room price variable is recommended, because the other variables are insignificant and the simplified model satisfies the regression assumptions.


Multiple Linear Regression Using SPSS


Assumptions of multiple linear regression
• Linearity of the relationship between the dependent and independent variables
• Independence of residuals
• Homoscedasticity (constant variance of residuals)
• Normality of residuals
• No multicollinearity among the independent variables
Metrics to evaluate regression
• 1. R-squared (R²):
• Definition: Measures the proportion of the variance in the dependent variable that is predictable from the independent variables (see the formula below).
• Range: 0 to 1.
• Higher is better: values closer to 1 indicate better model performance.
• Limitations:
• Does not convey the magnitude of prediction errors.
• Sensitive to overfitting (especially in high-dimensional datasets).
• Does not handle nonlinear relationships well unless the variables are transformed appropriately.
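For reference, R-squared can be written in terms of the residual and total sums of squares, where y_i are the actual values, ŷ_i the predictions, and ȳ the mean of the actual values:

R^2 = 1 - \frac{SS_{res}}{SS_{tot}} = 1 - \frac{\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}{\sum_{i=1}^{n} (y_i - \bar{y})^2}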
Metrics to evaluate regression…
2. Root Mean Squared Error (RMSE):
• Definition: The square root of the average squared difference between predicted and actual values (see the formula below).
• Range: 0 to ∞.
• Lower is better: a smaller RMSE indicates better model performance.
• Strengths:
• Penalizes large errors more than small ones (due to squaring).
• Useful when large deviations are more concerning.
• Limitations: Sensitive to outliers due to squaring.
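In symbols, following the definition above:

RMSE = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2}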
Metrics to evaluate regression…
• 3. Mean Absolute Error (MAE):
• Definition: The average of the absolute differences between predicted and actual values (see the formula below).
• Range: 0 to ∞.
• Lower is better: a smaller MAE indicates better model performance.
• Strengths:
• Provides a straightforward measure of average error magnitude.
• Less sensitive to outliers compared to RMSE.
• Limitations: Does not account for the variance in error magnitudes.
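In symbols, following the definition above:

MAE = \frac{1}{n} \sum_{i=1}^{n} \left| y_i - \hat{y}_i \right|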
Dataset

Problem Statement: Develop a regression model to predict monthly revenue based on average room price, guest satisfaction score, and marketing expenses.
Enter data
• Input the dataset.
• Define the variable names Monthly_Revenue, Avg_Room_Price, Guest_Satisfaction, and Marketing_Expenses in the "Variable View" tab.
• Enter the dataset in the "Data View" tab.
• Go to Analyze > Regression > Linear.
• Select Monthly Revenue as the dependent variable.
• Select Average Room Price, Guest Satisfaction Score, and Marketing Expenses as independent variables (equivalent syntax is sketched below).
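The same model can also be requested through an SPSS syntax window instead of the dialogs. A minimal sketch, assuming the variable names defined above:

  * Minimal regression command; default statistics apply.
  REGRESSION
    /DEPENDENT Monthly_Revenue
    /METHOD=ENTER Avg_Room_Price Guest_Satisfaction Marketing_Expenses.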
Statistics Options
• Click on Statistics.
• Select Collinearity diagnostics to test the multicollinearity assumption.
• Under the Residuals options, select Durbin-Watson to test the independence-of-residuals assumption (see the matching subcommands below).
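In the pasted syntax, these dialog choices correspond to the following REGRESSION subcommands (COLLIN and TOL request the collinearity diagnostics; DURBIN requests the Durbin-Watson statistic):

  /STATISTICS COEFF OUTS R ANOVA COLLIN TOL
  /RESIDUALS DURBIN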
Plots options
• Click on Plots.
• Add *ZRESID (standardized residuals) to the Y axis.
• Add *ZPRED (standardized predicted values) to the X axis.
• This generates the scatter plot used to check the homoscedasticity assumption.
• Select Histogram and Normal probability plot; these test the normality of the residuals (see the matching subcommands below).
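In the pasted syntax, these plot choices correspond to:

  /SCATTERPLOT=(*ZRESID ,*ZPRED)
  /RESIDUALS HISTOGRAM(ZRESID) NORMPROB(ZRESID)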
Save options
• Click on the Save button.
• Under Predicted Values, select Unstandardized. This saves the regression's predicted values as a new variable in Data View.
• Under Residuals, select Unstandardized. This saves the errors/residuals to the data.
• Click Continue, then click OK.
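Putting all of the dialog choices together, the pasted syntax should look roughly like the sketch below (variable names as defined above; by default, SPSS names the saved predicted values PRE_1 and the saved residuals RES_1):

  REGRESSION
    /STATISTICS COEFF OUTS R ANOVA COLLIN TOL
    /DEPENDENT Monthly_Revenue
    /METHOD=ENTER Avg_Room_Price Guest_Satisfaction Marketing_Expenses
    /SCATTERPLOT=(*ZRESID ,*ZPRED)
    /RESIDUALS DURBIN HISTOGRAM(ZRESID) NORMPROB(ZRESID)
    /SAVE PRED RESID.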
Output

Our model's R-square is 0.984, which indicates that 98.4% of the variation in monthly revenue is explained by the independent variables. This suggests a very good fit.
Durbin-Watson Test for Independence of Residuals

From the previous slide, the Durbin-Watson value is 1.448. Because this falls below the commonly used lower bound of 1.5, it suggests positive autocorrelation among the residuals.
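For reference, the Durbin-Watson statistic is computed from successive residuals e_t. It ranges from 0 to 4; values near 2 indicate no autocorrelation, and values well below 2 suggest positive autocorrelation:

d = \frac{\sum_{t=2}^{n} (e_t - e_{t-1})^2}{\sum_{t=1}^{n} e_t^2}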
ANOVA for Regression

The ANOVA (Analysis of Variance) table in regression analysis evaluates whether the regression model explains a significant portion of the variation in the dependent variable. It essentially tests the null hypothesis that all regression coefficients (except the intercept) are zero.

Since the null hypothesis is rejected, we can say that at least one regression coefficient is nonzero.
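The F statistic in the ANOVA table is the ratio of explained to unexplained variance, with k predictors and n observations:

F = \frac{SS_{reg}/k}{SS_{res}/(n - k - 1)} = \frac{MS_{reg}}{MS_{res}}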
T-test for the regression coefficients

In regression analysis, the t-test for regression coefficients evaluates whether each independent variable significantly contributes to predicting the dependent variable. Specifically, it tests the null hypothesis (H0) that a coefficient (β) is equal to zero, indicating no effect.

Only average room price contributes significantly to predicting the dependent variable. We can remove the guest satisfaction and marketing expenses variables.

The VIF (Variance Inflation Factor) is high (>10) for Avg_Room_Price and Marketing_Expenses. Hence, multicollinearity exists here (see the formulas below).
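For reference, the t statistic for each coefficient and the VIF for each predictor are computed as follows, where R_j^2 is the R-squared from regressing predictor j on the remaining predictors; VIF values above 10 are commonly taken to indicate multicollinearity:

t_j = \frac{\hat{\beta}_j}{SE(\hat{\beta}_j)}, \qquad VIF_j = \frac{1}{1 - R_j^2}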
Correlation between independent variables

Since the correlations are high, we can say that multicollinearity exists in the data.
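The correlation matrix can be produced via Analyze > Correlate > Bivariate, or with syntax (a sketch, using the variable names defined earlier):

  CORRELATIONS
    /VARIABLES=Avg_Room_Price Guest_Satisfaction Marketing_Expenses.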
Normality Testing

The residual histogram and normal probability plot indicate that the residuals are not normally distributed.


Testing Normality of Residuals Using Statistical Tests

The residuals are not normally distributed, as indicated by both statistical tests.
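The tests referred to here are presumably the Kolmogorov-Smirnov and Shapiro-Wilk tests, which SPSS produces via Analyze > Descriptive Statistics > Explore. A syntax sketch, assuming the residuals were saved as RES_1:

  * NPPLOT requests the normality plots and the tests-of-normality table.
  EXAMINE VARIABLES=RES_1
    /PLOT NPPLOT.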


Test for homoscedasticity

All the points in this scatter plot appear randomly distributed, with no discernible pattern. Therefore, the residuals exhibit homoscedasticity.
Regression line

The line of regression is Y = 2569.615 + 19.845 × X1 + 29.883 × X2 + 0.111 × X3, where X1 is the average room price, X2 is the guest satisfaction score, and X3 is the marketing expenses.
Evaluate the model
• Calculate the following model evaluation metrics (one way to compute them in SPSS is sketched below):
• R² = 0.983
• RMSE = 56.05
• MAE = 38.56
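SPSS does not report RMSE and MAE directly for linear regression, but both can be derived from the saved residuals. A sketch, assuming the residuals were saved as RES_1: MAE is the mean of the absolute errors, and RMSE is the square root of the mean of the squared errors reported by DESCRIPTIVES.

  * Derive absolute and squared errors from the saved residuals.
  COMPUTE abs_err = ABS(RES_1).
  COMPUTE sq_err = RES_1**2.
  EXECUTE.
  * Mean of abs_err = MAE; square root of mean of sq_err = RMSE.
  DESCRIPTIVES VARIABLES=abs_err sq_err
    /STATISTICS=MEAN.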
Assumptions violated
• Independence of residuals (Durbin-Watson = 1.448).
• No multicollinearity (high VIFs and high correlations between predictors).
• Normality of residuals.
Build a new regression model
• Here we will use a model with the average room price variable only, as the contributions of the other variables were found to be insignificant (a syntax sketch follows).
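The simplified model can be fitted in the same way, dropping the insignificant predictors. A sketch, with the same variable names and diagnostic options as before:

  REGRESSION
    /STATISTICS COEFF OUTS R ANOVA
    /DEPENDENT Monthly_Revenue
    /METHOD=ENTER Avg_Room_Price
    /SCATTERPLOT=(*ZRESID ,*ZPRED)
    /RESIDUALS DURBIN HISTOGRAM(ZRESID) NORMPROB(ZRESID)
    /SAVE PRED RESID.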
Check R-square and Durbin-Watson Test

The R-square is 0.98, and the Durbin-Watson statistic falls within the acceptable 1.5 to 2.5 range; hence, the residuals are independent of each other.
Test for Homoscedasticity

No pattern exists in the scatter plot; hence, the homoscedasticity assumption is satisfied.


Normality Test for Residuals

The residuals are normally distributed, as the p-value is greater than 0.05.
Regression line

The regression model is Y = 2473.982 + 25.340 × X1, where X1 is the average room price variable.
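As an illustration, with a hypothetical average room price of 100 (in the dataset's units), the predicted monthly revenue would be:

\hat{Y} = 2473.982 + 25.340 \times 100 = 5007.982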
Conclusions
• R-square is 0.98.
• RMSE = 62.47
• MAE = 47.13
• No multicollinearity, as there is only a single independent variable.
• RMSE and MAE are higher than for the model built with three variables.
• All of the regression assumptions are satisfied by the single-predictor model, although its RMSE and MAE values are higher.
A few other concepts related to models
• Training data
• Test data
• Overfitting
• Underfitting
