Example for Regression

Uploaded by

Ketan Jagtap

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Example for Regression

Uploaded by

Ketan Jagtap

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Problem 7 (Refer Past year TEE of SIDM 2022)

Fourteen Twenty-Two Food Stores, Inc., is planning to expand its convenience store chain.
To aid in selecting locations for the new stores, it has collected weekly sales data from each
of its 23 stores. To help explain the variability in weekly sales, it has also collected
information describing four variables that it believes are related to sales. The variables are
defined as follows:
SALES: average weekly sales for each store in thousands of dollars
AUTOS: average weekly auto traffic volume in thousands of cars
ENTRY: ease of entry/exit measures on a scale of 1 to 100
ANNINC: average annual household income for the area in thousands of dollars
DISTANCE: distance in miles from the store to the nearest supermarket

The data were analyzed and the output follows:

SUMMARY OUTPUT

Regression Statistics
Multiple 0.915034
R Square 0.837288
Adjusted R
Square 0.186441
Standard
Error 0.73646
Observations 23

ANOVA
Significanc
df SS MS F e
Regression 4 2861495 715374 102.39 0.002
Residual 18 125761 6896.7
Total 22 2987256

Coefficient Standard
s deviation t Stat P-value
Intercept 175.37 92.62 1.89 0.075
AUTOS -0.028 0.315 -0.09 0.929
ENTRY 3.775 1.272 2.97 0.008
ANNINC 1.990 4.510 0.44 0.664
DISTANCE 212.41 28.090 7.56 0.000

a. Is the regression significant as a whole? Why or why not?

b. Provide the best fitting regression equation for the sales of Fourteen Twenty-Two Food
Stores, Inc.
c. Comment on the significance of each predictor. How would it impact the sales? What
measures should be taken by the store manager to have a successful run of the new stores?
d. How do you rate the predictive power of the model? Is it sufficient for generalization of the
model? Discuss.
e. The residuals do not follow homoscedasticity. Would it be a major decision-making factor
in the opening of the store? Explain with proper reasoning and data support.
Synoptic
a. Is the regression significant as a whole? Why or why not?

Yes, the regression is significant as a whole. This is indicated by the F-statistic obtained from
the ANOVA table, which tests the overall significance of the regression model. The F-
statistic value is 102.39, and the associated p-value is 0.002. Since the p-value is less than the
conventional significance level of 0.05, we can reject the null hypothesis, which states that
there is no significant relationship between the predictor variables (AUTOS, ENTRY,
ANNINC, and DISTANCE) and the sales. The low p-value suggests that at least one of the
predictor variables has a significant effect on sales, justifying the overall significance of the
regression model.

b. Provide the best fitting regression equation for the sales of Fourteen Twenty-Two
Food Stores, Inc.
The regression equation is given by: SALES = 175.37 - 0.028(AUTOS) + 3.775(ENTRY) +
1.990(ANNINC) + 212.41(DISTANCE)
c. Comment on the significance of each predictor. How would it impact the sales? What
measures should be taken by the store manager to have a successful run of the new
stores?

The significance of each predictor can be determined by examining their individual p-values:
 AUTOS: The coefficient for AUTOS is -0.028, and its p-value is 0.929. The p-value
> 0.05, indicating that the average weekly auto traffic volume (AUTOS) is not a
significant predictor of sales. Changes in auto traffic are unlikely to have a significant
impact on sales.
 ENTRY: The coefficient for ENTRY is 3.775, and its p-value is 0.008. The p-value <
0.05, suggesting that the ease of entry/exit measures (ENTRY) is a significant
predictor of sales. A higher ENTRY score (easier access to the store) positively
affects sales. The store manager should prioritize locations with easier entry and exit
measures to potentially boost sales.
 ANNINC: The coefficient for ANNINC is 1.990, and its p-value is 0.664. The p-
value > 0.05, indicating that the average annual household income for the area
(ANNINC) is not a significant predictor of sales. In this case, the average household
income does not have a significant impact on sales.
 DISTANCE: The coefficient for DISTANCE is 212.41, and its p-value is 0.000. The
p-value is much less than 0.05, indicating that the distance from the store to the
nearest supermarket (DISTANCE) is a highly significant predictor of sales. The
negative coefficient suggests that as the distance to the nearest supermarket decreases,
sales increase. Store locations closer to supermarkets are likely to attract more
customers and generate higher sales.

To have a successful run of the new stores, the store manager should focus on selecting
locations with easier access (lower ENTRY scores) and proximity to supermarkets (lower
DISTANCE). These factors have the most significant impact on sales, as indicated by the
regression analysis.

d. How do you rate the predictive power of the model? Is it sufficient for generalization
of the model? Discuss.
The model's predictive power can be assessed using the R-squared value, which is a
measure of how well the model explains the variability in the dependent variable (SALES)
based on the independent variables (AUTOS, ENTRY, ANNINC, and DISTANCE).
The R-squared value of the model is 0.837288, which means approximately 83.73% of the
variability in weekly sales can be explained by the predictor variables. This is a relatively
high R-squared value, indicating that the model fits the data well and the predictor variables
collectively have a strong relationship with sales.
However, it's important to note that the adjusted R-squared value is 0.186441, which is
considerably lower than the R-squared value. The adjusted R-squared takes into account the
number of predictor variables and penalizes the model for including irrelevant or redundant
variables. The large difference between R-squared and adjusted R-squared suggests that
some of the predictor variables (AUTOS, ANNINC) might not be adding much value to
the model's predictive power.
Regarding the generalization of the model, it's important to be cautious. While the model
shows a strong relationship between the predictor variables and sales based on the available
data, generalization to new stores and locations requires additional validation and testing. The
model should be tested on new data from different stores and locations to assess its
performance and predictive accuracy before making business decisions based solely on the
current model.
e. The residuals do not follow homoscedasticity. Would it be a major decision-making
factor in the opening of the store? Explain with proper reasoning and data support.

Residuals are the differences between the observed sales values and the predicted sales values
obtained from the regression model. Homoscedasticity refers to the assumption that the
residuals should have a constant variance across all levels of the predictor variables. If
the residuals exhibit non-constant variance (heteroscedasticity), it can affect the reliability
and validity of the regression model's results.
In this case, the Standard Error of the regression model is 0.73646, which provides an
indication of the average deviation of the actual sales values from the predicted values.
However, the presence of heteroscedasticity indicates that the variability of the residuals
changes across different levels of predictor variables, leading to less reliable predictions.

While heteroscedasticity is a concern for the accuracy of the model's predictions, it

might not be a major decision-making factor in the opening of new stores. The reasons
are as follows:
1. The primary focus of the regression analysis is to identify the significant predictors of
sales and establish their relationships. The regression coefficients and their associated
p-values are useful for understanding the direction and significance of these
predictors, regardless of the heteroscedasticity issue.
2. The other statistical measures, such as the F-statistic and R-squared, indicate that the
model has overall significance and explains a considerable amount of the variability
in sales.
3. While heteroscedasticity affects the precision of individual predictions, it does
not invalidate the entire model or its ability to identify the most important
predictors.
4. The model's effectiveness in predicting sales can be assessed through cross-validation
and testing on new data from different store locations. If the model performs well and
maintains its predictive accuracy, despite the heteroscedasticity, it may still be
valuable for making decisions about new store openings.
In summary, while heteroscedasticity is a statistical concern and should be addressed for
better prediction precision, it may not be the deciding factor in the decision to open new
stores. Other aspects, such as the significance of predictors, practical implications of
coefficients, and validation through additional data, play a more critical role in making
informed business decisions.

Goodbelly Marketing Analysis Final
85% (13)
Goodbelly Marketing Analysis Final
32 pages
Mabini Bsa22 Laboratory Activity 5
No ratings yet
Mabini Bsa22 Laboratory Activity 5
7 pages
COMEGE Catalogue FR-En-De Email
No ratings yet
COMEGE Catalogue FR-En-De Email
80 pages
Calza Bsa21 Laboratory Activity 5 2
No ratings yet
Calza Bsa21 Laboratory Activity 5 2
8 pages
Cable Co Audit Write - Up
No ratings yet
Cable Co Audit Write - Up
7 pages
City of Dasmariñas, Cavite: Correlation and Regression Analysis
No ratings yet
City of Dasmariñas, Cavite: Correlation and Regression Analysis
8 pages
Walmart Sales Prediction Using Support Vector Regression and Multivariate Regression
No ratings yet
Walmart Sales Prediction Using Support Vector Regression and Multivariate Regression
5 pages
Walmart_Sales_Prediction_Using_Multiple_Linear_Reg
No ratings yet
Walmart_Sales_Prediction_Using_Multiple_Linear_Reg
6 pages
Revised STATS
No ratings yet
Revised STATS
10 pages
Cantanero Bsa24 Laboratory Activity 5
No ratings yet
Cantanero Bsa24 Laboratory Activity 5
8 pages
Regression Model Development for Revenue Dataset.docx
No ratings yet
Regression Model Development for Revenue Dataset.docx
9 pages
Cantanero - Correlation and Simple Linear Regression Laboratory
No ratings yet
Cantanero - Correlation and Simple Linear Regression Laboratory
31 pages
Rossmann Sales Prediction: Computing For Data Sciences-Final Project
100% (1)
Rossmann Sales Prediction: Computing For Data Sciences-Final Project
46 pages
Report Group 8 Final
No ratings yet
Report Group 8 Final
13 pages
Predicting Sales of Rossman Stores: Machine Learning Engineer Nanodegree
No ratings yet
Predicting Sales of Rossman Stores: Machine Learning Engineer Nanodegree
23 pages
Croq Pain Case Mitchell Zhen
No ratings yet
Croq Pain Case Mitchell Zhen
27 pages
Regression Analysis and Modeling For Decision Support
No ratings yet
Regression Analysis and Modeling For Decision Support
45 pages
Big Data Jury
No ratings yet
Big Data Jury
21 pages
07a Linear Correlation & Regression (I)
No ratings yet
07a Linear Correlation & Regression (I)
2 pages
BAN 602 - Project5
No ratings yet
BAN 602 - Project5
4 pages
5. BRM Session 5 Slides
No ratings yet
5. BRM Session 5 Slides
19 pages
Sales Prediction of Walmart Based On Regression Models: Abstract
No ratings yet
Sales Prediction of Walmart Based On Regression Models: Abstract
10 pages
Group 13 - Term Project
No ratings yet
Group 13 - Term Project
18 pages
Quantitative Techniques
No ratings yet
Quantitative Techniques
11 pages
Sales Analysis of Walmart Data: Mayank Gupta, Prerana Ghosh, Deepti Bahel, Anantha Venkata Sai Akhilesh Karumanchi
No ratings yet
Sales Analysis of Walmart Data: Mayank Gupta, Prerana Ghosh, Deepti Bahel, Anantha Venkata Sai Akhilesh Karumanchi
10 pages
Assignment 2 - Maria Espinoza
No ratings yet
Assignment 2 - Maria Espinoza
14 pages
Business Intelligence Questions, Analytical & Reporting Hint
From Everand
Business Intelligence Questions, Analytical & Reporting Hint
Dr. Zemelak Goraga
No ratings yet
Linear_Regression_datascience_basit.pdf
No ratings yet
Linear_Regression_datascience_basit.pdf
19 pages
KhanhNgo_1677046_Critical Thinking Exercise-Real Estate
No ratings yet
KhanhNgo_1677046_Critical Thinking Exercise-Real Estate
10 pages
Correlation-and-Regression-Handout-1
No ratings yet
Correlation-and-Regression-Handout-1
7 pages
Walmart Capstone Project
No ratings yet
Walmart Capstone Project
46 pages
EViews and Regression Analysis
No ratings yet
EViews and Regression Analysis
9 pages
Ambudheesh Assignment QTMD
No ratings yet
Ambudheesh Assignment QTMD
4 pages
Critical Thinking Exercise-Real Estate
No ratings yet
Critical Thinking Exercise-Real Estate
11 pages
Primer - Regression
No ratings yet
Primer - Regression
39 pages
Data Analysis
100% (1)
Data Analysis
28 pages
Chapter 10 Regression Slides
No ratings yet
Chapter 10 Regression Slides
46 pages
Salesgrowt H
No ratings yet
Salesgrowt H
10 pages
SAM
No ratings yet
SAM
7 pages
Correlation: Assignment 3 - Correlation and Regression Analysis
No ratings yet
Correlation: Assignment 3 - Correlation and Regression Analysis
7 pages
Regression Analysis in R
No ratings yet
Regression Analysis in R
7 pages
Test Exercise 1 (Assignment-1)
No ratings yet
Test Exercise 1 (Assignment-1)
4 pages
BDM 2 - 15 Dec 2009
No ratings yet
BDM 2 - 15 Dec 2009
6 pages
Presentation Business Applications
No ratings yet
Presentation Business Applications
18 pages
Business
No ratings yet
Business
2 pages
Multiple Regression Analysis
No ratings yet
Multiple Regression Analysis
12 pages
Forecasting 2nd III 17
No ratings yet
Forecasting 2nd III 17
4 pages
How to Optimise Your Supply Chain to Make Your Firm Competitive!
From Everand
How to Optimise Your Supply Chain to Make Your Firm Competitive!
Andrei Besedin
1/5 (1)
Rossmann Sales Prediction Presentation
No ratings yet
Rossmann Sales Prediction Presentation
35 pages
APznzaYfP - U5jLRZ2TpLX7kHt2tkiVK6jNpWwQHkAlnUuO J tiqe5P61S5P0hPVDo5SkjCoz1ia4cBQBtMl27e6jpHqSi8ZbhT9e2oWWI - lk7T9 L lvGLt7EoZ2Qh7wBdPZEObjjyPQzsw7OX36Qv2r4bHmLcDWz21t57GVYWFcREh8 - 6wq8sbrsDXQjKla
No ratings yet
APznzaYfP - U5jLRZ2TpLX7kHt2tkiVK6jNpWwQHkAlnUuO J tiqe5P61S5P0hPVDo5SkjCoz1ia4cBQBtMl27e6jpHqSi8ZbhT9e2oWWI - lk7T9 L lvGLt7EoZ2Qh7wBdPZEObjjyPQzsw7OX36Qv2r4bHmLcDWz21t57GVYWFcREh8 - 6wq8sbrsDXQjKla
46 pages
S02 - Regression Modelling
No ratings yet
S02 - Regression Modelling
17 pages
EPBM 05 Group 6 assignment
No ratings yet
EPBM 05 Group 6 assignment
40 pages
Team3assignment1 1
No ratings yet
Team3assignment1 1
5 pages
Cases
No ratings yet
Cases
7 pages
Arun_27072021_Predictive_Modeling.pdf
No ratings yet
Arun_27072021_Predictive_Modeling.pdf
33 pages
Regression Interpretation
No ratings yet
Regression Interpretation
2 pages
Quiz Solutions
No ratings yet
Quiz Solutions
6 pages
A Mini Project Report On: "Big Mart Sales Prediction" by
67% (3)
A Mini Project Report On: "Big Mart Sales Prediction" by
23 pages
Dat Sol 2
No ratings yet
Dat Sol 2
32 pages
Asdm GJ23NS005
No ratings yet
Asdm GJ23NS005
13 pages
KhanhNguyen-Critical Thinking Exercise-Real Estate
No ratings yet
KhanhNguyen-Critical Thinking Exercise-Real Estate
6 pages
DT103v Presentation v2 1698948303809001JlIr
No ratings yet
DT103v Presentation v2 1698948303809001JlIr
22 pages
VW 6-Speed Automatic Gearbox 09G
No ratings yet
VW 6-Speed Automatic Gearbox 09G
116 pages
Threat Intelligence Market
No ratings yet
Threat Intelligence Market
8 pages
SPM AddMath Trigonometric Functions
No ratings yet
SPM AddMath Trigonometric Functions
10 pages
MA3251 -QUESTION BANK-IA2
No ratings yet
MA3251 -QUESTION BANK-IA2
7 pages
Profession-of-Engineering-Hoover-1951
No ratings yet
Profession-of-Engineering-Hoover-1951
2 pages
Guitguit, Jazmine B. - Module 3
No ratings yet
Guitguit, Jazmine B. - Module 3
3 pages
Lời Giải Chi Tiết SOME OTHER TOPICS
No ratings yet
Lời Giải Chi Tiết SOME OTHER TOPICS
13 pages
Camloc Ram H
No ratings yet
Camloc Ram H
65 pages
Armstrong Nature of The Mind
No ratings yet
Armstrong Nature of The Mind
23 pages
Actuarial Science in Kenya - A Student's Perspective
No ratings yet
Actuarial Science in Kenya - A Student's Perspective
4 pages
2-Solved Problems - Water Demand
No ratings yet
2-Solved Problems - Water Demand
12 pages
Epicor University - Purchase Management Course PDF
No ratings yet
Epicor University - Purchase Management Course PDF
105 pages
Authoritarian
No ratings yet
Authoritarian
9 pages
Bet GPS
100% (1)
Bet GPS
14 pages
Sony Nas-C5e v1.2
No ratings yet
Sony Nas-C5e v1.2
48 pages
Truth Hardware Accessory Items
No ratings yet
Truth Hardware Accessory Items
6 pages
P6 Math Diagnostic Test (TCH Copy)
No ratings yet
P6 Math Diagnostic Test (TCH Copy)
9 pages
6-Eaton Switch Disconnector
No ratings yet
6-Eaton Switch Disconnector
84 pages
CH 04
No ratings yet
CH 04
132 pages
Final Screenless Display
No ratings yet
Final Screenless Display
4 pages
Integrative Reviewer
No ratings yet
Integrative Reviewer
6 pages
TOS Final Exam - Chem 1 COT SY 2021-2022
No ratings yet
TOS Final Exam - Chem 1 COT SY 2021-2022
5 pages
Questions & Answers On Synchronous Motors
100% (1)
Questions & Answers On Synchronous Motors
28 pages
Basic Sensory Attributes
No ratings yet
Basic Sensory Attributes
31 pages
Longitudinal Section B-B Transverse Section A-A: Welding Details Not To Scale
No ratings yet
Longitudinal Section B-B Transverse Section A-A: Welding Details Not To Scale
1 page
Tree Removal Tree Pruning Application 2022
No ratings yet
Tree Removal Tree Pruning Application 2022
2 pages
Activity 7 Kayla Lewis
No ratings yet
Activity 7 Kayla Lewis
14 pages
Rubric For Math Problem Solving
No ratings yet
Rubric For Math Problem Solving
2 pages

Example for Regression

Uploaded by

Example for Regression

Uploaded by

Problem 7 (Refer Past year TEE of SIDM 2022)

The data were analyzed and the output follows:

a. Is the regression significant as a whole? Why or why not?

While heteroscedasticity is a concern for the accuracy of the model's predictions, it

You might also like