Module 10. Simple Linear Regression Regression Analysis
Module 10. Simple Linear Regression Regression Analysis
Regression analysis
A parametric tool used to describe the linear relationship between the independent
and dependent variables.
Develops a model to predict the values of the dependent variable based on the
values of the independent variables.
Simplest type of regression analysis which involves one independent variable and
done dependent variable in which the relationship between the two variables is
estimated by a straight line
Y = a + bx or y = bo + b1x
To develop the linear regression model, you need to solve for the y-intercept and
slope first; the formulas of which are as follow:
Coefficient of determination = r2
For problem illustrations, let us use the same two problems in module 9.
Business Statistics: Module 10. Simple Linear Regression Page 2 of 6
Problem illustrations
In this particular module, we will set aside hypotheses, level of significance, critical
values, decision rules, conclusions, and recommendations for the meantime.
We will proceed with the computation. Initially, the table shown above has only three
columns (written in blank font); and we added three more columns (written in red font) to
complete our data matrix. For column X*Y, you need to multiply each value of X and Y,
and you have to do it for every value in column X and Y. For column X^2, you multiply
each value of X by itself, and you need to do it for every value in column X. For column
Y^2, you multiply each value of Y by itself, and you have to do it for every value in
column Y. Lastly, you need to get the total of each column.
c. Linear equation model = Y = -1.69 + 0.05X since it’s a model, we retain the X
d. Y = -1.69 + 0.05X
= -1.69 +.05(300) = 13.31
Business Statistics: Module 10. Simple Linear Regression Page 3 of 6
The r-value of 0.7196 reflects the high positive relationship of the calories and and
fat of coffee drinks
f. r2 = 0.71962 = 0.5178 or 51.78%, which means 51.78% of the changes in the fat of
the coffee drinks can be explained by its interaction with the calories of the coffee
drinks, and the remaining 48.22% are unexplained factors or factors not included in
this study.
2. The store manager wants to determine the relationship between the number of
weekend television commercials shown and the sales of stereo and sound
equipment at the store. The table below shows the gathered data.
a. Determine the slope value.
b. Determine the y-intercept value
c. Develop the linear regression equation or model
d. Predict the sales for 8 TV weekend commercials
e. Describe the coefficient of correlation
f. Describe the coefficient of determination
d. Y = 34.49 + 4.89X
= 34.49 + 4.89(8) = 73.61
There is a high positive correlation between number of weekend television commercials and
sales of stereo and sound equipment.
f. r2 = 0.89562 = 0.8021 or 80.21%, which means 80.21% of the changes in the sales of
stereo and sound equipment of the store can be explained by the changes in
number of weekend television commercials and the 19.79% are caused by other
unexplained factors.
1. A production manager has compared the dexterity test scores of seven assembly
line employees with their hourly productivity. The table below shows the result.
a. Determine the slope value.
b. Determine the y-intercept value
c. Develop the linear regression equation or model
d. Predict the hourly productivity for a dexterity score of 20
e. Describe the coefficient of correlation
f. Describe the coefficient of determination
Employee Dexterity test score (X) Hourly productivity
A 13 56
B 15 64
C 18 68
D 17 71
E 12 52
F 14 62
G 16 64
Total
2. It has been reported that the average American male consumes 3774 calories per
day and that 72.2% of American males are overweight. This information along with
data for seven other countries, is shown below.
a. Determine the slope value.
b. Determine the y-intercept value
c. Develop the linear regression equation or model
d. Predict the % overweight for American male who consumes 4000 calories per
day
e. Describe the coefficient of correlation
f. Describe the coefficient of determination
Country Calories per day % overweight
A 2214 10.6
B 2975 27.7
C 3458 40.4
D 3523 64.7
E 2560 62.8
F 3257 15.3
G 3885 70.0
Total
Business Statistics: Module 10. Simple Linear Regression Page 5 of 6
3. In a certain company, employees who have stayed for more than five years with the
firm are given the opportunity to own a stock share at a discounted price as a form of
reward. The personnel manager wanted to know if the employees’ number of years
in service with the firm has influence on number of stock shares own? Describe
a. Determine the slope value.
b. Determine the y-intercept value
c. Develop the linear regression equation or model
d. Predict the number of stock shares own of an employee with 20 years in service
with the firm.
e. Describe the coefficient of correlation
f. Describe the coefficient of determination
Employee No. of years (X) No. of stock shares (Y)
A 7 315
B 13 418
C 15 570
D 7 273
E 10 300
F 14 665
G 16 660
H 10 310
Total
4. The Insurance Institute for Highway Safety has listed the following ratings based on
collision and comprehensive claims for nine makes of midsize four-door cars from
2014-2016 model years. Higher numbers reflect higher claims in the collision and
comprehensive categories of coverage.
a. Determine the slope value.
b. Determine the y-intercept value
c. Develop the linear regression equation or model
d. Predict the comprehensive claim rating for a collision rating of 130
e. Describe the coefficient of correlation
f. Describe the coefficient of determination
Claims Collision claim rating (X) Comprehensive claim rating (Y)
1 113 89
2 108 91
3 124 92
4 131 108
5 128 108
6 90 74
7 99 79
8 106 86
9 116 98
Total
5. A mail-order catalog business that sells personal computer supplies, software, and
hardware maintains a centralized warehouse for the distribution of products ordered.
Management is currently examining the process distribution from the warehouse and
is interested in studying the factors that affect warehouse distribution costs.
Currently, a small handling fee is added to the order, regardless of the amount. The
Business Statistics: Module 10. Simple Linear Regression Page 6 of 6
table below shows the data (distribution costs in thousands of dollars and number of
orders) collected for the past 12 months.
a. Determine the slope value.
b. Determine the y-intercept value
c. Develop the linear regression equation or model
d. Predict the number of orders for a distribute costs of 90.25 (in thousand dollars)
e. Describe the coefficient of correlation
f. Describe the coefficient of determination
Month Distribution costs Number of orders
1 52.95 4015
2 71.66 3806
3 85.58 5309
4 63.69 4262
5 72.81 4296
6 68.44 4097
7 52.46 3213
8 70.77 4809
9 82.03 5237
10 74.39 4732
11 70.84 4413
12 54.08 2921
Total
References
Albright, S. et al. (2015). Business analytics: data analysis and decision making (5th
ed). Cengage Learning.
Anderson, D., Sweeney, D.J., et.al., (2018). Modern business statistics. Australia:
Cengage Learning.
Antivola, H. (2015). Business statistics: a modular approach. Books Atbp. Publishing.
Anywhere Math. (2016). Introduction to Statistics.
https://www.youtube.com/watch?v=LMSyiAJm99g.
Berenson, M.L., Levine, D.M., & Krehbiel, T.C. (2015). Basic business statistics:
concepts and applications. Pearson Education Sou7th Asia Pte. Ltd.
Bowerman, B. (2017). Business statistics in practice: using modeling, data, and
analytics (8th ed.). McGraw-Hill Education.
Jaggia, S. (2019). Business statistics: communicating with numbers (3rd ed.). McGraw-
Hill Education.
Lee, N. (2016). Business statistics: using excel & SPSS. Sage.
Mukaka, M.M. (2012). A guide to appropriate use of correlation coefficient in medical
research. Malawi Medical Journal, v.24(3).
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3576830/
Simple Learning Pro. (2015). Mean, median, mode, range, and standard deviation.
https://www.youtube.com/watch?v=mk8tOD0t8M0.
Sharpe, N. (2015). Business statistics 3rd ed. Pearson Education.
Willoughby, D. (2015). An essential guide to business statistics. John Wiley & Sons.