Unit 4
Unit 4
Overview:
Managerial decisions are often based on the relationship between two or more variables.
Sometimes a manager will rely on intuition to judge how two variables are related. However, if
data can be obtained, a statistical procedure called regression analysis can be used to develop
an equation showing how the variables are related. In regression terminology, the variables being
predicted is called the dependent variables or responses and the variables being used to predict
the value of dependent variables are called the independent variables or predictor variables.
Regression analysis involving one independent variable and one dependent variable is
referred to as a simple regression and in statistical notation y denotes the dependent variable and
x the independent variable. A regression analysis for which any one unit change in the
independent variable, x, is assumed to result in the same change in the dependent variable y is
referred to as a linear regression. Regression analysis involving two or more independent
variables is called multiple regression.
Learning Objectives:
After successful completion of this module, you should be able to:
• Understand how regression analysis can be used to determine how a dependent and is
related to independent variable.
• Incorporate categorical independent variables into a regression model.
• Apply different analytics modelling concepts on enterprise.
Course Materials:
To develop better work schedules, the manager wants to estimate the total daily travel
times for their drivers (denoted by y) and the number of miles traveled (denoted by x). Using the
regression analysis,, we can develop an equation showing how the dependent variable y is related
to the independent variable x.
3.1
The equation that describes how the expected value of y, denoted E(y) is related to x is
called the regression equation. The regression equation for simple linear regression follows:
3.2
Table 3.1
If the value of the population parameters were known, we could use equation
(4.2) to compute the mean value of y for a given value of x. The estimated regression equation
for simple linear regression follows:
3.3
Based on Table 4.1 we can use excel chart tools to compute for the estimation equation
of the scatter chart – see Figure 4.1 scatter chart. The following steps describes how to compute
the estimated regression equation using the data in the worksheet.
Step 1. Right click on any data point in the scatter chart and select Add Trendline.
Figure 3.2 shows the original data, scatter chart, estimated regression line and estimated
regression equation.
Figure 3.2
The least squares method is a procedure for using sample data to find the estimated
regression. Figure 3.1 is a scatter chart of data in table 3.1. Miles traveled is shown on the
horizontal axis, and travel time (in hours) is shown on the vertical axis. Scatter charts for
regression analysis are constructed with the independent variable x on the horizontal ais and the
dependent variable y on the vertical axis.
In the
multiple
3.4
regression model, b0, b1, b2, . . . , bq are the parameters and the error term is a random variable.
A close examination of this model reveals that y is a linear function of x1, x2, . . . ,xq
plus the error term . As in simple regression, the error term accounts for the variability in y
that cannot be explained by the linear effect of the q independent variables. The interpretation of
the y-intercept b0 in multiple regression is similar to the interpretation in simple regression; in a
multiple regression model, b0 is the mean of the dependent variable y when all of the independent
variables x1, x2, . . . ,xq are equal to zero.
3.5
Using Excel’s Regression Tool to Develop the Estimated Multiple Regression Equation
Steps to use Excel’s Regression tool to compute the estimated equation using the data
in the worksheet.
Step 4. When the Regression dialog box appears (as shown in Figure 3.4):
Select Labels
Selecting Labels tells Excel to use the names you have given to your variables in
Row 1 when displaying the regression model output.
Click OK
Figure 3.3
Note:
• Selecting New Worksheet Ply: tells Excel to place the output of the regression analysis in a
new work- sheet. In the adjacent box, you can specify the name of the worksheet where the
output is to be placed, or you can leave this blank and allow Excel to create anew worksheet
to use as the destination for the results of this regression analysis (as we are doing here).
• If Data Analysis does not appear in your Analysis group, you will have to load the Analysis
ToolPak Add-in into Excel. To do so, click the FILE tab in the Ribbon, and click Options. When
the Excel Options dialog box appears, click Add-Ins from the menu.
Next to Manage:, select Excel Add-ins, and click Go. . . at the bottom of the dialog box.
When the Add- Ins dialog box appears, select Analysis ToolPakand click OK.
Figure 3.5
In the Excel output shown in Figure, the label for the independent variable x1 is Miles (see
cell A18) and the label for the independent variable x2 is Deliveries (see cell A19) The estimated
regression equation is
3.6
• For a fixed number of deliveries, we estimate that the mean travel time will increase by
• For a fixed distance traveled,we estimate that the mean travel time will increase by 0.69
hours when the number of deliveries increases by 1 delivery.
The categorical
variable for the
Butler Trucking
Company example had two levels: (1) driving assignments that include travel on the congested
segment of highway during the afternoon rush hour period and (2) driving assignments that do
not. A dummy variable withavalueofzeroindicatingadrivingassignmentthatdoesnot include travel
on the congested segment of highway during the afternoon rush hour period and a value of one
Figure 3.7
indicating a driving assignment that includes such travel was sufficient. See figure below that
included the congested segment and not by preparing histogram.
Read: Chapter 4 - Camm, J., Cochran, Fry, Ohlmann, Anderson, Sweeney, Williams. (2015).
Essentials of Business Analytics. Stamford, USA: Cengage Learning.