M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
Statistics
10
9
8
7
6
Y
5
4
3
2
1
0
0 1 2 3 4 5 6 7 8 9 10
X
Perfect Positive Correlation
10
9
8
7
6
Y
5
4
3
2
1
0
0 1 2 3 4 5 6 7 8 9 10
X
Zero Correlation
10
9
8
7
6
Y
5
4
3
2
1
0
0 1 2 3 4 5 6 7 8 9 10
X
Strong Positive Correlation
10
9
8
7
6
Y
5
4
3
2
1
0
0 1 2 3 4 5 6 7 8 9 10
X
Formula for r
n( XY ) ( X )( Y )
r
n( X 2 2
) ( X ) n Y Y
2 2
Coefficient of Determination
The Coefficient of Determination, r2 - the proportion
of the total variation in the dependent variable Y
that is explained or accounted for by the variation
in the independent variable X.
The coefficient of determination is the square of the
coefficient of correlation, and ranges from 0 to 1.
Example: Sales and advertisement expense data,
r = 0.759 and r2 = (0.759)2 = 0.576
57.6% variation of sales can be explained by the
variation in advertisement expenses
Example: The following sample observations were
randomly selected:
X:4 5 3 6 7
Y:4 6 5 7 8
Determine the coefficient of correlation and coefficient
of determination and interpret.
Ans: Here, ∑X = 25, ∑Y = 30, ∑X2 = 135, ∑Y2 = 190,
∑XY = 159, r = 0.9, r2 = 0.82.
Comment: 82% of the variation in Y can be explained by
X
Example: Dan Ireland, the student body president at Toledo State
University, is concerned about the cost of textbooks. To provide
insight into the problem he selects a sample of eight textbooks
currently on sale in the bookstores. He decides to study the
relationship between the number of pages in the text and the cost.
Compute the correlation coefficient.
No. of Pages Price (in $)
500 28
700 25
800 33
600 24
400 23
500 27
600 21
800 31
Calculate r and comment on the relationship between the variables.
Also r2
Regression Analysis
In regression analysis an equation is developed to
express the relationship between dependent and
independent variables
The equation must be linear
Purpose: to determine the regression equation; it is used to
predict the value of the dependent variable (Y) based on
the independent variable (X).
Procedure: select a sample from the population and list the
paired data for each observation; draw a scatter diagram
to give a visual portrayal of the relationship; determine
the regression equation.
Regression Analysis
Yi β0 β1x i ε i
Where 0 and 1 are the population model coefficients
and is a random error term.
Simple Linear Regression Model
The population regression model:
Population Random
Population Independent
Slope Error
Y intercept Variable
Coefficient term
Yi β0 β1Xi ε i
Dependent
Variable
Y Yi β 0 β1X i ε i
Observed Value
of Y for Xi
εi Slope = β1
Predicted Value Random Error
of Y for Xi
for this Xi value
Intercept = β0
Xi
Regression Analysis
β̂ 0 b0 y bˆ1 x
x - x y y
β̂1 b1
x - x
2
Simple Linear Regression Equation
yˆ i b0 b1x i observation i
Interpretation of the Slope and the Intercept
RSS ESS
r
2
1
TSS TSS
ESS
SY X
n2