Topic 9-Simple Linear Regression
Topic 9-Simple Linear Regression
Chap 9-1
Chapter Topics
Estimation
Chap 9-2
Correlation vs. Regression
A scatter plot (or scatter diagram) can be used to
show the relationship between two numerical
variables
Chap 9-3
Regression analysis is used to:
Predict the value of a dependent variable based on
the value of independent variables
Explain the impact of changes in an independent
variable on the dependent variable
Chap 9-4
10
9
8
7
6
Y 5
4
3
2
1
0
0 1 2 3 4 5 6 7 8 9 10
X
Zero Correlation
Chap 9-7
10
9
8
7
6
Y 5
4
3
2
1
0
0 1 2 3 4 5 6 7 8 9 10
X
Chap 9-11
Simple Linear Regression Model
Chap 9-12
Types of Relationships
Linear relationships Curvilinear relationships
Y Y
X X
Y Y
X X
Chap 9-13
The Linear Regression Model
Population Random
Population Independent Error
Slope
Dependent Y intercept Variable term
Coefficient
Variable
Yi β0 β1Xi ε i
Linear component Random Error
component
Chap 9-14
Linear Regression Equation
The simple linear regression equation provides an
estimate of the population regression line
Estimated (or
predicted) Y Estimate of the Estimate of the
value for regression regression slope
observation i intercept
Value of X for
Ŷi b0 b1Xi
observation i
Chap 9-15
Interpretation of the
Intercept and the Slope
b0 (called the intercept) is the estimated mean
value of Y when the value of X is zero
Chap 9-16
b1 (called the slope) is the estimated change in the
mean value of Y for every one-unit change in X
Note:
The unit of measurement for b0 and b1 always follow
the y.
Chap 9-17
Finding the Least Squares Equation
X Y
XY n
b1
X 2
X 2
n
b0 Y b1 X
These values can be obtained from the calculator.
Chap 9-18
Example 1
Nicholas from MMU is concerned about the cost to
students of textbooks. He believes there is a
relationship between the number of pages in the text
and the selling price of the book. To provide insight
into the problem he selects a sample of 8 textbooks
currently on sale in the bookstore.
Chap 9-19
Book Page Price(RM)
Introduction to Statistics 500 84
Basic Algebra 700 75
Introduction to Psychology 800 99
Introduction to Sociology 600 72
Business Management 400 69
Introduction to Biology 500 81
Fundamentals of Finance 600 63
Principles of Marketing 800 93
Chap 9-20
Example 1
n8 XY 397,200
X 4900 X 2 3,150,000
Y 636 2
Y 51,606
Chap 9-21
X Y
XY
b1 n
X 2
X
2
n
4900(636)
397200
8
0 .05143
4900 2
3150000 8
b0 Y 1 X 79.5 0.05143(612.5) 48
Chap 9-22
Thus, the regression equation is:
Y’ = 48 + 0.05143 X
ANOVA
df SS MS F Significance F
Regression 1 18934.9348 18934.9348 11.0848 0.01039
Residual/Error 8 13665.5652 1708.1957
Total 9 32600.5000
Chap 9-26
Correlation Coefficient (r)
To measure strength of the linear relationship
between 2 variables
Also called the Pearson’s Product Moment
Correlation Coefficient
It can range from – 1.00 to 1.00
Chap 9-27
Formula to Calculate the
Coefficient of Correlation
X Y
XY
r n
X 2 2 Y 2
X Y
2
n n
Chap 9-29
Example 1
Nicholas from MMU is concerned about the cost to
students of textbooks. He believes there is a
relationship between the number of pages in the text
and the selling price of the book. To provide insight
into the problem he selects a sample of 8 textbooks
currently on sale in the bookstore.
Chap 9-30
X Y
XY
n
r
X 2 2 Y 2
X Y
2
n n
4900(636)
397200
8
0.614
49002 6362
3150000 8
51606
8
2
r 0.614 r 0.377
Chap 9-32
Estimation
Chap 9-33
Estimation
What is the estimated selling price for a book
that has 800 pages?
Chap 9-34
Types of Estimation
Interpolated Estimate Extrapolated Estimate
is an estimation made
is an estimation made outside
within the given data
the given data range.
range.
Chap 9-35
Using the Regression Equation in Example 1, compute
Y when X = 1100 & X = 550
Chap 9-42
Store Sq. Ft. Annual Sales ($’000)
1 1.7 3.7
2 1.6 3.9
3 2.8 6.7
4 5.6 9.5
5 1.3 3.4
6 2.2 5.6
7 1.3 3.7
8 1.1 2.7
9 3.2 5.5
10 1.5 2.9
11 5.2 10.7
12 4.6 7.6
13 5.8 11.8
14 3.0 4.1
Chap 9-43