Applied Quantitative Analysis and Practices: Lecture#22
Applied Quantitative Analysis and Practices: Lecture#22
and Practices
LECTURE#22
By
Dr. Osman Sadiq Paracha
Previous Lecture Summary
Application in SPSS for factor analysis stages
Interpretation of factor matrix
Validation of factor analysis
Factor Scores
Simple Linear Regression
Correlation vs. Regression
DCOVA
A scatter plot can be used to show the
relationship between two variables
Correlation analysis is used to measure the
strength of the association (linear relationship)
between two variables
Correlation is only concerned with strength of the
relationship
No causal effect is implied with correlation
Types of Relationships
DCOVA
Y Y
X X
Y Y
X X
Types of Relationships
DCOVA
(continued)
Strong relationships Weak relationships
Y Y
X X
Y Y
X X
Types of Relationships
DCOVA
(continued)
No relationship
X
Introduction to
Regression Analysis
DCOVA
Regression analysis is used to:
Predict the value of a dependent variable based on
the value of at least one independent variable
Explain the impact of changes in an independent
variable on the dependent variable
Dependent variable: the variable we wish to
predict or explain
Independent variable: the variable used to predict
or explain the dependent
variable
Simple Linear Regression
Model
DCOVA
Only one independent variable, X
Relationship between X and Y is
described by a linear function
Changes in Y are assumed to be related
to changes in X
Simple Linear Regression
Model
DCOVA
Population Random
Population Independent Error
Slope
Y intercept Variable term
Coefficient
Dependent
Variable
Yi β0 β1Xi ε i
Linear component Random Error
component
Simple Linear Regression
Model DCOVA
(continued)
Y Yi β0 β1Xi ε i
Observed Value
of Y for Xi
εi Slope = β1
Predicted Value
Random Error
of Y for Xi
for this Xi value
Intercept = β0
Xi X
Simple Linear Regression
Equation (Prediction Line) DCOVA
The simple linear regression equation provides an
estimate of the population regression line
Estimated
(or predicted) Estimate of Estimate of the
Y value for the regression regression slope
observation i
intercept
Value of X for
Ŷi b0 b1Xi
observation i
The Least Squares Method
b1 =
b0 =
Interpretation of the
Slope and the Intercept
350
300
250
200
150
100
50
0
0 500 1000 1500 2000 2500 3000
Square Feet
Simple Linear Regression Example
Regression Statistics
Multiple R 0.76211 The regression equation is:
R Square 0.58082
Adjusted R Square 0.52842 house price 98.24833 0.10977 (square feet)
Standard Error 41.33032
Observations 10
ANOVA
df SS MS F Significance F
Regression 1 18934.9348 18934.9348 11.0848 0.01039
Residual 8 13665.5652 1708.1957
Total 9 32600.5000
350
Slope
300
250
= 0.10977
200
150
100
50
Intercept 0
= 98.248 0 500 1000 1500 2000 2500 3000
Square Feet
98.25 0.1098(200 0)
317.85
The predicted price for a house with 2000
square feet is 317.85($1,000s) = $317,850
Simple Linear Regression
Example: Making Predictions
When using a regression model for prediction,
only predict within the relevant range of data
Relevant range for
interpolation
450
400
House Price ($1000s)
350
300
250
200
150
100
50 Do not try to
0
extrapolate
0 500 1000 1500 2000 2500 3000
Square Feet
beyond the range
of observed X’s
Measures of Variation
Total variation is made up of two parts: