Lecture 1 - Intro
Lecture 1 - Intro
Agenda:
Single linear regression models
Inference for regression (hypothesis test, and confidence interval for the regression
slope)
ANOVA; F-statistic; R-squared
Software outputs
Introduction
Response variable or dependent: variable being predicted.
Predictor variables or independent variables: variables being used to predict the value of
the dependent variable.
Simple (single) linear regression: a regression analysis for which any one unit change in
the independent variable, x, is assumed to result in a change in the dependent variable,
y.
Multiple linear regression: a regression analysis involving two or more independent
variables
Correlation
Σ zx z y
r=
n−1
y− y
z=
s
Substituting the values of the sample statistics b oand b 1for β oand β 1in the regression
equation and dropping the error term, we obtain the estimated regression for simple
linear regression.
o We are finding the regression that minimizes the sum of squared errors.
Estimating b oand b 1
Straight lines can be written as
We can find the slope of the least squares line using the correlation and the standard
deviations
To find the intercept of our line, we use the means. If our line estimates
the data, then it should predict the y-bar for the x-value of x-bar.