Regression and Correlation Analysis
Regression and Correlation Analysis
Regression model.
In simple linear regression, the model used to describe the relationship between a
single dependent variable y and a single independent variable x is y = a 0 + a1x + k.
a0and a1 are referred to as the model parameters, and is a probabilistic error term that
accounts for the variability in y that cannot be explained by the linear relationship
with x. If the error term were not present, the model would be deterministic; in that
case, knowledge of the value of x would be sufficient to determine the value of y.
Correlation and regression analysis are related in the sense that both deal with
relationships among variables. The correlation coefficient is a measure of linear
association between two variables. Values of the correlation coefficient are always
between -1 and +1. A correlation coefficient of +1 indicates that two variables are
perfectly related in a positive linear sense, a correlation coefficient of -1 indicates that
two variables are perfectly related in a negative linear sense, and a correlation
coefficient of 0 indicates that there is no linear relationship between the two variables.
For simple linear regression, the sample correlation coefficient is the square root of
the coefficient of determination, with the sign of the correlation coefficient being the
same as the sign of b1, the coefficient of x1 in the estimated regression equation.