Regression Numarical
Regression Numarical
The least squares line has two components: the slope b, and Y-intercept a.
The equations for a and b are:
a = Y̅ - bx̅
2249 – (36)(449)/8
b=
204−362 /8
b = 5.44
x̅ = 36/8 = 4.5
Therefore
a = Y̅ - b X̅
a = 56.12 – b (4.5) = 56.12 - 5.44(4.5) = 31.64
You can make predictions of Y from given values of X using your equation:
Y = a + bX
Y = 31.64 + 5.44X
This means that our line starts out at 31.64 and the Y-values increase
by 5.44 percentage points for every 1 Chimpanzee that joins the hunting
party.
To test this out, let’s predict the percent hunt success for 4 chimpanzees,
i.e. X = 4; Y = ?
Y = 31.64 + 5.44(4), which results in Y=53.4
X = 9; Y = ?
Y = 31.64 + 5.44(4),
We just predicted the percentage of successful hunts for a chimpanzee
hunting party based solely on knowledge of their group size.
However, now that you can make predictions, you need to qualify your
predictions with the Correlation Coefficient, which describes how well the
data fits your calculated line.
r = .96
Our value is close to positive 1, which means that the data is highly
correlated, and positive.
You could have determined this from looking at the least squares line
plotted over the scatterplot, but the Correlation Coefficient gives you
scientific proof!
Table showing the scores on the final exam based on scores from
the third exam.
65 175
67 133
71 185
71 163
66 126
75 198
67 153
70 163
71 159
69 151
69 159