Stats 4
Stats 4
On a scatter diagram, the closer the points lie to a straight line, the stronger the linear
relationship between two variables. To quantify the strength of the relationship, we can
calculate the correlation coefficient. In algebraic notation, if we have two variables x and y,
and the data take the form of n pairs (i.e. [x 1, y1], [x2, y2], [x3, y3] ... [xn, yn]), then the
correlation coefficient is given by the following equation:
where is the mean of the x values, and is the mean of the y values.
This is the product moment correlation coefficient (or Pearson
correlation coefficient). The value of r always lies between -1 and
+1. A value of the correlation coefficient close to +1 indicates a
strong positive linear relationship (i.e. one variable increases with
the other; Fig. 2). A value close to -1 indicates a strong negative
linear relationship (i.e. one variable decreases as the other
increases; Fig. 3). A value close to 0 indicates no linear relationship
(Fig. 4); however, there could be a nonlinear relationship between
the variables (Fig. 5).
FIGURE 2;
CORRELATION COEFFICIENT (R) = +0.9. POSITIVE LINEAR RELATIONSHIP
FIGURE 3;
CORRELATION COEFFICIENT (R) = -0.9. NEGATIVE LINEAR RELATIONSHIP.
FIGURE 4;
CORRELATION COEFFICIENT (R) = 0.04. NO RELATIONSHIP.
FIGURE 5;
CORRELATION COEFFICIENT (R) = -0.03. NONLINEAR RELATIONSHIP.
HYPOTHESIS TEST OF CORRELATION