Correlation
Correlation
Correlation Coefficients
• The relations between two variables
• How the value of one variable changes when the value of another variable
changes
• A correlation coefficient is a numerical index to reflect the
relationship between two variables.
• Range: -1 ~ +1
• Bivariate correlation (for two variables)
Correlation Coefficients
• Parametric
• Pearson product-moment correlation (named for inventor Karl Pearson)
• Non-parametric
• Spearman’s rank correlation
• Kendall tau rank correlation coefficient
Pearson correlation coefficient
• For two variables which are continuous in nature
• Height, age, test score, income
• But not for discrete or categorical variables
• Race, political affiliation, social class, rank
Y Y
X X
Y Y
X X
Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
Linear Correlation
Strong relationships Weak relationships
Y Y
X X
Y Y
X X
Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
Linear Correlation
No relationship
X
Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
Correlation
8
Positive Correlation
12
Correlation vs. causality
• The correlation represents the association between two or more
variables
• It has nothing to do with causality (there is no cause relation between
two correlated variables)
• Ice cream and crime are correlated, but
• Ice cream does not cause crime
Pearson product-moment correlation
coefficient
n XY X Y
rxy
[n X 2 ( X ) ][ n Y ( Y )
2 2 2
]
A Pearson product-moment correlation was run to determine the relationship between height and
distance jumped in a long jump. There was a strong, positive correlation between height and distance
jumped, which was statistically significant (r = .706, n = 14, p = .005).