10 CORRELATION Phi-Coefficient
10 CORRELATION Phi-Coefficient
AND PROCEDURES
FOR TESTING
RELATIONSHIPS
PREPARED BY JE-ANN H. IROG
NON-LINEAR
CORRELATION &
NONPARAMETRIC
PHI-COEFFICIENT
Phi-Coefficient
Also referred to as the mean square contingency coefficient
and denoted by the symbol φ.
A dichotomous variable is a variable with only two mutually exclusive response options.
Phi-Coefficient
It intersects variables across a 2x2 matrix to estimate whether there
is a non-random pattern across the four cells in the 2x2 matrix
Two
Dichotomous
Variables
Example Data Set
It intersects variables across a
Subjects Sex at Birth Marital Status 2x2 matrix to estimate
1 = Male 1= Single whether there is a non-
2 = Female 2 = Married random pattern across the
A 1 2 four cells in the 2x2 matrix
B 1 1
C 1 1 SEX AT BIRTH
D 2 2 Male Female
E 2 2 MARITAL Married
F 1 1 STATUS Single
G 2 2
H 2 1
I 2 2
J 1 1
K 1 1
L 2 1
Four cells in the 2x2 matrix
MALE FEMALE
MARRIED 1 4
SINGLE 5 2
Formula
MALE FEMALE
MALE FEMALE
6 (g) 6 (h)
Interpretation
Similar to a Pearson Correlation Coefficient, a Phi
Coefficient takes on values between -1 and 1 where:
-1 indicates a perfectly negative relationship
between the two variables.
0 indicates no association between the two
variables.
+1 indicates a perfectly positive relationship between
the two variables.
Interpretation
A Phi-coefficient of 0 would indicate that there is no systematic pattern across the 2x2 matrix.
SEX AT BIRTH
Male Female
MARITAL Married 3 3
STATUS Single 3 3
or SEX AT BIRTH
Male Female
MARITAL Married 5 5
STATUS Single 1 1
Interpretation
A positive Phi-coefficient would indicate that most of the data are in the diagonal cells.
SEX AT BIRTH
Male Female
MARITAL Married 4 1
STATUS Single 2 5
Interpretation
A negative Phi-coefficient would indicate that most of the data are in the off-diagonal cells. A
SEX AT BIRTH
Interpretation
Interpretation
In general, the further away a Phi Coefficient is
from zero, the stronger the relationship between
the two variables.
In other words, the further away a Phi Coefficient
is from zero, the more evidence there is for some
type of systematic pattern between the two
variables.
Notes in Interpretation
The signs of negative (-) and positive (+) in
the computed Phi coefficient are not much
important, especially if both of your
variables are nominal. It is better to look at
the pattern in the cross tab to better
interpret and discuss the result.
SEX AT BIRTH Phi-
Coefficient
+.507
N (r)2
Where
N is the sample size
r is the computed Phi coefficient
+.507
INSTRUCTION: