Correlation
Khairil Anuar Md. Isa, BBiomedicalSc. (Hons), UKM; MSc. (Medical Stat), USM
WHAT IS CORRELATION?
Correlation measures the strength and direction of a linear relationship between a pair of random variables. It is measured by the coefficient of correlation, ρ (rho), for the population. ρ has a value between -1 and 1. The sample estimate of ρ is denoted by r (the sample coefficient of correlation).
The type of coefficient suitable for a pair of variables is determined by their measurement scales (ratio/interval or ordinal) and their distributions (normal or not normal). Correlation does not indicate a cause-and-effect relationship. E.g.: a high correlation does not necessarily mean that one variable caused the other.
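As a rough illustration of how the choice of coefficient follows the data, the sketch below computes Pearson's r on the raw values and a rank-based coefficient on the same data. Spearman's coefficient is named here as an assumption, since the slide does not say which coefficient to use for ordinal or non-normal data. The function names and the data are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson's r: the co-variation of x and y scaled by their spreads."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def ranks(values):
    """Ranks from 1 to n; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average 1-based rank of the tied block
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result

def spearman_r(x, y):
    """Spearman's coefficient: Pearson's r applied to the ranks."""
    return pearson_r(ranks(x), ranks(y))

# Hypothetical data: y always rises with x, but not along a straight line.
x = [1, 2, 3, 4, 5, 6]
y = [1, 4, 9, 16, 25, 200]
r_pearson = pearson_r(x, y)
r_spearman = spearman_r(x, y)
print(round(r_pearson, 3), round(r_spearman, 3))
```

Because the ranks agree perfectly, the rank-based coefficient reaches 1, while Pearson's r stays below 1 because the relationship is not linear.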
CORRELATION ANSWERS:
What is the direction of the linear relationship (negative or positive)?
How strong is the linear relationship?
! Regression adds the answer to the question of prediction (more detail on the strength of the linear relationship)
Correlation
Emphasis on the degree of linear relationship
How strong is the relationship?
Does NOT matter which is X and which is Y
Regression
Emphasis on prediction
Directional ~ one variable is the predictor used to predict the other
MUST identify which is the predictor variable (X) and which is the variable to be predicted (Y)
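A minimal sketch of the contrast above, using hypothetical weight and BP values: r is the same whichever variable is called X, while the least-squares regression slope changes when predictor and outcome are swapped. The function names are illustrative only.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson's r; swapping x and y leaves the result unchanged."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def slope(x, y):
    """Least-squares slope for predicting y from x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

weight = [50, 60, 70, 80, 90]     # hypothetical values, kg
bp = [110, 118, 121, 135, 140]    # hypothetical values, mmHg

r_xy = pearson_r(weight, bp)
r_yx = pearson_r(bp, weight)
b_bp_on_weight = slope(weight, bp)   # predict BP from weight
b_weight_on_bp = slope(bp, weight)   # predict weight from BP

print(round(r_xy, 3), round(r_yx, 3))
print(round(b_bp_on_weight, 3), round(b_weight_on_bp, 3))
```

The two slopes differ, which is why regression needs X and Y to be identified first.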
Measured by Pearson's coefficient of correlation (parametric). Both variables must be quantitative and normally distributed. The nature, direction and strength of the relationship can be judged from a scatter plot.
[Figure: scatter plots. Positive correlation: BP against Weight. Negative correlation: one axis labelled X5. Almost no correlation: Weight against Age.]
[Figure: example scatter plots labelled r = -0.54, r = 0.42 and r = 0.17, with the annotation "Shape of circle".]
At least one of the variables is normally distributed
There is a linear relationship between the variables
The data come from a random sample
H0: There is no correlation between X and Y
HA: There is a correlation between X and Y
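The slide does not show a test statistic. Assuming the usual t-test for Pearson's r is intended, the null hypothesis is tested with the statistic below, where n is the number of (X, Y) pairs:

```latex
t = \frac{r\sqrt{n-2}}{\sqrt{1-r^{2}}}, \qquad \text{df} = n - 2
```

A large absolute value of t, relative to the t distribution with n - 2 degrees of freedom, is evidence against H0.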
SUMMARY..
To see whether there is any relationship between two numerical variables, in terms of strength and direction.
Scatter plot
Correlation test = r
FORMULA..
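Assuming Pearson's sample coefficient is the formula meant here, it is usually written as:

```latex
r = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}
         {\sqrt{\sum_{i=1}^{n}(x_i-\bar{x})^{2}\,\sum_{i=1}^{n}(y_i-\bar{y})^{2}}}
```

Here \bar{x} and \bar{y} are the sample means and n is the number of (X, Y) pairs.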
..CAUTIONS..
Data MUST be drawn from a random sample
One variable should not be a component of the other
Data must not be the combined data from two identifiable groups
Effect of outliers
Look at the scatter plot!
Outliers affect the means and so can affect the coefficient
The effect may be muted if the sample size is large
Check whether the point is a true outlier, then handle it accordingly
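The effect of a single outlier can be sketched with made-up numbers. The data below are hypothetical and chosen so that the eight original points show no linear relationship; one extreme point is then added.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson's r, computed from deviations about the two means."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical data with no linear relationship.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [5, 3, 6, 4, 5, 3, 6, 4]
r_without = pearson_r(x, y)

# The same data plus one extreme point far from the rest.
r_with = pearson_r(x + [30], y + [30])

print(round(r_without, 2), round(r_with, 2))
```

The single added point shifts both means and pulls r from zero to a value close to 1, which is why the scatter plot should be inspected before the coefficient is reported.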
..CAUTION..LIMITATIONS.
A very weak relationship (|r| < 0.25) can still give a significant result with an adequately large sample size
High correlation does not imply a cause-and-effect relationship
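The first limitation can be illustrated with a short sketch. It assumes the usual t statistic for Pearson's r, t = r * sqrt(n - 2) / sqrt(1 - r^2) with n - 2 degrees of freedom, which is not stated on the slide. The value 0.244 is the size of the coefficient in the interpretation example; the sample sizes are hypothetical.

```python
from math import sqrt

def t_statistic(r, n):
    """t statistic for testing zero correlation (assumed formula), df = n - 2."""
    return r * sqrt(n - 2) / sqrt(1 - r ** 2)

r = 0.244  # a weak correlation
t_stats = {n: t_statistic(r, n) for n in (20, 100, 500)}

for n, t in t_stats.items():
    print(n, round(t, 2))
```

For a two-sided test at the 5% level the critical value of t is roughly 2 at these degrees of freedom, so the same weak r is not significant with 20 pairs but is clearly significant with 100 or 500.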
STEPS..
Do scatter plot
Correlation test
INTERPRETATION
There is a significant (p < 0.001) but poor negative correlation (r = -0.244) between age and PEFR.
Thank you..