How To Choose The Correct Statistical Test
How To Choose The Correct Statistical Test
How To Choose The Correct Statistical Test
Level of Measurement
Level of Measurement
• Nominal
– Named categories with no order
• Ordinal
– Categories with a logical order or rank order
• Interval
– Rank order AND distant between intervals of measurement have
meaning (zero value is arbitrary).
• Ratio
– Same properties as interval data AND the distance and ratio between
two measurements are defined and has an empirical (not arbitrary)
zero value.
– You can say a score of 20 is “twice as much” as 10.
Liamputtong 2013
Level of Measurement
Type Description
Nominal Classes or categories without numerical order
•Male, female
•Jewish, Catholic, Muslim
Ordinal (ranked) Ordered categories
•Mild pain, moderate pain, and severe pain
•High school, undergraduate, graduate
Interval The distance or interval between two
measurements have meaning
•Temperature in Celsius (zero = 273.15 Kelvin)
Ratio The distance and ratio between two
measurements are defined and zero has a
meaning of zero and therefore you can say “twice
as much”
•Weight
•Age in years
•Temperature in Kelvin (absolute zero)
What is the Distribution of the Data?
Central Tendency and Dispersion
• Central tendency
– Where the bulk of the data lie.
• Mode, Median, Mean, etc
• Dispersion
– How wide or narrow the data are
spread out.
• Number of categories, Range, Standard
Deviation, etc
Health Research Methods: A Canadian Perspective (2014) Edited by K. Bassil & D. Zabkiewicz; Chapter 7, pp. 119-142
Central Tendency
• Mode
– The value that appears most often
– (3, 4, 5, 6, 8, 8, 15) Mode = 8
• Mean
– The arithmetic average of the observations
– (3, 4, 5, 6, 8, 8, 15) Mean = 7
• Median
– Middle value (3, 4, 5, 6, 8, 8, 15) Median = 6
Level of Central Tendency Dispersion
Measurement
Nominal Mode (most frequent category) Number of categories
Ordinal Median (data are ranked, middle value with Range and the
half above and half below) Interquartile range
(median of upper half
and median of lower
half IQR is difference
between the two)
Health Research Methods: A Canadian Perspective (2014) Edited by K. Bassil & D. Zabkiewicz; Chapter 7,
pp. 119-142
Level of Central Tendency Dispersion
Measurement
Nominal Mode (most frequent category) Number of categories
Ordinal Median (data are ranked, middle value with Range and the
half above and half below) Interquartile range
(median of upper half
and median of lower
half IQR is difference
NON-PARAMETERIC TESTS between the two)
Health Research Methods: A Canadian Perspective (2014) Edited by K. Bassil & D. Zabkiewicz; Chapter 7,
pp. 119-142
What is the Distribution of the Data?
Normal Distribution
Or
Non-Normal Distribution
Normal Distribution
Average Hours of Sleep
Mean = 7.92
Std Error = 0.13
95% CI = 7.68 to 8.18
Non-Normal Distribution
Among respondents with babies
Mean = 5.88
Std Error = 0.30
95% CI = 5.27 to 6.49
Distribution of the Data
• Parametric test
– Interval or ratio level data with a NORMAL
DISTRIBUTION
• Non-parametric test
– Nominal or ordinal level data or interval or ratio
with a NON-NORMAL DISTRIBUTION
Common Statistical Tests
Is there a difference?
Parametric Non-Parametric
• T-test • Mann Whitney U
• ANOVA • Kruskal Wallis Test
• Chi-Square Test
T-test
• To test if two means are statistically different?
Depression (mean)
Total Sample 4
No Sexual abuse 2
Sexual abuse 8
What if the Distribution was
NON-NORMAL?
– One variable is Continuous (interval or ratio level)
with a NON-NORMAL DISTRIBUTION
• Non-normal distribution
– One Variable
• Ordinal or non-normal continuous level
– One Variable
• Two-level-categorical, dichotomous
– Dependent Variable
• Continuous (one variable)
– Independent Variable
• Categorical (One variable with more than two levels or groups)
Bruce, (2008); Tabachnick & Fidell (2007); Winston (1999); Liamputtong, 2013
ANOVA
• Are the mean depression score different for adolescents who experience
mild sexual abuse, moderate sexual abuse, or severe sexual abuse?
Depression (mean)
Total Sample 4
No Sexual Abuse 2
Minor Sexual Abuse 4
Moderate Sexual Abuse 7
Severe Sexual Abuse 9
ANOVA
• To test if three or means are statistically
different?
Liamputtong, 2013
Chi-Square Test of Significance (X2)
– One Variable
• Categorical with 2 or more levels
– One Variable
• Categorical with 2 or more levels
Pearson Spearman
• Distribution of the variables • Distribution of the variables
are normal (parametric test) are non-normal (non-
parametric test) OR one or
more variables are ordinal
– One Variable
• Continuous – One Variable
– One Variable • Continuous/Categorical
• Continuous – One Variable
• Continuous/Categorical
Linear Regression
Dependent variable = continuous (1 variable)
Independent variable = any level (1 or more)
Logistic Regression
Dependent variable = Dichotomous (1 variable)
Independent variable = any level (1 or more)
Is there a difference?
Parametric Test (Normal Distribution) Non-Parametric Test (Non-Normal Distribution)
T-test (difference in means) Mann Whitney U (difference in Medians)
One variable = continuous One variable = Continuous or ordinal
One variable = Dichotomous One variable = dichotomous
Chi-Square Test
One variable = 2 or more categories
One variable = 2 or more categories
To pick the correct statistical test you need to know…