Week 5 Lecture - Data Wrangling
CMP4294 Week 5
Outline
• Dealing with missing data
• Missing data mechanisms
Examples:
• if men are more likely to tell you their weight than
women, weight is MAR (missing at random).
For example:
• probability of someone reporting their income depends on what
their income is
• probability of reporting psychiatric treatment depends on
whether or not they have received it
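The difference between the two kinds of example above can be made concrete with a small simulation. This is only a sketch: the weight distributions, the reporting probabilities, the 75 kg threshold, and the names `simulate_weight_reports` and `report_rate` are all made up for illustration. In the first column, whether weight is reported depends only on sex, which is always observed (MAR). In the second, it depends on the weight itself, which is the pattern in the income and psychiatric-treatment examples and is usually called MNAR (missing not at random).

```python
import random


def simulate_weight_reports(n=10_000, seed=0):
    """Simulate MAR and MNAR missingness on a made-up weight variable."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        sex = rng.choice(["M", "F"])
        # Illustrative distributions only, not real data.
        weight = rng.gauss(80 if sex == "M" else 68, 10)
        # MAR: the chance of reporting depends only on the observed variable sex.
        p_mar = 0.9 if sex == "M" else 0.6
        # MNAR: the chance of reporting depends on the (possibly unseen) weight.
        p_mnar = 0.9 if weight < 75 else 0.5
        rows.append({
            "sex": sex,
            "weight": weight,
            "weight_mar": weight if rng.random() < p_mar else None,
            "weight_mnar": weight if rng.random() < p_mnar else None,
        })
    return rows


def report_rate(rows, column, keep):
    """Share of the rows selected by `keep` whose `column` is not missing."""
    selected = [r for r in rows if keep(r)]
    return sum(r[column] is not None for r in selected) / len(selected)


rows = simulate_weight_reports()
print("MAR  report rate, men:    ", report_rate(rows, "weight_mar", lambda r: r["sex"] == "M"))
print("MAR  report rate, women:  ", report_rate(rows, "weight_mar", lambda r: r["sex"] == "F"))
print("MNAR report rate, < 75 kg:", report_rate(rows, "weight_mnar", lambda r: r["weight"] < 75))
print("MNAR report rate, >= 75 kg:", report_rate(rows, "weight_mnar", lambda r: r["weight"] >= 75))
```

In real data the last two rates cannot be computed, because the true weight of a non-reporter is unknown. That is why the MNAR case is the hardest to detect and to correct.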
Why are missing-value (MV) mechanisms important?
• Mean / Mode imputation
• Regression imputation
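The two methods listed above can be sketched in plain Python. This is a minimal illustration, not a reference implementation: the function names (`mean_impute`, `mode_impute`, `regression_impute`) and the toy height/weight/colour data are assumptions made for the example, missing values are represented by `None`, and the regression version assumes a single fully observed predictor with a straight-line fit.

```python
from statistics import mean, mode


def mean_impute(values):
    """Replace each missing numeric value with the mean of the observed ones."""
    fill = mean(v for v in values if v is not None)
    return [fill if v is None else v for v in values]


def mode_impute(values):
    """Replace each missing categorical value with the most common observed one."""
    fill = mode(v for v in values if v is not None)
    return [fill if v is None else v for v in values]


def regression_impute(x, y):
    """Fill missing y values from a least-squares line y = a + b*x.

    The line is fitted on the complete (x, y) pairs only; x is assumed
    to be fully observed.
    """
    pairs = [(xi, yi) for xi, yi in zip(x, y) if yi is not None]
    x_bar = mean(xi for xi, _ in pairs)
    y_bar = mean(yi for _, yi in pairs)
    sxx = sum((xi - x_bar) ** 2 for xi, _ in pairs)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in pairs)
    b = sxy / sxx            # slope
    a = y_bar - b * x_bar    # intercept
    return [a + b * xi if yi is None else yi for xi, yi in zip(x, y)]


# Hypothetical toy data: one weight and one colour are missing.
heights = [150, 160, 170, 180]
weights = [50, 60, None, 80]
colours = ["red", "blue", None, "red"]

print(mean_impute(weights))
print(mode_impute(colours))
print(regression_impute(heights, weights))
```

In this toy data the complete pairs lie exactly on the line weight = height - 100, so regression imputation fills the gap with about 70, while mean imputation fills it with the overall mean of roughly 63.3 regardless of height. Mean imputation ignores the other variables and shrinks the variance of the imputed column; regression imputation uses them, but places every imputed value exactly on the fitted line.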
Hot-deck:
For an individual with missing data, find individuals
with the same observed values on other variables,
randomly pick one of their values as the one to use for
imputation
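The hot-deck procedure described above can be sketched as follows. The function name `hot_deck_impute`, its parameters, and the toy records are hypothetical and chosen for illustration. The sketch assumes exact matching on the listed variables and leaves a value missing when no donor matches, which is one possible choice among several.

```python
import random


def hot_deck_impute(records, target, match_on, rng=None):
    """Fill missing `target` values by copying from a random matching donor.

    A donor is a record with `target` observed and the same values as the
    recipient on every variable in `match_on`. The input is not modified.
    """
    rng = rng or random.Random()
    donors = [r for r in records if r[target] is not None]
    imputed = []
    for r in records:
        filled = dict(r)  # copy, so the original record is left untouched
        if r[target] is None:
            pool = [d[target] for d in donors
                    if all(d[k] == r[k] for k in match_on)]
            if pool:  # no matching donor: keep the value missing
                filled[target] = rng.choice(pool)
        imputed.append(filled)
    return imputed


# Hypothetical toy records, not real data.
people = [
    {"sex": "F", "age_group": "30-39", "income": 32000},
    {"sex": "F", "age_group": "30-39", "income": 41000},
    {"sex": "M", "age_group": "30-39", "income": 38000},
    {"sex": "F", "age_group": "30-39", "income": None},
    {"sex": "M", "age_group": "50-59", "income": None},
]

result = hot_deck_impute(people, "income", ["sex", "age_group"],
                         rng=random.Random(0))
for row in result:
    print(row)
```

The fourth person receives the income of one of the two matching donors, chosen at random. The fifth has no donor with the same sex and age group, so the value stays missing. Because imputed values are drawn from real observed values, hot-deck imputation never produces an implausible number, but it needs enough donors in each matching group.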
Imputation methods