Case Study 1
Case Study 1
You work at a health consultancy firm which is tasked with identifying high-risk
patients for cardiovascular disease. From the medical literature, cardiovascular
disease is influenced by patient’s age; body mass index (BMI); amount of exercise;
race, and education. The dataset contains patients’ race; body weight (in
kilograms); height (in meters); date of birth; number of minutes of exercise;
education level. Cells with NA indicate missing data (i.e., no answer was given)
1. Create a new variable called “BMI” that contains the body mass index of
patients. [6 points]
∈kg
Note 1: BMI is calculated as weight
¿¿
Note 2: When calculating BMI use mean imputation for missing data.
[5 points]
3. You are worried about selection issues with missing data and in particular
that women are more likely to not report their weight. Is this issue likely to
be a concern in your data? Justify your answer. [6 points]
1
[5 points]
10. Produce a table showing obesity by education level for (a) non-Hispanic
White individuals and (b) non-Hispanic Black individuals. [6 points]