Assignment # 1
Assignment # 1
Suppose we collect information on the ages (in years) of 50 students selected from a university.
21 19 24 25 29 34 26 27 37 33
18 20 19 22 19 19 25 22 25 23
25 19 31 19 23 18 23 19 23 26
22 28 21 20 22 22 21 20 19 21
25 23 18 37 27 23 21 25 21 24
(a) [5 points] Construct a stem-and-leaf display for the data.
(b) [5 points] Construct a histogram for the data.
(c) [10 points] Construct a box plot for the data. Do you think that data set has any possible outlier?
(d) [5 points] Calculate the mean, median and variance of all scores.
Question # 2:
The following data give the results of a sample survey. The letters Y, N, and D represent the three categories.
DNNYYYNYDY
YYYYNYYNNY
NYYNDNYYYY
YYNNYYNNDY
Keywords:
lung capacity
Categories:
regression; health
Description:
The data give information on the health and smoking habits of a sample of
654 youths, aged 3 to 19, in the area of East Boston during middle to late
1970s.In the full data set, there are 654 observations on 5 variables
Variables:
Age: The age of the subject in completed years
FEV: The forced expiratory volume, a measure of lung capacity, in
litres
Ht: Height (in inches)
Gender: The gender of the subject: Females coded as 0, males as 1
Smoke: The smoking status of the subject: 0 means a non-smoker;
1 means a smoker
Data Quality:
There are no missing values.
Source:
Kahn, Michael (2005). An Exhalent Problem for Teaching Statistics.
The Journal of Statistical Education, 13(2). Available on-line Notes:
References:
Kahn,M. (2003). Data Sleuth, STATS, 37, 24.