Assign 3
Assign 3
1.Define and compare the mean, median, and mode as measures of central tendency.
1. The average was taken for a The middle value in the The number that occurs the
set of numbers is called a data set is called Median. most in a given list of numbers
mean. is called a mode.
2. Add all of the numbers Place all the given It shows the frequency of
together and divide this sum numbers in an ascending occurrence.
of all numbers by a total order
number of numbers.
3. The result is the mean or The next step is to find the We can have more than one
average score. middle number on the list. mode or no mode at all.
It is called as the median.
4. Example: To find the Example: If the given list is Example: In the given series
average of the four numbers 4, 2, 8, 10, 19. 3,3,5,6,7,7,8,1,1,1,4,5,6
2, 4, 6, 8, we need to add the
number first. 1. Arrange the 1. Find the frequency of
numbers in each number.
1. 2 + 4 + 6+ 8 = 20 ascending order i. e 2. For number 3 it’s 2, for 5
2. Divide the sum by the 2, 4, 8, 10, 19. it’s 2, for 6 it’s 2, for 7 it’s
total number of 2. As the total 2, for 8 it’s one, for 1 it’s
numbers, i. e 4. numbers are 5, so 3, for 4 it’s 1.
3. 20/4 = 5 is the the middle number 3. The number with the
average or mean 8 is the median highest frequency is the
here. mode.
2. Under what conditions is use of the mean preferred? The median? The mode?
3. Define and compare three commonly used measures of dispersion the range, the
interquartile range, and the standard deviation.
Range
Range is the difference between the highest and lowest value of the sample.
Interquartile range
The interquartile range tells you the spread of the middle half of your distribution.
The interquartile range is a useful measure of variability and is given by the lower,
upper and middle quartiles.
Standard deviation
Standard deviation is defined as the square root of arithmetic mean of the squared
deviation of observation taken from the average observation.
6. A study was conducted investigating the long-term prognosis of children who have suffered
an acute episode of bacterial meningitis, an inflammation of the membranes enclosing the
brain and spinal cord. Listed below are the times to the onset of seizure for 13 children who
took part in the study [10]. In months, the measurements are:
0.10 0.25 0.50 4 12 12 24 24 31 36 42 55 96
(a) Find the following numerical summary measures of the data
i. mean 25.9115
ii. median 24.00
iii. mode 12.00
iv. range 95.90
v. interquartile range
39-2.25=36.75
vi. standard deviation 27.371
Statistics
Onset of Seizure
N Valid 13
Missing 0
Mean 25.9115
Median 24.0000
Mode 12.00a
Std. Deviation 27.37094
Range 95.90
Percentiles 25 2.2500
75 39.0000
a. Multiple modes exist. The smallest
value is shown
(a) Find the mean, median, standard deviation, and range of the recorded calcium levels.
Statistics
Calcium (nmol/l)
N Valid 8
Missing 0
Mean 3.1425
Median 3.0800
Mode 2.37a
Std. Deviation .51068
Range 1.47
a. Multiple modes exist. The smallest value is shown
(b) Compute the mean, median, standard deviation, and range of the given albumin levels.
Statistics
Albumin(g/l)
N Valid 8
Missing 0
Mean 40.38
Median 42.00
Mode 42
Std. Deviation 3.021
Range 9
(c) For healthy individuals, the normal range of calcium values is 2.12 to 2.74 mmol/1. while
the range of albumin levels is 32 to 55 g/1. Do you believe that patients suffering from
vitamin D intoxication have normal blood levels of calcium and albumin?
Yes, higher Vitamin D level leads to an increase in calcium, and more vitamin D
bound to albumin causes a decrease in albumin.
8. A study was conducted comparing female adolescents who suffer from bulimia to healthy
females with similar body compositions and levels of physical activity.
Listed below are measures of daily caloric intake, recorded in kilocalories per kilo-gram, for
samples of adolescents from each group [12].
(a) Find the median daily caloric intake for both the bulimic adolescents and the healthy ones.
Statistics
Bulimic Healthy
N Valid 23 15
Missing 0 8
Median 21.6000 30.6000
(b) Compute the interquartile range for each group.
Statistics
Bulimic Healthy
N Valid 23 15
Missing 0 8
Median 21.6000 30.6000
Percentiles 25 18.1000 23.8000
75 25.2000 36.6000
Bulimic
IQR= 25.2-18.1
=7.1
Healthy
IQR=36.6-23.8
=12.8
(c) Is a typical value of daily caloric intake larger for the individuals suffering from bulimia or
for the healthy adolescents? Which group has a greater amount of variability in the
measurements?
The typical value of daily caloric intake is larger for the healthy adolescents.
Healthy adolescents have a greater amount of variability in the measurements.
12. The percentages of low-birth-weight infants- defined as those weighing less than
2500 grams--for a number of nations around the world are saved under the variable name
lowbwt in the data set unicef [13] (Appendix B, Table B.2).
(a) Compute the mean and the median of these observations.
Statistics
Lowbwt Life60 Life92
N Valid 111 130 144
Missing 33 14 0
Mean 11.9640 51.7615 63.8542
Median 10.0000 48.0000 67.0000
Descriptive
Statistic Std. Error
Lowbwt Mean 11.9640 .61923
5% Confidence Interval for Lower Bound 11.9250
Mean Upper Bound 12.0029
5% Trimmed Mean 11.3784
Median 10.0000
Variance 42.562
Std. Deviation 6.52398
Minimum 4.00
Maximum 50.00
Range 46.00
Interquartile Range 8.00
Skewness 2.283 .229
Kurtosis 9.941 .455
(c) For this data set, which of these numbers would you prefer as a measure of central
tendency? Explain.
Although mean is sensitive to size and the distribution is nearly symmetrical, mean
will be favoured since the trimmed mean will eliminate the outliers.