Student Data Chapter A2 Data Descriptors r3
1 Exercise 2.1
2
3 X
4 6
5 13
6 16
7 45
8 93
9 0
10 62
11 87
12 136
13 25
14 14
15 31
16
17 mean = 44 =AVERAGE(B4:B15)
18 median = 28 =MEDIAN(B4:B15)
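The AVERAGE and MEDIAN results above can be checked outside Excel. The following is a minimal Python sketch, assuming the twelve values in B4:B15 are as listed in the worksheet; the variable names are illustrative and not part of the original workbook.

```python
# Values from B4:B15 of Exercise 2.1 (copied from the worksheet above)
x = [6, 13, 16, 45, 93, 0, 62, 87, 136, 25, 14, 31]

ordered = sorted(x)
n = len(ordered)

# Mean: sum of the values divided by the number of values
x_mean = sum(ordered) / n

# Median: the middle value, or the average of the two middle values when n is even
mid = n // 2
if n % 2 == 1:
    x_median = ordered[mid]
else:
    x_median = (ordered[mid - 1] + ordered[mid]) / 2

print(f"mean = {x_mean}, median = {x_median}")
```

With n = 12 the median is the average of the 6th and 7th ordered values, 25 and 31.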
A B C D E
1 Exercise X2.2
2
3
4 IQ
5 115
6 89
7 94
8 107
9 98
10 87
11 99
12 120
13 100
14 94
15 100
16 99
17
18 mean = 100.16666667 =AVERAGE(B5:B16)
19 median = 99 =MEDIAN(B5:B16)
20
21 Mean is slightly over 100; suggest increasing the sample size to confirm validity
A B C D
1 Exercise X2.3
2
3
4 Time
5 40
6 44
7 55
8 55
9 64
10 69
11
12 Small sample
13 Well spaced out
14 Any extreme values? No
15 Use mean
16
17 mean = 54.5 =AVERAGE(B5:B10)
18 median = 55 =MEDIAN(B5:B10)
A B C D
1 Exercise X2.4
2
3
4 X
5 1
6 1
7 1
8 1
9 1
10 2
11 2
12 2
13 2
14 2
15 2
16 3
17 3
18 3
19 3
20 3
21 4
22 4
23 4
24 4
25 5
26 5
27 5
28 5
29 5
30
31 Mean = 2.92 =AVERAGE(B5:B29)
32 Median = 3 =MEDIAN(B5:B29)
33 Mode = 2 =MODE(B5:B29)
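The mode is easiest to see from a frequency count. The sketch below is one possible Python check of the three results, assuming the 25 values in B5:B29 are as listed; `multimode` is used here instead of a single-mode function so that ties, if any, would be visible, which is a choice of this sketch rather than something the worksheet does.

```python
from collections import Counter
from statistics import mean, median, multimode

# Values from B5:B29 of X2.4, written as value * number of occurrences
x = [1] * 5 + [2] * 6 + [3] * 5 + [4] * 4 + [5] * 5

frequency = Counter(x)   # how often each value occurs
modes = multimode(x)     # every value tied for the highest frequency

print("frequencies:", dict(sorted(frequency.items())))
print("mean =", mean(x), "median =", median(x), "mode(s) =", modes)
```

The value 2 occurs six times, more than any other value, so it is the only mode.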
A B C D E F
1 X2.5
2
3 ID x mean = 83.45 =AVERAGE(B4:B23)
4 1 75 median = 84 =MEDIAN(B4:B23)
5 2 77 Q1 = 80.25 =QUARTILE.EXC(B4:B23,1)
6 3 78 Q3 = 87.75 =QUARTILE.EXC(B4:B23,3)
7 4 78 20th percentile = 78.4 =PERCENTILE.EXC(B4:B23,0.2)
8 5 80
9 6 81
10 7 81 Given numbers are in size order
11 8 82 Using equation (5.2), rearranged to find P given the position of 85 in the ordered list and the value of N
12 9 83
13 10 84 N= 20 =COUNT(B4:B23)
14 11 84 For the value 85
15 12 84 Position of value 85 from ordered list = 13
16 13 85 P= 61.9047619 =100*E15/(E13+1)
17 14 87 Therefore, the value 85 represents approximately the 62nd percentile
18 15 87
19 16 88 Check, 61.904...th percentile = 85 =PERCENTILE.EXC(B4:B23,E16/100)
20 17 88
21 18 88
22 19 89
23 20 90
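The rearranged position formula, P = 100 × position / (N + 1), and the exclusive quartiles can be sketched in Python as follows. This assumes the 20 ordered values in B4:B23 are as listed; `percentile_exc` is a hypothetical helper written for this sketch that interpolates at rank P/100 × (N + 1), and `statistics.quantiles` with `method="exclusive"` is used as the counterpart of QUARTILE.EXC.

```python
from statistics import quantiles

# Ordered x values from B4:B23 of X2.5
x = [75, 77, 78, 78, 80, 81, 81, 82, 83, 84,
     84, 84, 85, 87, 87, 88, 88, 88, 89, 90]

n = len(x)
position = x.index(85) + 1          # 1-based position of 85 in the ordered list
p = 100 * position / (n + 1)        # rearranged position formula


def percentile_exc(sorted_values, percent):
    """Exclusive percentile: interpolate at rank percent/100 * (N + 1)."""
    size = len(sorted_values)
    rank = percent / 100 * (size + 1)
    if rank < 1 or rank > size:
        raise ValueError("percent is outside the range the exclusive method supports")
    lower = int(rank)               # whole part of the 1-based rank
    fraction = rank - lower
    if lower == size:
        return sorted_values[-1]
    step = sorted_values[lower] - sorted_values[lower - 1]
    return sorted_values[lower - 1] + fraction * step


q1, q2, q3 = quantiles(x, n=4, method="exclusive")
print(f"P = {p:.4f}")
print("20th percentile =", percentile_exc(x, 20))
print("check: value at P =", percentile_exc(x, p))
print("Q1 =", q1, "median =", q2, "Q3 =", q3)
```

Feeding P back into the percentile function returns 85, which is the same check the worksheet performs with PERCENTILE.EXC.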
A B C D E F G
1 X2.6
2
3 x Summary statistics
4 95 Mean = 128.5714 =AVERAGE(B4:B10)
5 110 Median = 130 =MEDIAN(B4:B10)
6 105
7 130 Mean ≈ Median, use mean and therefore standard deviation as the measure of spread
8 135 Assuming data is a sample from a population
9 155 Population sd = 25.31435 =STDEV.P(B4:B10)
10 170 Sample sd = 27.34262 =STDEV.S(B4:B10)
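The only difference between the two standard deviations is the divisor: N for the population form and N - 1 for the sample form. A minimal Python sketch, assuming the seven values in B4:B10 are as listed, with `pstdev` and `stdev` standing in for STDEV.P and STDEV.S:

```python
from statistics import mean, median, pstdev, stdev

# Values from B4:B10 of X2.6
x = [95, 110, 105, 130, 135, 155, 170]

x_mean = mean(x)
x_median = median(x)
population_sd = pstdev(x)   # sum of squared deviations divided by N
sample_sd = stdev(x)        # sum of squared deviations divided by N - 1

print(f"mean = {x_mean:.4f}, median = {x_median}")
print(f"population sd = {population_sd:.5f}, sample sd = {sample_sd:.5f}")
```

Because the data are assumed to be a sample, the sample form is the one that matches the stated assumption; it is always the larger of the two.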
A B C D E F G H I J K L M N O P Q R S
1 X2.7
2
3 25 25 32 16 25 29 30 28 26 26 Summary statistics
4 20 23 28 25 18 18 22 18 21 25 mean = 24.275 =AVERAGE(B3:K6)
5 28 22 32 19 28 28 27 18 33 26 median = 25 =MEDIAN(B3:K6)
6 28 19 18 18 29 25 20 20 23 30
7 Mean ≈ Median, use mean and therefore standard deviation as the measure of spread
8 Assuming data is a sample from a population
9 Population sd = 4.604278 =STDEV.P(B3:K6)
10 Sample sd = 4.662933 =STDEV.S(B3:K6)
A B C D E F G H I J K L M N O P Q R S
1 X2.8
2
3 10 9 9 6 7 5 12 8 6 8 Summary statistics
4 2 9 4 10 5 5 5 7 9 9 mean = 6.966667 =AVERAGE(B3:K5)
5 6 7 7 8 6 4 8 7 5 6 median = 7 =MEDIAN(B3:K5)
6
7 Mean ≈ Median, use mean and therefore standard deviation as the measure of spread
8 Assuming data is a sample from a population
9 Population sd = 2.121058 =STDEV.P(B3:K5)
10 Sample sd = 2.157318 =STDEV.S(B3:K5)
A B C D E F G H I J K L
1 X2.9
2
3 (a)
4 80 165 159 143 140 (b) N= 45 =COUNT(B4:F12)
5 136 138 118 120 124 Mean = 132.13333 =AVERAGE(B4:F12)
6 159 131 93 145 109 Median = 136 =MEDIAN(B4:F12)
7 163 136 163 142 80 Standard deviation = 24.40364 =STDEV.P(B4:F12)
8 106 111 123 161 179 Q1 = 114 =QUARTILE.INC(B4:F12,1)
9 144 145 91 112 146 Q3 = 146 =QUARTILE.INC(B4:F12,3)
10 170 105 131 141 122 IQR = 32 =J9-J8
11 137 152 109 122 126 (d) Skewness = -0.29823 =SKEW(B4:F12)
12 114 155 92 143 165 Kurtosis = -0.54189 =KURT(B4:F12)
13
14
15 Class X Bin Bin Frequency
16 74.5 74.5 0
17 75-84 84.5 84.5 2
18 85-94 94.5 94.5 3
19 95-104 104.5 104.5 0
20 105-114 114.5 114.5 7
21 115-124 124.5 124.5 6
22 125-134 134.5 134.5 3
23 135-144 144.5 144.5 10
24 145-154 154.5 154.5 4
25 155-164 164.5 164.5 6
26 165-174 174.5 174.5 3
27 175-184 184.5 184.5 1
28 More 0
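The bin frequencies can be reproduced by counting how many values fall in each class. The sketch below assumes the 45 mileage values in B4:F12 are as listed and relies on the data being whole numbers, so the class index can be found by integer division; the constant names are illustrative only.

```python
from collections import Counter

# Mileage values from B4:F12 of X2.9, read row by row
mileage = [
    80, 165, 159, 143, 140,
    136, 138, 118, 120, 124,
    159, 131, 93, 145, 109,
    163, 136, 163, 142, 80,
    106, 111, 123, 161, 179,
    144, 145, 91, 112, 146,
    170, 105, 131, 141, 122,
    137, 152, 109, 122, 126,
    114, 155, 92, 143, 165,
]

FIRST_LOWER_LIMIT = 75    # lower limit of the first class, 75-84
CLASS_WIDTH = 10
NUMBER_OF_CLASSES = 11    # 75-84 up to 175-184

labels = [
    f"{lower}-{lower + CLASS_WIDTH - 1}"
    for lower in range(FIRST_LOWER_LIMIT,
                       FIRST_LOWER_LIMIT + CLASS_WIDTH * NUMBER_OF_CLASSES,
                       CLASS_WIDTH)
]

# Class index = number of whole class widths above the first lower limit
counts = Counter((value - FIRST_LOWER_LIMIT) // CLASS_WIDTH for value in mileage)
frequencies = [counts[i] for i in range(NUMBER_OF_CLASSES)]

# Text histogram: one asterisk per observation
for label, frequency in zip(labels, frequencies):
    print(f"{label:>8} {frequency:>3} {'*' * frequency}")
```

The frequencies sum to N = 45, which is a quick check that no value fell outside the classes.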
29
30
31
32 Class X Freq, f
33 75-84 2
34 85-94 3
35 95-104 0
36 105-114 7
37 115-124 6
38 125-134 3
39 135-144 10
40 145-154 4
41 155-164 6
42 165-174 3
43 175-184 1
44
[Figure: Histogram for mileage travelled; x-axis: Mileage (classes 75-84 to 175-184); y-axis: Frequency]
A B C D
1 X2.10
2
3 Mean = 27000
4 Standard deviation = 6000
5
6 Coefficient of variation = 22.22222 =C4/C3*100
A B C D
1 X2.11
2
3 Mean = 200
4 Standard deviation = 40
5
6 Coefficient of variation = 20 =C4/C3*100
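The coefficient of variation expresses the standard deviation as a percentage of the mean, which makes the spread in X2.10 and X2.11 comparable even though the two means are on very different scales. A minimal Python sketch using the values given in the two worksheets; the function name is illustrative:

```python
def coefficient_of_variation(mean, sd):
    """Return the standard deviation as a percentage of the mean."""
    return sd / mean * 100


cv_x2_10 = coefficient_of_variation(mean=27000, sd=6000)   # X2.10
cv_x2_11 = coefficient_of_variation(mean=200, sd=40)       # X2.11

print(f"X2.10: {cv_x2_10:.2f}%")
print(f"X2.11: {cv_x2_11:.2f}%")
```

Relative to its mean, the X2.10 data are slightly more variable than the X2.11 data.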
X2.12
19 28 17 16 18 23 19 21 24 17
20 20 21 25 20 21 17 20 20 22
15 16 17 21 21 21 13 16 15 19
Sample skewness = 0.426482 =SKEW(B3:K5)
Population skewness = 0.404854 =SKEW.P(B3:K5)
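The two skewness figures differ only by a sample-size correction. The sketch below assumes the 30 values are as listed and uses the usual moment definitions: population skewness is the third central moment divided by the second central moment raised to the power 1.5, and the sample version multiplies that by sqrt(n(n - 1)) / (n - 2). The function names are hypothetical helpers for this sketch.

```python
from math import sqrt

# The 30 values of X2.12, read row by row
values = [
    19, 28, 17, 16, 18, 23, 19, 21, 24, 17,
    20, 20, 21, 25, 20, 21, 17, 20, 20, 22,
    15, 16, 17, 21, 21, 21, 13, 16, 15, 19,
]


def population_skewness(data):
    """Third central moment divided by the second central moment to the power 1.5."""
    n = len(data)
    centre = sum(data) / n
    m2 = sum((v - centre) ** 2 for v in data) / n
    m3 = sum((v - centre) ** 3 for v in data) / n
    return m3 / m2 ** 1.5


def sample_skewness(data):
    """Population skewness with the small-sample adjustment factor applied."""
    n = len(data)
    return population_skewness(data) * sqrt(n * (n - 1)) / (n - 2)


print(f"population skewness = {population_skewness(values):.6f}")
print(f"sample skewness     = {sample_skewness(values):.6f}")
```

Both values are positive, indicating a mild right skew; the adjustment factor approaches 1 as n grows, so the two figures converge for large samples.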
13 1.29
14 1.59 Inner fence
15 1.73 lower value = 0.025 =E6-1.5*E12
16 2.31 upper value = 2.825 =E8+1.5*E12
17 1.23
18 1.89 Outer fence
19 1.54 lower value = -1.025 =E6-3*E12
20 1.97 upper value = 3.875 =E8+3*E12
21 1.26
22 0.27
23 0.79
24 1.23
25 1.56
26 0.89
27 1.78
28 1.52
29 1.07
30 0.92
31 1.38
32 1.56
33 1.98
34 1.34
35 4.89
36 1.39
37 1.76
38 0.71
39 2.46
40 0.89
41 2.01
42 3.21
43 1.98
[Figure: box plot; y-axis: Mark]
A B C D E F G H I J K L M N O
1 X2.17 Five number summary and box plot
2
3 x Quartile.exc function
4 162726 Statistic Value
5 188656 Minimum 123636 =MIN(B4:B53)
6 165547 Q1 155332 =QUARTILE.EXC(B4:B53,1)
7 175806 Median 175144 =QUARTILE.EXC(B4:B53,2)
8 190670 Q3 193021.5 =QUARTILE.EXC(B4:B53,3)
9 145810 Maximum 214563 =MAX(B4:B53)
10 169682
11 155044
12 149304 IQR = 37689.5 =E8-E6
13 197847
14 167581 Inner fence
15 174482 lower value = 98797.75 =E6-1.5*E12
16 158967 upper value = 249555.75 =E8+1.5*E12
17 195635
18 214563 Outer fence
19 180784 lower value = 42263.5 =E6-3*E12
20 213945 upper value = 306090 =E8+3*E12
21 172119
22 149081
23 176953
24 206280
25 171425
26 133577
27 140463
28 200177
29 145532
30 176666
31 182937
32 151921
33 198854
34 177031
35 162653
36 207242
37 155428
38 137108
39 163051
40 206433
41 131961
42 192410
43 179011
44 161865
45 181889
46 194856
47 189767
48 200931
49 169584
50 203645
51 123636
52 152005
53 188403
[Figure: Box plot for x; y-axis: Mark; series: Q1, Minimum, Median, Maximum, Q3]
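The fence calculations follow directly from the quartiles: each fence sits a fixed multiple of the IQR beyond Q1 or Q3. The following Python sketch starts from the five-number summary reported in the X2.17 worksheet rather than recomputing it; the `fences` helper is hypothetical and written only for illustration.

```python
def fences(q1, q3, multiplier):
    """Return (lower, upper) fences placed multiplier * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - multiplier * iqr, q3 + multiplier * iqr


# Five-number summary reported in the X2.17 worksheet
minimum, q1, median, q3, maximum = 123636, 155332, 175144, 193021.5, 214563

iqr = q3 - q1                 # counterpart of =E8-E6
inner = fences(q1, q3, 1.5)   # counterpart of =E6-1.5*E12 and =E8+1.5*E12
outer = fences(q1, q3, 3)     # counterpart of =E6-3*E12 and =E8+3*E12

# If the minimum and maximum are inside the inner fences, so is every other value
has_suspected_outliers = minimum < inner[0] or maximum > inner[1]

print("IQR =", iqr)
print("inner fences =", inner)
print("outer fences =", outer)
print("values beyond the inner fences?", has_suspected_outliers)
```

Both the minimum and the maximum lie inside the inner fences, so this data set has no suspected outliers.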
A B C D E F
1 X2.18 Excel Analysis-ToolPak to calculate descriptive statistics
2
3 x Column1
4 1.57
5 1.09 Mean 1.524
6 1.13 Standard Error 0.12191621755355
7 1.49 Median 1.395
8 0.98 Mode 0.76
9 0.76 Standard Deviation 0.77106586236363
10 1.4 Sample Variance 0.59454256410256
11 0.76 Kurtosis 8.79997964083546
12 1.38 Skewness 2.3368695663255
13 1.29 Range 4.62
14 1.59 Minimum 0.27
15 1.73 Maximum 4.89
16 2.31 Sum 60.96
17 1.23 Count 40
18 1.89
19 1.54
20 1.97
21 1.26
22 0.27
23 0.79
24 1.23
25 1.56
26 0.89
27 1.78
28 1.52
29 1.07
30 0.92
31 1.38
32 1.56
33 1.98
34 1.34
35 4.89
36 1.39
37 1.76
38 0.71
39 2.46
40 0.89
41 2.01
42 3.21
43 1.98
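Several rows of the ToolPak output are simple functions of the others, which gives a quick consistency check. The sketch below assumes the summary values are as reported above and recomputes the mean, standard error, sample variance and range from them; the variable names are illustrative.

```python
from math import sqrt

# Summary values reported in the X2.18 ToolPak output
sample_sd = 0.77106586236363
count = 40
total = 60.96
minimum, maximum = 0.27, 4.89

mean = total / count                       # Mean = Sum / Count
standard_error = sample_sd / sqrt(count)   # Standard Error = s / sqrt(n)
sample_variance = sample_sd ** 2           # Sample Variance = s squared
value_range = maximum - minimum            # Range = Maximum - Minimum

print(f"mean = {mean:.3f}")
print(f"standard error = {standard_error:.6f}")
print(f"sample variance = {sample_variance:.6f}")
print(f"range = {value_range:.2f}")
```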
A B C D E F
1 X2.19 Excel Analysis-ToolPak to calculate descriptive statistics
2
3 x Column1
4 162726
5 188656 Mean 173838.86
6 165547 Standard Error 3269.63683510532
7 175806 Median 175144
8 190670 Mode #N/A
9 145810 Standard Deviation 23119.823781203
10 169682 Sample Variance 534526251.673878
11 155044 Kurtosis -0.764582375497427
12 149304 Skewness -0.163759884663332
13 197847 Range 90927
14 167581 Minimum 123636
15 174482 Maximum 214563
16 158967 Sum 8691943
17 195635 Count 50
18 214563
19 180784
20 213945
21 172119
22 149081
23 176953
24 206280
25 171425
26 133577
27 140463
28 200177
29 145532
30 176666
31 182937
32 151921
33 198854
34 177031
35 162653
36 207242
37 155428
38 137108
39 163051
40 206433
41 131961
42 192410
43 179011
44 161865
45 181889
46 194856
47 189767
48 200931
49 169584
50 203645
51 123636
52 152005
53 188403
A B C D E F G
1 Exercise 2.20
2
3 Beds Vacant, X f fx f x^2
4 0 4 0 =B4*C4 0 =D4*B4
5 2 8 16 32
6 3 12 36 108
7 5 4 20 100
8 6 2 12 72
9 8 1 8 64
10
11 Σf= 31 =SUM(C4:C9)
12 Σ fx = 92 =SUM(D4:D9)
13 Σ fx^2 = 376 =SUM(F4:F9)
14
15 Mean = 2.967742 =C12/C11
16 Standard deviation = 1.822509 =SQRT(C13/C11-C15^2)
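For a frequency table the mean is Σfx / Σf and the standard deviation formula used in the worksheet is sqrt(Σfx² / Σf - mean²). The Python sketch below assumes the X and f columns are as listed, and also evaluates the definitional form sqrt(Σf(x - mean)² / Σf) to show that the two agree; the second form is added by this sketch and is not in the worksheet.

```python
from math import sqrt

# Columns B and C of Exercise 2.20: beds vacant (x) and frequency (f)
x = [0, 2, 3, 5, 6, 8]
f = [4, 8, 12, 4, 2, 1]

n = sum(f)                                            # Σf
sum_fx = sum(fi * xi for fi, xi in zip(f, x))         # Σfx
sum_fx2 = sum(fi * xi ** 2 for fi, xi in zip(f, x))   # Σfx^2

mean = sum_fx / n
sd_shortcut = sqrt(sum_fx2 / n - mean ** 2)           # form used in the worksheet
sd_definition = sqrt(sum(fi * (xi - mean) ** 2 for fi, xi in zip(f, x)) / n)

print(f"Σf = {n}, Σfx = {sum_fx}, Σfx^2 = {sum_fx2}")
print(f"mean = {mean:.6f}, sd = {sd_shortcut:.6f}")
```

Both forms divide by Σf, so the result is the population-style standard deviation of the tabulated values.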
A B C D E F G H I J K L
1 Exercise 2.26
2
3 Distance f LCB UCB x fx f x^2
4 57-59 9 56.5 59.5 58 =(D4+E4)/2 522 =C4*F4 30276 =H4*F4
5 60-62 10 59.5 62.5 61 610 37210
6 63-65 18 62.5 65.5 64 1152 73728
7 66-68 42 65.5 68.5 67 2814 188538
8 69-71 27 68.5 71.5 70 1890 132300
9 72-74 11 71.5 74.5 73 803 58619
10 75-77 5 74.5 77.5 76 =(D10+E10)/2 380 =C10*F10 28880 =H10*F10
11
12 Range = 21 =E10-D4
13
14 Σf= 122 =SUM(C4:C10)
15 Σ fx = 8171 =SUM(H4:H10)
16 Σ f x^2 = 549551 =SUM(J4:J10)
17
18 Mean = 66.97541 =C15/C14
19 Standard deviation = 4.33715 =SQRT(C16/C14-C18^2)
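With grouped data each class is represented by its midpoint, found from the class boundaries. The sketch below assumes the classes and frequencies are as listed and that classes are whole-number ranges, so each boundary lies 0.5 beyond the stated limit; `class_boundaries` is a hypothetical helper for this sketch.

```python
from math import sqrt

# Class labels and frequencies from columns B and C of Exercise 2.26
classes = ["57-59", "60-62", "63-65", "66-68", "69-71", "72-74", "75-77"]
f = [9, 10, 18, 42, 27, 11, 5]


def class_boundaries(label):
    """Return (LCB, UCB) for a whole-number class such as '57-59'."""
    lower, upper = (int(part) for part in label.split("-"))
    return lower - 0.5, upper + 0.5


boundaries = [class_boundaries(label) for label in classes]
midpoints = [(lcb + ucb) / 2 for lcb, ucb in boundaries]   # column F, x

n = sum(f)
sum_fx = sum(fi * xi for fi, xi in zip(f, midpoints))
sum_fx2 = sum(fi * xi ** 2 for fi, xi in zip(f, midpoints))

mean = sum_fx / n
sd = sqrt(sum_fx2 / n - mean ** 2)
data_range = boundaries[-1][1] - boundaries[0][0]   # last UCB minus first LCB

print("midpoints =", midpoints)
print(f"range = {data_range}, mean = {mean:.5f}, sd = {sd:.5f}")
```

Because every observation in a class is treated as if it sat at the midpoint, the mean and standard deviation are estimates of the values the raw data would give.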
A B C D E F
1 TU2.1
2
3 ID X Summary statistics
4 1 28
5 2 23 mean = 25.41667 =AVERAGE(B4:B15)
6 3 27 median = 25.5 =MEDIAN(B4:B15)
7 4 19
8 5 22
9 6 19
10 7 23
11 8 26
12 9 34
13 10 30
14 11 29
15 12 25
A B C D E F G H I J K L M
1 TU2.2
2
15 12 79
16 13 71
17 14 84
18 15 71
19 16 81
20 17 82
21 18 83
22 19 93
23 20 82
24 21 75
25 22 61
26 23 88
27 24 75
28 25 77
29 26 73
30 27 74
31 28 85
[Figure: box plot; y-axis: English language mark; x-axis: Value; series labels recovered: Median, Maximum, Q3]
A B C D E F G H I J K L M N O P Q R
1 TU2.6
2
3 ID x Summary statistics
4 1 26
5 2 16 Statistic Value
6 3 22 Q1 22.5 =QUARTILE.EXC(B4:B31,1)
7 4 28 Minimum 15 =MIN(B4:B31)
8 5 28 Median 26.5 =MEDIAN(B4:B31)
9 6 15 Maximum 35 =MAX(B4:B31)
10 7 28 Q3 28.75 =QUARTILE.EXC(B4:B31,3)
11 8 26
12 9 28
13 10 29
14 11 25
15 12 26
16 13 24
17 14 31
18 15 29
19 16 28
20 17 29
21 18 32
22 19 19
23 20 26
24 21 28
25 22 32
26 23 22
27 24 35
28 25 22
29 26 25
30 27 27
31 28 18
[Figure: Box plot for MCQ marks; y-axis: MCQ mark; x-axis: Value; series: Q1, Minimum, Median, Maximum, Q3]
A B C D E F
1 TU2.7
2
3 X Summary statistics
4 28
5 23 n= 35 =COUNT(B4:B38)
6 27 mean = 26.08571 =AVERAGE(B4:B38)
7 19 sample standard deviation = 4.272838 =STDEV.S(B4:B38)
8 22 median = 26 =MEDIAN(B4:B38)
9 19 first quartile = 23 =QUARTILE.EXC(B4:B38,1)
10 23 third quartile = 30 =QUARTILE.EXC(B4:B38,3)
11 26 sample skewness = -0.08423 =SKEW(B4:B38)
12 34 sample kurtosis = -0.7794 =KURT(B4:B38)
13 30
14 29
15 25
16 28
17 31
18 27
19 25
20 28
21 22
22 19
23 25
24 24
25 23
26 30
27 26
28 32
29 33
30 24
31 26
32 18
33 30
34 23
35 31
36 32
37 29
38 22
A B C D E F G H I J K L M N O P Q R S
1 TU2.8
2
3 ID x Summary statistics
4 1 43
5 2 39 mean = 37.0625 =AVERAGE(B4:B51)
6 3 31 standard deviation = 7.022008 =STDEV.P(B4:B51)
7 4 34 median = 37.5 =MEDIAN(B4:B51)
8 5 37
9 6 36 Five-number summary
10 7 34
11 8 44 Statistic Value
12 9 29 Q1 31.25 =QUARTILE.EXC(B4:B51,1)
13 10 31 Minimum 25 =MIN(B4:B51)
14 11 44 Median 37.5 =MEDIAN(B4:B51)
15 12 38 Maximum 53 =MAX(B4:B51)
16 13 34 Q3 42.75 =QUARTILE.EXC(B4:B51,3)
17 14 53
18 15 29
19 16 40
20 17 27
21 18 53
22 19 43
23 20 38
24 21 35
25 22 28
26 23 25
27 24 29
28 25 46
29 26 31
30 27 43
31 28 38
32 29 35
33 30 25
34 31 41
35 32 46
36 33 32
37 34 38
38 35 39
39 36 32
40 37 33
41 38 42
42 39 27
43 40 45
44 41 26
45 42 50
46 43 38
47 44 37
48 45 36
49 46 46
50 47 41
51 48 38
[Figure: Box plot for sales data; y-axis: Sales data; x-axis: Value; series: Q1, Minimum, Median, Maximum, Q3]
A B C D E F G H I J K L M N O P Q R S
1 TU2.9
2
3 ID x Summary statistics
4 1 43
5 2 39 mean = 36.03125 =AVERAGE(B4:B35)
6 3 31 standard deviation = 7.879896791 =STDEV.P(B4:B35)
7 4 34 median = 35.5 =MEDIAN(B4:B35)
8 5 37
9 6 36 Five-number summary
10 7 34
11 8 44 Statistic Value
12 9 29 Q1 29.5 =QUARTILE.EXC(B4:B35,1)
13 10 31 Minimum 18 =MIN(B4:B35)
14 11 44 Median 35.5 =MEDIAN(B4:B35)
15 12 38 Maximum 53 =MAX(B4:B35)
16 13 34 Q3 43 =QUARTILE.EXC(B4:B35,3)
17 14 53
18 15 29
19 16 40
20 17 27
21 18 53
22 19 43
23 20 38
24 21 35
25 22 28
26 23 25
27 24 29
28 25 46
29 26 31
30 27 43
31 28 38
32 29 35
33 30 25
34 31 43
35 32 18
[Figure: Box plot for the number of deliveries; y-axis: Deliveries; x-axis: Value; series: Q1, Minimum, Median, Maximum, Q3]