0% found this document useful (0 votes)
18 views10 pages

STAT - 101 - TUTORIAL - 4 Solutions

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views10 pages

STAT - 101 - TUTORIAL - 4 Solutions

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

STAT 101 - TUTORIAL 4

gosego.mpaesele
March 2025

Question 1
i ) For class section A, median = 60, Q1 = 52, Q3 = 68. For class section B,
median = 76, Q1 = 60, Q3 = 84. For class section C, median = 58, Q1 =
50, Q3 = 80.
ii ) IQR = Q3 - Q1. For class Section C, IQR = 80 - 50 = 30. A larger IQR
means the data is more spread out. More variability in the data.
iii ) Yes, the potential outliers are located in class section A.

iv ) If the median is closer to Q1 (lower end of the box) then the data is
right-skewed.
v ) Class A is normally distributed, the median is centered inside the box.
Class B is left-skewed,the median is closer to Q3. Class C is right-skewed,
the median is closer to Q1.

Question 2
Given dataset:

45, 50, 52, 55, 55, 60, 62, 65, 65, 70, 75, 78, 80, 85, 90

i) Mean, Median, and Mode


Mean
The mean is calculated as the sum of all data points divided by the number of
data points:
Pn
xi
Mean = i=1
n
The sum of the data points is:

1
45 + 50 + 52 + 55 + 55 + 60 + 62 + 65 + 65 + 70 + 75 + 78 + 80 + 85 + 90 = 987

Since there are 15 data points, the mean is:


987
Mean = = 65.8
15

Median
The median is the middle value in the ordered dataset. The dataset is already
ordered:

45, 50, 52, 55, 55, 60, 62, 65, 65, 70, 75, 78, 80, 85, 90
Since there are 15 data points, the median is the 8th value, which is:

Median = 65

Mode
The mode is the value that appears most frequently. In this case, 55 and 65
both appear twice, so the dataset is bimodal with:

Mode = 55 and 65

ii) Best Measure of Central Tendency


The mean (65.8) and the median (65) are close, while the mode shows a bimodal
distribution (55 and 65). Since the dataset is fairly symmetrical and has no
extreme outliers, the median is the best measure of central tendency. It is less
affected by the spread of data and provides a more reliable central value in this
case.

Question 3
Given Dataset:
1500, 1700, 1800, 2000, 2200, 2500, 2700, 3000, 3200, 3500

2
i) First Quartile (Q1), Median (Q2), and Third
Quartile (Q3)
First Quartile (Q1)
The first quartile is the median of the lower half of the data:

Q1 = 1800

Median (Q2)
The median is the middle value of the dataset:
2200 + 2500
Q2 = = 2350
2

Third Quartile (Q3)


The third quartile is the median of the upper half of the data:

Q3 = 3000

ii) 40th Percentile


The 40th percentile is calculated by interpolating between the 4th and 5th data
points:
P40 = 2000 + 0.4 × (2200 − 2000) = 2080

iii) Interpretation of Q1, Q3, and 40th Percentile


• Q1 (1800): 25% of employees earn less than or equal to 1800 USD per
month.

• Q3 (3000): 75% of employees earn less than or equal to 3000 USD per
month.
• 40th Percentile (2080): 40% of employees earn less than or equal to
2080 USD per month.

iv)Box and Whisker Plot for Monthly Salaries


Comment on the Distribution
The box plot shows a relatively symmetrical distribution with no extreme out-
liers. The salaries are concentrated between Q1 and Q3, with a moderate spread
between the minimum and maximum values.

3
Figure 1: Box and Whisker Plot for Monthly Salaries

4
Question 4
Given the dataset:

10, 12, 15, 18, 20, 22, 25, 28, 30, 35, 40

i.) Identifying the minimum, Q1, median (Q2), Q3, and


maximum values
The values are already in ascending order.

• Minimum: The smallest value is 10.


• Maximum: The largest value is 40.
• Median (Q2): Since the dataset has 11 numbers, the median is the 6th
value, which is 22.

• First Quartile (Q1): The median of the lower half of the data is 15.
• Third Quartile (Q3): The median of the upper half of the data is 30.

Thus, the values are:

Minimum = 10, Q1 = 15, Median (Q2) = 22, Q3 = 30, Maximum = 40

ii.) Determining whether the dataset is skewed


To determine if the dataset is skewed, we compare the distances between the
median and the quartiles.

• Distance between Q1 and the median: 22 − 15 = 7.

• Distance between Q3 and the median: 30 − 22 = 8.

Since Q3 is farther from the median than Q1, the dataset is positively
skewed (skewed to the right).

iii.) Interquartile Range (IQR)


The interquartile range (IQR) is the difference between Q3 and Q1:

IQR = Q3 − Q1 = 30 − 15 = 15

Thus, the IQR = 15, indicating the spread of the middle 50% of the data.

5
Question 5
Exam Score Distribution Analysis
Given the following class intervals and frequencies:

Class Interval Frequency


40 − 49 5
50 − 59 8
60 − 69 12
70 − 79 15
80 − 89 10
90 − 99 5

i.) Cumulative Frequency and Class Boundaries


First, we calculate the cumulative frequency and class boundaries.
Cumulative Frequency:

Class Interval Frequency Cumulative Frequency


40 − 49 5 5
50 − 59 8 13
60 − 69 12 25
70 − 79 15 40
80 − 89 10 50
90 − 99 5 55
Class Boundaries:

Class Interval Class Boundaries


40 − 49 39.5 − 49.5
50 − 59 49.5 − 59.5
60 − 69 59.5 − 69.5
70 − 79 69.5 − 79.5
80 − 89 79.5 − 89.5
90 − 99 89.5 − 99.5

ii.) Estimated Mean


To calculate the estimated mean, we find the midpoint of each class interval and
then compute the weighted average using the midpoints and frequencies.

Midpoint of each class interval:

6
Class Interval Midpoint (x) Frequency (f) f × x
40 − 49 44.5 5 222.5
50 − 59 54.5 8 436
60 − 69 64.5 12 774
70 − 79 74.5 15 1117.5
80 − 89 84.5 10 845
90 − 99 94.5 5 472.5

Total Frequency = 5 + 8 + 12 + 15 + 10 + 5 = 55

Sum of(f × x) = 222.5 + 436 + 774 + 1117.5 + 845 + 472.5 = 3867.5


P
(f × x) 3867.5
Estimated Mean = = ≈ 70.3
Total Frequency 55

iii.) Estimate the Median Class and Compute the Approx-


imate Median
To find the median class, we first find half of the total frequency:
55
= 27.5
2
The cumulative frequency closest to 27.5 is 25 (in the class 60 - 69) and 40
(in the class 70 - 79), so the median class is 70 - 79.
Now, using the formula for the median:
!
N
− F
Median = L + 2 ×h
f
Where: - L = 69.5 (Lower boundary of the median class), - N = 55 (Total
frequency), - F = 25 (Cumulative frequency before the median class), - f = 15
(Frequency of the median class), - h = 10 (Class width).
Substituting the values:
 
27.5 − 25
Median = 69.5 + × 10 = 69.5 + 1.67 = 71.17
15
So, the approximate median score is **71.17**.

iv.) Determine the Modal Class and Explain Why It Rep-


resents the Mode
The modal class is the class with the highest frequency. From the table, the
highest frequency is 15 in the class 70 - 79. Thus, the modal class is **70 - 79**.
The mode represents the value that occurs most frequently, and in grouped
data, the modal class is the class interval with the highest frequency.

7
v.) Calculate the Mode
We use the following formula to calculate the mode:
 
f1 − f0
Mode = L + ×h
2f1 − f0 − f2
Where: - L = 69.5 (Lower boundary of the modal class), - f1 = 15 (Fre-
quency of the modal class), - f0 = 12 (Frequency of the class before the modal
class), - f2 = 10 (Frequency of the class after the modal class), - h = 10 (Class
width).
Substituting the values:

   
15 − 12 3
Mode = 69.5+ ×10 = 69.5+ ×10 = 69.5+3.75 = 73.25
2(15) − 12 − 10 8

So, the mode is 73.25.

vi.) Compute Q1, Q2, and Q3


To compute the quartiles Q1 , Q2 , and Q3 , we use the formulas:
1. For Q1 :
!
N
4 −F
Q1 = L1 + ×h
f

2. For Q2 (which is the median):

Q2 = 71.17 (already computed)


3. For Q3 :
!
3N
4 −F
Q3 = L3 + ×h
f
We can calculate these values similarly by identifying the cumulative fre-
quencies corresponding to N4 and 3N
4 .

vii.) Confirm if the Median Score is Equal to Q2


Since the median is defined as Q2 , it is inherently true that the median score is
equal to Q2 . Thus, the answer is:

Yes, the median score is equal to Q2 .

8
Question 6
Analysis of Household Income in Three Cities
The average household income in each city is provided below:

City Household Income (USD)


City A 2500, 2700, 3000, 3200, 3500, 4000, 5000
City B 2000, 2200, 2300, 2500, 2600, 2800, 2900
City C 1500, 1700, 1800, 2000, 2200, 10000

i.) Compute the Mean and Median Income for Each City
Mean Calculation
The formula for the mean is:
P
Income
Mean =
n
Where n is the number of data points.
City A:

Sum of incomes = 2500 + 2700 + 3000 + 3200 + 3500 + 4000 + 5000 = 24300

Number of data points(n) = 7


24300
Mean of City A = = 3471.43
7
City B:

Sum of incomes = 2000 + 2200 + 2300 + 2500 + 2600 + 2800 + 2900 = 17300

Number of data points(n) = 7


17300
Mean of City B = = 2471.43
7
City C:

Sum of incomes = 1500 + 1700 + 1800 + 2000 + 2200 + 10000 = 19300

Number of data points(n) = 6


19300
Mean of City C = = 3216.67
6

9
Median Calculation
To compute the median, we first arrange the data in ascending order and identify
the middle value.
City A:

Ordered incomes: 2500, 2700, 3000, 3200, 3500, 4000, 5000

Since there are 7 values, the median is the 4th value: Median of City A = 3200
City B:

Ordered incomes: 2000, 2200, 2300, 2500, 2600, 2800, 2900

Since there are 7 values, the median is the 4th value: Median of City B = 2500
City C:

Ordered incomes: 1500, 1700, 1800, 2000, 2200, 10000

Since there are 6 values, the median is the average of the 3rd and 4th values:
1800 + 2000
Median of City C = = 1900
2

ii.) Which City Should the Company Choose for its New
Branch?
Based on the computed mean and median incomes, the company should consider
City A for the following reasons:

• City A has the highest mean income ($3471.43) and median income
($3200), suggesting a wealthier population, which is ideal for business.

• City B has a lower mean income ($2471.43) and median income ($2500),
indicating a less affluent population.
• City C has a high income outlier ($10000), but the median income ($1900)
is much lower, suggesting greater income inequality and a lower average
income.

Thus, City A is the most favorable choice for the company to open its new
branch.

10

You might also like