STAT - 101 - TUTORIAL - 4 Solutions
STAT - 101 - TUTORIAL - 4 Solutions
gosego.mpaesele
March 2025
Question 1
i ) For class section A, median = 60, Q1 = 52, Q3 = 68. For class section B,
median = 76, Q1 = 60, Q3 = 84. For class section C, median = 58, Q1 =
50, Q3 = 80.
ii ) IQR = Q3 - Q1. For class Section C, IQR = 80 - 50 = 30. A larger IQR
means the data is more spread out. More variability in the data.
iii ) Yes, the potential outliers are located in class section A.
iv ) If the median is closer to Q1 (lower end of the box) then the data is
right-skewed.
v ) Class A is normally distributed, the median is centered inside the box.
Class B is left-skewed,the median is closer to Q3. Class C is right-skewed,
the median is closer to Q1.
Question 2
Given dataset:
45, 50, 52, 55, 55, 60, 62, 65, 65, 70, 75, 78, 80, 85, 90
1
45 + 50 + 52 + 55 + 55 + 60 + 62 + 65 + 65 + 70 + 75 + 78 + 80 + 85 + 90 = 987
Median
The median is the middle value in the ordered dataset. The dataset is already
ordered:
45, 50, 52, 55, 55, 60, 62, 65, 65, 70, 75, 78, 80, 85, 90
Since there are 15 data points, the median is the 8th value, which is:
Median = 65
Mode
The mode is the value that appears most frequently. In this case, 55 and 65
both appear twice, so the dataset is bimodal with:
Mode = 55 and 65
Question 3
Given Dataset:
1500, 1700, 1800, 2000, 2200, 2500, 2700, 3000, 3200, 3500
2
i) First Quartile (Q1), Median (Q2), and Third
Quartile (Q3)
First Quartile (Q1)
The first quartile is the median of the lower half of the data:
Q1 = 1800
Median (Q2)
The median is the middle value of the dataset:
2200 + 2500
Q2 = = 2350
2
Q3 = 3000
• Q3 (3000): 75% of employees earn less than or equal to 3000 USD per
month.
• 40th Percentile (2080): 40% of employees earn less than or equal to
2080 USD per month.
3
Figure 1: Box and Whisker Plot for Monthly Salaries
4
Question 4
Given the dataset:
10, 12, 15, 18, 20, 22, 25, 28, 30, 35, 40
• First Quartile (Q1): The median of the lower half of the data is 15.
• Third Quartile (Q3): The median of the upper half of the data is 30.
Since Q3 is farther from the median than Q1, the dataset is positively
skewed (skewed to the right).
IQR = Q3 − Q1 = 30 − 15 = 15
Thus, the IQR = 15, indicating the spread of the middle 50% of the data.
5
Question 5
Exam Score Distribution Analysis
Given the following class intervals and frequencies:
6
Class Interval Midpoint (x) Frequency (f) f × x
40 − 49 44.5 5 222.5
50 − 59 54.5 8 436
60 − 69 64.5 12 774
70 − 79 74.5 15 1117.5
80 − 89 84.5 10 845
90 − 99 94.5 5 472.5
Total Frequency = 5 + 8 + 12 + 15 + 10 + 5 = 55
7
v.) Calculate the Mode
We use the following formula to calculate the mode:
f1 − f0
Mode = L + ×h
2f1 − f0 − f2
Where: - L = 69.5 (Lower boundary of the modal class), - f1 = 15 (Fre-
quency of the modal class), - f0 = 12 (Frequency of the class before the modal
class), - f2 = 10 (Frequency of the class after the modal class), - h = 10 (Class
width).
Substituting the values:
15 − 12 3
Mode = 69.5+ ×10 = 69.5+ ×10 = 69.5+3.75 = 73.25
2(15) − 12 − 10 8
8
Question 6
Analysis of Household Income in Three Cities
The average household income in each city is provided below:
i.) Compute the Mean and Median Income for Each City
Mean Calculation
The formula for the mean is:
P
Income
Mean =
n
Where n is the number of data points.
City A:
Sum of incomes = 2500 + 2700 + 3000 + 3200 + 3500 + 4000 + 5000 = 24300
Sum of incomes = 2000 + 2200 + 2300 + 2500 + 2600 + 2800 + 2900 = 17300
9
Median Calculation
To compute the median, we first arrange the data in ascending order and identify
the middle value.
City A:
Since there are 7 values, the median is the 4th value: Median of City A = 3200
City B:
Since there are 7 values, the median is the 4th value: Median of City B = 2500
City C:
Since there are 6 values, the median is the average of the 3rd and 4th values:
1800 + 2000
Median of City C = = 1900
2
ii.) Which City Should the Company Choose for its New
Branch?
Based on the computed mean and median incomes, the company should consider
City A for the following reasons:
• City A has the highest mean income ($3471.43) and median income
($3200), suggesting a wealthier population, which is ideal for business.
• City B has a lower mean income ($2471.43) and median income ($2500),
indicating a less affluent population.
• City C has a high income outlier ($10000), but the median income ($1900)
is much lower, suggesting greater income inequality and a lower average
income.
Thus, City A is the most favorable choice for the company to open its new
branch.
10