0% found this document useful (0 votes)
81 views

Tutorial 02 PAS

This document contains a tutorial with 8 questions covering various statistical concepts: 1) A study on driving speed and fuel efficiency is analyzed by plotting a scatter diagram and commenting on any relationship between the variables. 2) Household income data by region is analyzed using row percentages, histograms, and column percentages to identify any relationships between region and income level. 3) The performance of two mutual funds over several years is compared to determine which fund performed better. 4) Download data for an online game from the previous and current year is analyzed using measures of center, quartiles, and comparisons to understand changes over time. 5) Delivery time data for two companies is analyzed using range and standard deviation

Uploaded by

Winnie Nguyễn
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
81 views

Tutorial 02 PAS

This document contains a tutorial with 8 questions covering various statistical concepts: 1) A study on driving speed and fuel efficiency is analyzed by plotting a scatter diagram and commenting on any relationship between the variables. 2) Household income data by region is analyzed using row percentages, histograms, and column percentages to identify any relationships between region and income level. 3) The performance of two mutual funds over several years is compared to determine which fund performed better. 4) Download data for an online game from the previous and current year is analyzed using measures of center, quartiles, and comparisons to understand changes over time. 5) Delivery time data for two companies is analyzed using range and standard deviation

Uploaded by

Winnie Nguyễn
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

STA 2022 TUTORIAL 2

TUTORIAL 2

1. Driving Speed and Fuel Efficiency. A study on driving speed (miles per hour) and fuel efficiency
(miles per gallon) for midsize automobiles resulted in the following data:

a. Construct a scatter diagram with driving speed on the horizontal axis and fuel efficiency on the
vertical axis.
b. Comment on any apparent relationship between these two variables.

2. The following crosstabulation shows the number of households (1000s) in each of the four
regions of the United States and the number of households at each income level.

(Total)

21,479 (row total 1)


/26,391
/43,690
/26,057

(value in row 1/ row total 1)


a. Compute the row percentages and identify the percent frequency distributions of income for
households in each region.
b. What percentage of households in the West region have an income level of $50,000 or more?
What percentage of households in the South region have an income level of $50,000 or more?
c. Construct percent frequency histograms for each region of households. Do any relationships
between regions and income level appear to be evident in your findings?
d. Compute the column percentages. What information do the column percentages provide?
e. What percent of households with a household income of $100,000 and over are from the South
region? What percentage of households from the South region have a household income of
$100,000 and over? Why are these two percentages different?

1
STA 2022 TUTORIAL 2

3. Suppose that at the beginning of Year 1 you invested $10,000 in the Stivers mutual fund and
$5000 in the Trippi mutual fund. The value of each investment at the end of each subsequent year
is provided in the table below. Which mutual fund performed better?

$5,000
$10,000

700/5000=14%

4. The creator of a new online multiplayer survival game has been tracking the monthly downloads
of the newest game. The following table shows the monthly downloads (in thousands) for each
month of the current and previous year.

mode: 37
L = (n+1)/2 Mean: sum/10=35.9 L25= (11x25)100 = 2.25
= (11+1)/2 median:37
=6 32 33 35 36 37 37 37 37 37 38 L75= (11x75)/100 = 6.75
L25= (n+1) x 25/100 L= (n+1)/2 = (10+1)/2 = 5.5
mode: 34 =3
mean : sum/11 +34.18
=> Q1=33 Q1=33+0.75(35-33) = 34.5
median: 34. Q3= 35 Q3=37+0.25(37-37) = 37
32 32 33 33 34 34 34
35 35 37 37 L75= (12x75)/100
=9

(34)
a. Compute the mean, median, and mode for the number of downloads in the previous year.
b. Compute the mean, median, and mode for the number of downloads in the current year.
c. Compute the first and third quartiles for downloads in the previous year.
(Q1 & Q3)
d. Compute the first and third quartiles for downloads in the current year.

2
STA 2022 TUTORIAL 2

e. Compare the values calculated in parts a through d for the previous and current
years. What does this tell you about the downloads of the game in the current year
compared to the previous year?

5. The following data were used to construct the histograms of the number of days required to fill
orders for Dawson Supply, Inc., and J.C. Clark Distributors (see Figure 3.2).
X bar = sum/10= 10.3
Dawson Supply Days for Delivery: 11 10 9 10 11 11 10 11 10 10 s^2=
s=
[(11-10.3)^2 + (10-10.3)^2 + (9-10.3)^2 +...+ (10-10.3)^2] /9 =

Clark Distributors Days for Delivery: 8 10 13 7 10 11 10 7 15 12


Use the range and standard deviation to support the previous observation that Dawson Supply
Dawson Clark
provides the more consistent and reliable delivery times. range:
SD
11-9=2 < 15-7=8
lower variability

6. The results of a national survey showed that on average, adults sleep 6.9 hours per night.
Suppose that the standard deviation is 1.2 hours.
a. Use Chebyshev’s theorem to calculate the percentage of individuals who sleep between 4.5 and
9.3 hours.
b. Use Chebyshev’s theorem to calculate the percentage of individuals who sleep between 3.9 and
9.9 hours.
c. Assume that the number of hours of sleep follows a bell-shaped distribution. Use the empirical
rule to calculate the percentage of individuals who sleep between 4.5 and 9.3 hours per day. How
does this result compare to the value that you obtained using Chebyshev’s theorem in part (a)?

7. The Graduate Management Admission Test (GMAT) is a standardized exam used by many
universities as part of the assessment for admission to graduate study in business. The average
GMAT score is 547 (Magoosh website). Assume that GMAT scores are bell-shaped with a
standard deviation of 100.
a. What percentage of GMAT scores are 647 or higher?
b. What percentage of GMAT scores are 747 or higher?
c. What percentage of GMAT scores are between 447 and 547?
d. What percentage of GMAT scores are between 347 and 647?

8. Annual sales, in millions of dollars, for 21 pharmaceutical companies follow.

3
STA 2022 TUTORIAL 2

8408 1374 1872 8879 2459 11413


608 14138 6452 1850 2818 1356
10498 7478 4019 4341 739 2127
3653 5794 8305
a. Provide a five-number summary.
b. Compute the lower and upper limits.
c. Do the data contain any outliers?
d. Johnson & Johnson’s sales are the largest on the list at $14,138 million. Suppose a data entry
error (a transposition) had been made and the sales had been entered as $41,138 million. Would
the method of detecting outliers in part (c) identify this problem and allow for correction of the
data entry error?
e. Show a box plot.

You might also like