0% found this document useful (0 votes)

9 views9 pages

Stat 1010 Guiding Answers

The document contains homework solutions for a STAT 1010 course, focusing on data analysis and interpretation. It discusses the relationship between disposable income and Best Buy's sales, sales data by day of the week, the impact of outliers on statistics, customer reactions to a new phone, and vehicle purchase preferences related to weather. Each section includes questions and detailed answers, showcasing various statistical concepts and analyses.

Uploaded by

Beryl Malaika Jiotsa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views9 pages

Stat 1010 Guiding Answers

Uploaded by

Beryl Malaika Jiotsa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

STAT 1010 Homework Solutions

Fall 2024

1 Chapter 2
Data don’t always arrive in the most convenient form. You may have the data, but they’re not
in the same place or the same format. Unless you have data organized in a common data table,
you’ll find it hard or impossible to get the answers you need.
For this exercise, you must prepare the data that are needed to explore the relationship between
the sales of a company, retailer Best Buy, and the health of the economy. Is there a relationship
between the amount of money available to be spent, called the disposable income, and the net
sales of retailer Best Buy?
The economic data come from the online repository hosted by the Federal Reserve Bank of St.
Louis. The data are collected monthly, are reported in the number of billions of dollars (at an
annual rate), and date back to 1959. The data for 2010 are as shown in the following table:

Month Disp Income

2010/01/01 $11, 041.1
2010/02/01 $11, 023.0
2010/03/01 $11, 060.3
2010/04/01 $11, 141.1
2010/05/01 $11, 220.6
2010/06/01 $11, 231.2
2010/07/01 $11, 253.9
2010/08/01 $11, 304.7
2010/09/01 $11, 301.3
2010/10/01 $11, 355.5
2010/11/01 $11, 407.2
2010/12/01 $11, 514.5

The data for Best Buy come from another source. Compustat maintains a database of company
information gleaned from reports that are required of all publicly traded firms. For a company to
have its stock bought and sold, it must report data such as these quarterly gross profits, given in
millions of dollars. The company data extend back to 2005 (only one year is shown).

1
Year Quarter Gross Profits
2010 1 $3, 036.00
2010 2 $3, 156.00
2010 3 $3, 233.00
2010 4 $4, 214.00

(a) Explain why it would it be useful to merge these two data tables. What questions do you
think would be interesting to answer based on the merged information?

Answer: It would be useful to merge the two datasets to explore the relationship between
the amount of money available to be spent, called the disposable income, and the net
sales of Best Buy, captured by its quarterly gross profits. An interesting question to be
asked is whether there is an association between these two variables in the sense that one
of them influences the other or not.

(b) Describe the difference in interpretation of a row in the two tables. Do the tables have a
common frequency?

Answer: In the first table, the rows indicate monthly disposable income (in billion
dollars) for 12 months of 2010. In the second table, the rows indicate quarterly gross
profits (in million dollars) of Best Buy for the 4 quarters of 2010. Since the first table
records monthly data and the second table records quarterly data, these two tables do
not have a common frequency.

(c) The separate data tables each have a numerical column of gross profits or disposable income.
Are the units of these comparable, or should they be expressed with common scales?

Answer: In the first table, the numerical column of monthly disposable income is ex-
pressed in billion dollars, whereas in the second table, the numerical column of quarterly
gross profits is expressed in million dollars. In order to merge the two tables it would be
better to have a common scale.

(d) What should you do if you want to arrange the data in a table that has a quarterly time fre-
quency? Can you copy the data columns directly, or do you have to perform some aggregation
or recording first?

Answer: In order to arrange the data in a table having a quarterly frequency, we have
to aggregate the monthly disposable income data into 4 quarters of 2010. For the gross
profit data, since it is already in quarterly frequency, we can directly copy the numerical
column from the second table.

Page 2
(e) Suggest improved names for the columns in the merged data table. How do you want to
represent the information about the date?

Answer: In the merged data table, we can have column names: Year, Quarter, Dispos-
able Income (in billion dollars) and Best Buy’s Gross Profit (in billion dollars).

(f) Show the merged data table for 2010.

Answer:

Disposable Income Best Buy’s Gross Profit

Year Quarter
(in billion dollars) (in billion dollars)
2010 1 33124.4 3.036
2010 2 33592.9 3.156
2010 3 33859.9 3.233
2010 4 34277.2 4.214

(g) With the data merged, what annual shopping ritual becomes apparent?

Answer: Even while disposable income does rise progressively each quarter, Best Buy’s
gross earnings climbed by more than a billion dollars in the fourth quarter compared to
the preceding quarters. Most likely, Black Friday and holiday shopping are the reasons
for this.

0.5 marks for signing the work, 0.5 marks for submitting a pdf, 0.5 marks for each question but
part (f), 1 mark for part (f).

2 Chapter 3
• 2 marks for completing a submission.

• 3 marks for proper representation of data.

• 2 marks for animation (parts of figure appearing one at a time).

• 2 marks for using customized colours (not using the default colours).

• 1 mark for signing the work.

Page 3
3 Chapter 4
The following table summarizes sales by day of the week at a convenience store operated by a
major domestic gasoline retailer. The data cover about one year, with 52 values for each day of
the week.

Mean ($) Std.Dev. ($)

Mon 2, 824.00 575.587
Tue 2, 768.85 314.924
Wed 3, 181.32 494.785
Thur 3, 086.00 712.135
Fri 3, 100.82 415.291
Sat 4, 199.33 632.983
Sun 3, 807.27 865.858

(a) Which consecutive two-day period produces the highest total level of sales during the year?

Answer: The sale totals for each consecutive two-day period are approximately
5592, 5949, 6267, 6186, 7299, 8006. The two-day period with the largest sale totals are
Saturday and Sunday, corresponding to the weekend – as expected.

(b) Do the distributions of the sales data grouped by day (as summarized here) overlap, or are
the seven groups relatively distinct?

Answer: The distributions overlap. When ordering the means from smallest to largest
and looking at the differences, ordering their differences from smallest to largest yields
14, 56, 81, 262, 392, 626. Meanwhile, the sorted standard deviations from smallest to
largest are 314, 393, 415, 575, 632, 712, 865. Most of the differences in means are less than
the smallest standard deviation, and are all less than the largest standard deviation. We
would therefore expect them to overlap if the empirical rule holds.
We can take a quick sanity check by plotting seven normal distributions with the means
and standard deviations of the sales data on Desmos. This confirms the overlap.

Figure 1: Overlapping distributions of normal distributions with the means and standard
deviations given by the sales data.

Page 4
(c) These data summarize sales over 52 × 7 = 364 consecutive days. With that in mind, what
important aspect of these data is hidden by this table?

Answer: This aggregated data hides trends and other seasonal patterns. The sales take
place over time, and as such the sales on one day are not necessarily independent of the
sales on another. For instance, sales by the major domestic gasoline retailer may be on
an uptrend or downtrend, and aggregating the data by day of week obscures this. There
may also be additional important seasonal variation in the data – perhaps sales pick up
during the summer and wane during the winter.

4 Chapter 4 and Empirical Rule

Beatles Outliers affect more than the statistics that measure the center of a distribution. Outliers
also affect statistics that measure the spread of the distribution. Use the sizes of the songs in
Exercise 55 for this exercise.

(a) Find the standard deviation and interquartile range of the sizes of the songs (in megabytes).

Answer: The standard deviation is 0.9503484, while the IQR is 0.8076458.

(b) Which of these summary statistics is most affected by the presence of the outlier? How do
you know?

Answer: The IQR is not as affected by the presence of the outlier. It is based off
quantiles, which are more robust to outliers than the standard deviation, which is based
off the mean. We can confirm this by excluding the most extreme outlier from the
data and finding the mean and standard deviation without it, inspired by the following
problem. The standard deviation without the song is 0.5777002, while the IQR without
it is 0.7606113. Only the standard deviation changes by much.

(c) Exclude the most extreme outlier from the data and find the mean and SD without this song.
Which summary changes more when the outlier is excluded: the mean or SD?

Answer: The mean including the outlier is 2.734379, while the mean without it is
2.587528. The standard deviation without the outlier is 0.5777002. The mean changes
by 0.146851 while the standard deviation changes by 0.3726481. The standard deviation
changes by more than the mean.
This question would have made more sense if the question had asked to compare the
standard deviation and IQR. We will accept answers for this as well.

Page 5
(d) Create a histogram of the data and describe its shape.

Answer:

Figure 2: Histogram of Beatles data. The size and time columns are right-skewed with
a single outlier, and are very correlated. Most Beatles songs are around 2-4 MB and
between 2-4 minutes, but Hey Jude is the exception at around 7 minutes long. The year
column is relatively uniformly distributed and bounded between 1962 and 1970, apart
from two peaks in 1964 and 1969.

From the introduction, one should have used the size of the songs in the exercise for the
histogram. Given that it was not clear and there were many questions about this in office
hours, we will also accept credit for providing a histogram of at least one column that
contains numerical data. The point breakdown should be as follows:

• 3 points for providing a histogram of at least one of Size, Time or Year.

• 1 point for correctly labeled x-axis.

• 1 point for correctly labeled y-axis.

• 1 point for legible ticks on the x-axis that do not overlap.

• 1 point for title.

• 1 point for choosing a reasonable number of histogram bins.

• 1 point for noting the outlier in either size or time, or for noting that the number
of years is bounded in an interval between 1962 to 1970.

• 1 point for noting the right-skew in the size and time histograms, or for noting the
lack of right-skew in the year histogram.

Page 6
5 Chapter 5
To gauge the reactions of possible customers, the manufacturer of a new type of cellular telephone
displayed the product at a kiosk in a busy shopping mall. The following table summarizes the
results for the customers who stopped to look at the phone:

Reaction Male Female

Favorable 36 18
Ambivalent 42 7
Unfavorable 29 9

(a) Is the reaction to the new phone associated with the sex of the customer? How strong is the
association?

Answer: In the mosaic plot on the right below, we see that the proportion of different
reactions are different for the men and the women. Thus according to the mosaic plot
of the data, the reaction seems to be associated with the sex of the customer. However,
the association does not seem to be very strong.

(b) How should the company use the information from this study when marketing its new prod-
uct?

Answer: From the study, it is observed that among the men who responded to the sur-
vey, most of the reactions were ambivalent, and among the women who responded, more
than half of the reactions were favorable. It would be better to use different marketing
strategies for men and women. For the men, the company should focus on turning the
ambivalent reactions into favorable ones as ”ambivalent male” customers have the highest
frequency. Whereas for women, the company should focus on meeting the expectations
of the ”favorable female” customers.

(c) Can you think of an underlying lurking variable that might complicate the relationship shown
here? Justify your answer.

Answer: An underlying lurking variable in this study can be the age of the customers.
It is possible that the males who responded to the survey were mostly elderly people
which may be a possible explanation for a lot of ambivalent reactions from the males.
Whereas it is possible that the females surveyed were younger people having a favorable
approach towards new technology. Another possible underlying lurking variable can be
the time of the day the survey is conducted. Stratifying the data using these lurking
variables may lead to phenomenon like Simpson’s Paradox.

Page 7
8 marks for the mosaic plot. A correct mosaic plot of the reactions of the customers versus
their gender is shown below.

Figure 3: Mosaic plot for Chapter 5.

6 Chapter 6
Drive Preferences These data give the percentage of new vehicles bought with four-wheel drive,
state by state in the continental United States in 2014. The data include the average temperature
in that state in January.

(a) Do you expect the correlation between these variables to be positive or negative? Explain
your choice.

Answer: We expect the correlation between the two variables to be negative. Four-wheel
drive vehicles are very helpful in especially poor winter weather, when the temperature
is low and conditions are poor. As such, as the temperature in January decreases, we
would expect the percentage of vehicles bought with four-wheel drive to increase.

(b) Draw the associated scatterplot. Does the direction of the association match what you
expected to find?

Answer:

Page 8
Figure 4: Scatterplot of percentage of new vehicles with four-wheel drive against the
average temperature in January, where each datapoint represents a state within the
continental United States. The direction of the association matches what we expect.

Answer: It is lowest in region 11, where 11% of new vehicles have four-wheel drive and
the average temperature in January is 59 degrees Fahrenheit. It is highest in regions 36
and 43, where 76% of new vehicles have four-wheel drive and the average temperatures
in January are 20.8 and 7.58 degrees Fahrenheit respectively.

(d) Find the correlation between the variables. Is it weaker or stronger than you expected?

Answer: The correlation between the two variables is -0.8315649. This is stronger than
what one would expect, as the temperature in winter is only one of many factors (price,
availability, resale value, etc.) that one would consider when buying a car.

Page 9

QA Ken Black All Chapters Solution
88% (129)
QA Ken Black All Chapters Solution
1,244 pages
Ken Black QA All Odd No Chapter Solution
83% (6)
Ken Black QA All Odd No Chapter Solution
919 pages
Chapter
No ratings yet
Chapter
20 pages
291 Practice Midterms and Solutions
100% (2)
291 Practice Midterms and Solutions
116 pages
Appendix C: Answers: Answers To Odd-Numbered Chapter Exercises
No ratings yet
Appendix C: Answers: Answers To Odd-Numbered Chapter Exercises
9 pages
11 Questions BBA (Hons) (19 Pages)
No ratings yet
11 Questions BBA (Hons) (19 Pages)
18 pages
Multiple Choice Questions Business Statistics
No ratings yet
Multiple Choice Questions Business Statistics
60 pages
Cost Control and Cost Reduction
100% (1)
Cost Control and Cost Reduction
71 pages
CH 2
83% (6)
CH 2
30 pages
Statictics and Measures of Central Tendency
80% (5)
Statictics and Measures of Central Tendency
46 pages
Boam Bitch
100% (1)
Boam Bitch
67 pages
QNT 275 Enitre Course (2018 New)
0% (2)
QNT 275 Enitre Course (2018 New)
12 pages
4b-Assignment of Displaying & Exploring Data
100% (1)
4b-Assignment of Displaying & Exploring Data
6 pages
Assignment Questions
0% (2)
Assignment Questions
5 pages
Solution Manual For Statistics Data Analysis and Decision Modeling 5th Edition Evans 0132744287 9780132744287
100% (49)
Solution Manual For Statistics Data Analysis and Decision Modeling 5th Edition Evans 0132744287 9780132744287
7 pages
Mas202 2022
No ratings yet
Mas202 2022
53 pages
Questions by Topics S1
No ratings yet
Questions by Topics S1
36 pages
Module 4 (Data Management) - Math 101
No ratings yet
Module 4 (Data Management) - Math 101
8 pages
Mi Beard Trimmer Invoice
No ratings yet
Mi Beard Trimmer Invoice
1 page
Mock Exam Midterm Statistics I
No ratings yet
Mock Exam Midterm Statistics I
24 pages
Example Sheet 1
100% (1)
Example Sheet 1
5 pages
Worksheet 1 GRADE 12
No ratings yet
Worksheet 1 GRADE 12
2 pages
Lectures and Notes MATH 212 (Part 1)
No ratings yet
Lectures and Notes MATH 212 (Part 1)
8 pages
Qa Ken Black All Chapters Solution PDF
No ratings yet
Qa Ken Black All Chapters Solution PDF
1,244 pages
Chapater 1, 2 and 3 Fs
No ratings yet
Chapater 1, 2 and 3 Fs
80 pages
Profit and Loss: TYPE-I: Questions Based On The Basic Concept of C.P. & S.P. and Profit & Loss .........
No ratings yet
Profit and Loss: TYPE-I: Questions Based On The Basic Concept of C.P. & S.P. and Profit & Loss .........
8 pages
Gilette Case - V3
100% (3)
Gilette Case - V3
23 pages
Lecture 2
No ratings yet
Lecture 2
61 pages
Mas202 - 2022
No ratings yet
Mas202 - 2022
53 pages
False - Cold Canvass Method
100% (1)
False - Cold Canvass Method
7 pages
Chapter 4 Statistics and Data Measures of Variation
No ratings yet
Chapter 4 Statistics and Data Measures of Variation
15 pages
Review of Basic Statistical Concepts Hanke
No ratings yet
Review of Basic Statistical Concepts Hanke
28 pages
Business Math and Statistics
No ratings yet
Business Math and Statistics
25 pages
11 Economics B Annual Exam 2024 25 MS
No ratings yet
11 Economics B Annual Exam 2024 25 MS
19 pages
Quantitative Methods - Reading 8
No ratings yet
Quantitative Methods - Reading 8
49 pages
DA Answer-Key
No ratings yet
DA Answer-Key
12 pages
Bistat
No ratings yet
Bistat
19 pages
Ba Lecture 2
No ratings yet
Ba Lecture 2
54 pages
In Class Exercises T1 With Final Solutions
No ratings yet
In Class Exercises T1 With Final Solutions
11 pages
Student Study Guide and Solutions Chapter 1 FA14 PDF
No ratings yet
Student Study Guide and Solutions Chapter 1 FA14 PDF
7 pages
Use of Statistics in Data Science
No ratings yet
Use of Statistics in Data Science
11 pages
UTS Business Statistics: Desciptive Stats
No ratings yet
UTS Business Statistics: Desciptive Stats
7 pages
FT243037 - Statistical Methods For Decision Making Assignment-01
No ratings yet
FT243037 - Statistical Methods For Decision Making Assignment-01
11 pages
TC3AD13
No ratings yet
TC3AD13
9 pages
Ken Black QA 5th Chapter 1 Solution
No ratings yet
Ken Black QA 5th Chapter 1 Solution
6 pages
Statistics
No ratings yet
Statistics
52 pages
Thông Kê
No ratings yet
Thông Kê
13 pages
Stat - Ques. Bank
No ratings yet
Stat - Ques. Bank
10 pages
Descriptive Statistics: Tabular and Graphical Displays: Nguyen Thi Lien
No ratings yet
Descriptive Statistics: Tabular and Graphical Displays: Nguyen Thi Lien
21 pages
Your Results For: "Multiple Choice": Site Title: Book Title: Book Author: Location On Site: Date/Time Submitted
No ratings yet
Your Results For: "Multiple Choice": Site Title: Book Title: Book Author: Location On Site: Date/Time Submitted
40 pages
ISOM2700 FA21 - Quiz - 1 - Sol - Ch1 - Ch7
No ratings yet
ISOM2700 FA21 - Quiz - 1 - Sol - Ch1 - Ch7
9 pages
Asa QB
No ratings yet
Asa QB
5 pages
QM Questons
No ratings yet
QM Questons
6 pages
Answer of Assignment of Displaying & Exploring Data
No ratings yet
Answer of Assignment of Displaying & Exploring Data
7 pages
ws8-6 Measures of Spread
No ratings yet
ws8-6 Measures of Spread
5 pages
Calculating For Descriptive Statistics Jazmine Ibarra
No ratings yet
Calculating For Descriptive Statistics Jazmine Ibarra
4 pages
Year 10 Single Variable and Bivariate Data
No ratings yet
Year 10 Single Variable and Bivariate Data
8 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
4 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Assessment 1
No ratings yet
Assessment 1
4 pages
11 4variationswithinadataset
No ratings yet
11 4variationswithinadataset
4 pages
Midterm Self Tests
No ratings yet
Midterm Self Tests
4 pages
Porters Five Forces Group No-12
No ratings yet
Porters Five Forces Group No-12
4 pages
Name: Score: Course & Section: Date
No ratings yet
Name: Score: Course & Section: Date
11 pages
Qa - Installment Sales
0% (1)
Qa - Installment Sales
3 pages
Market Structure
100% (2)
Market Structure
65 pages
Pressure Washing Business Plan
No ratings yet
Pressure Washing Business Plan
34 pages
Chapter 3 Industrial Buyer Behavior BB
No ratings yet
Chapter 3 Industrial Buyer Behavior BB
8 pages
ACI INTER Report.-Word
No ratings yet
ACI INTER Report.-Word
49 pages
Ama Set 36
No ratings yet
Ama Set 36
6 pages
PFMT - Pidilite - Apo 7 - Sec F
No ratings yet
PFMT - Pidilite - Apo 7 - Sec F
19 pages
Project Report On Bisleri-Final 1
No ratings yet
Project Report On Bisleri-Final 1
93 pages
3rd Sem Cost Accounting Apr 2023
No ratings yet
3rd Sem Cost Accounting Apr 2023
8 pages
LTCC Consultation
No ratings yet
LTCC Consultation
2 pages
L 13 Suply Contract
No ratings yet
L 13 Suply Contract
17 pages
Branch Accounting: by Dr. Pranabananda Rath Consultant & Visiting Faculty
No ratings yet
Branch Accounting: by Dr. Pranabananda Rath Consultant & Visiting Faculty
10 pages
Quiz 2 (Managerial Accounting) : First: TRUE/FALSE
No ratings yet
Quiz 2 (Managerial Accounting) : First: TRUE/FALSE
4 pages
Malaysian TP Guidelines 2012
No ratings yet
Malaysian TP Guidelines 2012
100 pages
Cera 156
No ratings yet
Cera 156
1 page
Year 7
No ratings yet
Year 7
44 pages
CMA - CMA1 - Book Short Term Items 1
No ratings yet
CMA - CMA1 - Book Short Term Items 1
24 pages
Group 1 STOCK PITCH
No ratings yet
Group 1 STOCK PITCH
19 pages
Chapter 3 Personal Selling
No ratings yet
Chapter 3 Personal Selling
15 pages
MKT460.6 Esquire Group Case Study Group Updated
No ratings yet
MKT460.6 Esquire Group Case Study Group Updated
10 pages
INV1
No ratings yet
INV1
1 page
Huda Radiowala
No ratings yet
Huda Radiowala
1 page
Toilet Paper History: How America Convinced The World To Wipe
No ratings yet
Toilet Paper History: How America Convinced The World To Wipe
5 pages
Economic and Business Forecasting: Analyzing and Interpreting Econometric Results
From Everand
Economic and Business Forecasting: Analyzing and Interpreting Econometric Results
John E. Silvia
No ratings yet
Why Moats Matter: The Morningstar Approach to Stock Investing
From Everand
Why Moats Matter: The Morningstar Approach to Stock Investing
Heather Brilliant
4/5 (3)
Get Rich with Dividends: A Proven System for Earning Double-Digit Returns
From Everand
Get Rich with Dividends: A Proven System for Earning Double-Digit Returns
Marc Lichtenfeld
No ratings yet

Stat 1010 Guiding Answers

Uploaded by

Stat 1010 Guiding Answers

Uploaded by

STAT 1010 Homework Solutions

Month Disp Income

(f) Show the merged data table for 2010.

Disposable Income Best Buy’s Gross Profit

• 3 marks for proper representation of data.

• 2 marks for animation (parts of figure appearing one at a time).

• 1 mark for signing the work.

Mean ($) Std.Dev. ($)

4 Chapter 4 and Empirical Rule

Answer: The standard deviation is 0.9503484, while the IQR is 0.8076458.

• 3 points for providing a histogram of at least one of Size, Time or Year.

• 1 point for correctly labeled x-axis.

• 1 point for correctly labeled y-axis.

• 1 point for legible ticks on the x-axis that do not overlap.

• 1 point for title.

• 1 point for choosing a reasonable number of histogram bins.

Reaction Male Female

Figure 3: Mosaic plot for Chapter 5.

You might also like