Chapter 01 SSM-FINAL
Chapter 01 SSM-FINAL
Section 1.1
1.1 This application is primarily descriptive in nature. The owner wishes to develop a presentation. She
will most likely use charts, graphs, tables and numerical measures to describe her data.
1.3 A bar chart is used whenever you want to display data that has already been categorized while a
histogram is used to display data over a range of values for the factor under consideration. Another
fundamental difference is that there typically are gaps between the bars on a bar chart but there are
no gaps between the bars of a histogram.
1.5 The company could use statistical inference to determine if its parts last longer. Because it is not
possible to examine every part that could be produced the company could examine a randomly
chosen subset of its parts and compare the average life of the subset to the average life of a
randomly chosen subset of the competitor’s parts. By using statistical inference procedures the
company could reach a conclusion about whether its parts last longer or not.
1.7 The appropriate chart in this case is a histogram where the horizontal axis contains the number of
missed days and the height of the bars represent the number of employees who missed each number
of days.
1.9 Student answers will vary depending on the business periodical or newspaper selected and the
article referenced. Some representative examples might include estimates of the number of CEO's
who will vote for a particular candidate, estimates of the percentage increase in wages for factory
workers, estimates of the average dollar advertising expenditures for pharmaceutical companies in a
specific year, and the expected increase in R&D expenditures for the coming quarter.
1.11 As discussed in this section, the pet store would most likely use a written survey or a telephone
survey to collect the customer satisfaction data.
1.13 An experiment is any process that generates data as its outcome. The plan for performing the
experiment in which the variable of interest is defined is referred to as an experimental design. In
the experimental design one or more factors are identified to be changed so that the impact on the
variable of interest can be observed or measured.
1.15
a. Observation would be the most likely method. Observers could be located at various bike
routes and observe the number of riders with and without helmets. This would likely be better
than asking people if they wear a helmet since the popular response might be to say yes even
when they don’t always do so.
b. A telephone survey to gas stations in the state. This could be a cost effective way of getting
data from across the state. The respondent would have the information and be able to provide
the correct price.
c. A written survey of passengers. This could be given out on the plane before the plane lands and
passengers could drop the surveys in a box as they de-plane. This method would likely garner
higher response rates compared to sending the survey to passengers’ mailing address and
asking them to return the completed survey by mail.
1.17 This data could have been collected through a survey. Employees of the USDA could provide
periodic reports of fire ant activity in their region. Also, medical reports could be used to collect
data for those people who required medical attention.
1.19 For retailers technology that scans the product UPC code at checkout makes the collection of data
fast and accurate. Retailers that use such technology can automatically update their inventory
records and develop an extensive collection of customer buying habits. By applying advanced
statistical techniques to the data the retailer can identify relationships among purchases that might
otherwise go unnoticed. Such information could enable retailers to target their advertising or even
rearrange the placement of products in the store to increase sales. Manufacturing firms use bar code
scanning to collect information concerning product availability and product quality. Credit card
purchases are automatically tracked by the retailer and the bankcard company. In this way the credit
card company is able to track your purchases and even alert you to potential fraud if purchases on
your card appear to be unusual. Finally, some companies are using radio frequency identification
(RFID) to track products through their supply chain, so that product delays and inventory problems
can be minimized.
1.21. Student answers will vary. Look for clarity of questions and to see that the issue questions are
designed to gather useful data. Look for appropriate demographic questions.
1.23 Student answers will vary. However, the issue questions should be designed to gather the desired
data regarding customers’ preferences for the use of the space. Demographic questions should
provide data so that the responses can be broken down appropriately so that United Fitness Center
managers can determine which subset of customers have what opinion about this issue. Regarding
questionnaire layout, look at neatness and answer location space. Make sure questions are properly
worded, used reasonable vocabulary, and are not leading questions.
1.25.
a. Because the population is spread over a large geographical area, a cluster random sample could
be selected to reduce travel costs.
b. A stratified random sample would probably be used to keep sample size as small as possible.
c. Most likely a convenience sample would be used since doing a statistical sample would be too
difficult.
1.27 Whenever a descriptive numerical measure such as an average is calculated from the entire
population it is a parameter. The corresponding measure calculated from a subset of the population,
that is to say a sample, is a statistic.
1.29 From a numbered list of all customers who own a certificate of deposit the bank would need to
randomly determine a starting point between 1 and k, where k would be equal to 25000/1000 = 25.
This could be done using a random number table or by having a statistical package or a spreadsheet
generate a random number between 1 and 25. Once this value is determined the bank would select
that numbered customer as the first sampled customer and then select every 25th customer after that
until 100 customers are sampled.
1.31 Values computed from a sample are always considered statistics. In order for a value, such as an
average, to be considered a parameter it must be computed from all items in the population.
1.33 Using Excel, choose the Data tab, select Data Analysis from the Analysis Group , then Random
Number Generation – shown as follows:
Note, the students’ answers may differ since Excel generates different streams of random numbers
each time it is used. Also, if the application requires integer numbers, the Decrease Decimal option
can be used.
1.35 This is a statistic. A poll would be a sample of eligible voters rather than all eligible voters.
1.37 This is a statistical sample. Every employee has an equal chance of being selected using this
method. In fact, this is an example of a simple random sample because every possible sample of
size 50 has an equal chance of being selected.
1.39 Students should choose the Data tab, select Data Analysis from the Analysis group – Random
Number Generation process. Students’ answers will differ since Excel generates different streams
of random numbers each time it is used, but 40 random numbers should be generated from a
uniform distribution with values ranging from 1 to 578. Since the application requires integer
numbers, the Decrease Decimal option should be used.
1.41
a. Since there are 4,000 patient files, we could give each file a unique identification number
consisting of 4 digits. The first file would be given the identification number “0001”. The last
file would be given the identification number of “4000”. By assigning each patient a number
and randomly selecting the 100 numbers allows each possible sample of 100 an equal chance of
being selected.
b. Either use a random number table (randomly select the starting row and column), or use a
computer program, such as Microsoft Excel, which has a random number generator.
Section 1.4
1.43 Qualitative data are categories or numerical values that represent categories. Quantitative data is
data that is purely numerical.
1.45 Nominal data involves placing observations in separate categories according to some measurable
characteristic. Ordinal data also involves placing observations into separate categories, but the
categories can be rank-ordered.
1.47
a. The data are cross-sectional. The data are collected from 2,300 customers at approximately the
same point in time
b. This is a ratio level, quantitative variable. The data represent a measurement of time.
c. Ordinal with a numerical value representing customers rating of level of service
1.49
a. Cross-sectional
b. Time-series
c. Cross-sectional
d. Cross-sectional
e. Time-Series
End of Chapter
1.51 Answers will vary with the student. But a good discussion should include the following factors:
Sampling techniques and possible problems selecting a representative sample.
Determining how to develop questions to measure approval.
Structuring questions to avoid bias.
The measurement scale associated with the questions.
The fact these polls tend to develop time-series data.
1.55 Answers will vary with the student. But a good discussion should include the following factors:
Sampling techniques and possible problems selecting a representative sample.
Determining how to measure confidence.
Structuring questions to avoid bias.
The measurement scale associated with the questions.
The fact this poll is specifically intended to develop time-series data.