STT 215 Exam 1 Example
STT 215 Exam 1 Example
Show your work for short answer problems requiring more than one word answers.
1) MATCHING (15 points) Match the following vocabulary terms with their definitions.
A. Sampling frame
B. Parameter
C. Stratified random sampling
D. Double-blinding
E. Census
F. Discrete variable
G. Cluster random sampling
H. Explanatory variable
I. Simpson’s Paradox
J. Simple random sampling
K. Statistic
L. Retrospective study
M. Nonresponse bias
N. Lurking variable
O. Response variable
____ Each subject (or sample of that size) has the same chance of being selected
____ An issue when some subjects cannot be reached or refuse to participate
____ A numerical summary of a sample taken from the population
____ A quantitative variable with possible values that form a separate set of numbers (e.g. 0, 1, 2, …)
____ Identify existing groups of subjects, take simple random sample of the groups
____ Divide the population into groups by a shared characteristic, take simple random sample of each
group
____ A complete enumeration of an entire population
____ A numerical summary of the population
____ A list of possible subjects in the population
____ When neither the subjects nor the researchers in contact with them know the treatment
assignment
____ The outcome variable
____ The variable that describes differences in the response variable
____ An often-unobserved influence that affects the association between the primary variables of
interest
____ When the direction of the association between two variables switches direction after the
introduction of a third variable
____ An observational study that relies on subjects’ past experiences or behavior
Form A 1
Name: Student ID: Date:
2) MULTIPLE CHOICE (5 points): A study attempted to estimate the proportion of North Carolina
residents who were willing to spend more tax dollars on protecting the North Carolina beaches from
environmental disasters. Twenty-four hundred North Carolina residents were surveyed. Which is the
population of interest in the study?
a. All NC residents who live on the beaches
b. All NC residents who were willing to spend more tax dollars on protecting the beaches from
environmental disasters
c. All US residents
d. The 2400 NC residents surveyed
e. All NC residents
3) MULTIPLE CHOICE (5 points): Match each plot with its r-value by writing the number in blank.
1) 𝑟 = 0.85
2) 𝑟 = 0.15
3) 𝑟 = −0.50
4) 𝑟 = 1.00
Plot a _______
Plot b _______
Plot c _______
Plot d _______
4) MULTIPLE CHOICE (5 points): A researcher recorded the years of education and starting salaries
for 12 subjects. A scatterplot of the data with a regression line. The point (14, 4) was omitted when
computing the regression equation.
Is the point (14, 4) an outlier on x? Is it an outlier on y? It is a
regression outlier?
Form A 2
Name: Student ID: Date:
5) SHORT ANSWER (10 points): A sample of fifty motorists was taken on a federal highway where the speed
limit was 60 miles per hour. A dot plot of their speeds is shown below.
6) SHORT ANSWER (10 points): Globally, systolic blood pressure readings have a mean of 128.4 and a
standard deviation of 19.6. A reading above 140 is considered high blood pressure.
a. What is the z-score for a blood pressure reading of 140? Interpret this z-score.
b. The systolic blood pressure values have a bell-shaped distribution. Report an interval which about
95% of the systolic blood pressure values fall.
7) SHORT ANSWER (15 points): Use StatCrunch to analyze the youth_unemployment data under Chapter 2.
This dataset includes unemployment data for 15- to 24-year-olds living in the 28 EU countries in 2013
(source: World Bank).
a. Find the five-number summary for the total youth population and the female youth population.
(Hint: create side-by-side boxplots.)
Total Female
Minimum/lower
Q1
Median
Q3
Maximum/upper
b. Do the box plots for total youth and female youth show any outliers? If so, identify and explain.
c. The mean total unemployment rate is 11.1 with a standard deviation of 5.6. For these data, does
any country have a z-score that is larger than 3 in absolute value? Explain. (Hint: analyze outliers.)
Form A 3
Name: Student ID: Date:
8) SHORT ANSWER (15 points): In 2019, 22% of US college students reported vaping tobacco products in the
past month, according to the National Institutes of Health. Imagine that a researcher wants to determine
whether an anti-vaping campaign was effective in reducing use among college students.
a. Find the margin of error and confidence interval if, one year later, the researcher found a sample
statistic of 12% of students vaping tobacco products for a sample of 20 students.
b. Find the margin of error and confidence interval if, one year later, the researcher found a sample
statistic of 12% of students vaping tobacco products for a sample of 200 students.
c. In either case, can the researcher conclude with confidence that the percentage of students vaping
tobacco products has decreased in the population from 22% before the intervention? Explain.
9) SHORT ANSWER (20 points): Use StatCrunch to analyze the Animals dataset under Chapter 3. This dataset
provides gestational period (in days) and average life expectancy (in years) for various animals.
a. What is the regression equation that describes these data, using gestation as the explanatory variable
and longevity as the response variable?
b. Interpret the slope value of your equation. (i.e., write a sentence to describe the relationship between
the variables).
c. What is the y-intercept for the line? Can you interpret it? Why or why not?
d. What is the correlation value for these data? What does it tell you about the data?
e. Find the 𝑟 2 value and explain its meaning for these data.
Form A 4
Name: Student ID: Date: