0% found this document useful (0 votes)
17 views28 pages

TB - 2 Exploring Two Variable Data

The document is an AP Statistics test booklet that includes various questions related to exploring two-variable data, including surveys, relationships between variables, and statistical interpretations. It contains multiple-choice questions assessing students' understanding of concepts such as association, correlation, and regression analysis. The test covers a range of scenarios, including student employment, health sciences studies, and survey return rates.

Uploaded by

xuanyuan122486
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views28 pages

TB - 2 Exploring Two Variable Data

The document is an AP Statistics test booklet that includes various questions related to exploring two-variable data, including surveys, relationships between variables, and statistical interpretations. It contains multiple-choice questions assessing students' understanding of concepts such as association, correlation, and regression analysis. The test covers a range of scenarios, including student employment, health sciences studies, and survey return rates.

Uploaded by

xuanyuan122486
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 28

AP Statistics Test Booklet

2Exploring Two-Variable Data Name

1.

A survey of 57 students was conducted to determine whether or not they held jobs outside of school. The
two-way table above shows the number of students by employment status (job, no job), and class (juniors,
seniors). Which of the following best describes the relationship between employment status and class?

A There appears to be no association, since the same number of juniors and seniors have jobs.

B There appears to be no association, since close to half of the students have jobs.

C There appears to be an association, since there are more seniors than juniors in the survey.

There appears to be an association, since the proportion of juniors having jobs is much larger than the
D
proportion of seniors having jobs.

E A measure of association cannot be determined from these data.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 1 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

2. The director of a technical school was curious about whether there is a relationship between students who
complete one of the school's most popular health sciences certificate programs and whether those students
go on to complete more advanced studies in the health sciences within two years of completing the
certificate program. She randomly selected 100 students who completed the program. Data collected on
these students are shown in the table below.

Which of the following statements is true for these 100 students?

Being a person who completed more advanced studies is more likely than being a person who did not
A
complete more advanced studies.

Being a person who completed the program is less likely than being a person who did not complete the
B
program.

Being a person who completed the program and completed more advanced studies is less likely than being a
C
person who did not complete the program and did not complete more advanced studies.

Being a person who did not complete the program but completed more advanced studies is less likely than
D
being a person who completed the program and completed more advanced studies.

Being a person who completed the program but did not complete more advanced studies is more likely than
E
being a person who did not complete the program and did not complete more advanced studies.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 2 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

3. A field researcher who studies lions conjectured that the more time a cub spends playing, the sooner the cub
will begin to hunt. Observational data were collected from 20 lion cubs. The researcher recorded how long
they spent playing and the age when they began hunting. Because male and female lions have different
hunting behaviors, the researcher recorded the data for males and females separately. The two scatterplots
show the data for the 10 female lions and the 10 male lions.

Based on the scatterplots, for which gender does there appear to be evidence that the more time a lion cub
spends playing, the sooner the cub is likely to begin hunting?

A For female cubs only

B For male cubs only

C For both male cubs and female cubs, with equal evidence

D For both male cubs and female cubs, with more evidence for female cubs than for male cubs

E For neither male cubs nor female cubs

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 3 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

4. College researchers wanted to know under what conditions people are more likely to complete and return a
survey. As part of a study, the researchers prepared three sets of identical surveys and used three methods of
delivering and returning the surveys. The methods are described as follows.
• In Class: The surveys were given to students in a class, and students were asked to return completed
surveys to their instructor.
• Psychology: The surveys were given to students participating in a psychology experiment, and students
were asked to return completed surveys to a collection box in the hallway of the psychology building.
• Dining Hall: The surveys were given to students in the dining hall, and students were asked to return
completed surveys to a collection box outside the dining hall.
The graph shows the percent of surveys returned and not returned for each delivery method.

Which statement about delivery method and rate of survey return is supported by the graph?

A There is a positive association between delivery method and rate of return.

B There is a negative association between delivery method and rate of return.

The number of surveys given using the Dining Hall delivery method was less than the number given using
C
either of the other delivery methods.

The Psychology delivery method displays the most symmetric results; the other delivery methods display
D
skewed results.

The In Class delivery method had the greatest rate of return, and the Dining Hall delivery method had the
E
least rate of return.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 4 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

5. First-year students enrolled at a college were asked whether they play video games. The responses,
classified by whether the students were enrolled in the school of sciences or the school of arts, are shown in
the table.

Of all the students enrolled in the school of arts who responded, approximately what proportion responded
that they play video games?

A 0.242

B 0.401

C 0.438

D 0.554

E 0.605

6. A sample of 942 homeowners are classified, in the two-way frequency table below, by the number of credit
cards they have and the number of years they have owned their current homes.

Of the homeowners in the sample who have four or more credit cards, what proportion have owned their
current homes for at least one year?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 5 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

A 78/212

B 78/258

C 78/942

D 212/942

E 258/942

7. As part of a study on the relationship between the use of tanning booths and the occurrence of skin cancer,
researchers reviewed the medical records of 1,436 people. The table below summarizes tanning booth use
for people in the study who did and did not have skin cancer.

Of the people in the study who had skin cancer, what fraction used a tanning booth?

A 190/265

B 190/896

C 190/1,436

D 265/1, 436

E 896/1, 436

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 6 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

8. At a local ice-cream store, 210 people were surveyed on whether they preferred eating ice cream from a
cone or a cup. Of the 210 people surveyed, 70 were adults and 140 were children. Of the responses, 150
indicated the cone as the preferred method of eating ice cream. For those surveyed, there was no association
between age and preferred method of eating ice cream. Which of the following tables shows the distribution
of responses?

9. Suppose a certain scale is not calibrated correctly, and as a result, the mass of any object is displayed as 0.75
kilogram less than its actual mass. What is the correlation between the actual masses of a set of objects and
the respective masses of the same set of objects displayed by the scale?

display = actual - 0.75

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 7 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

C 0

D 0.75

E 1

10. For which of the following scatterplots is the correlation between x and y closest to 0 ?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 8 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 9 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

11. A local company is interested in supporting environmentally friendly initiatives such as carpooling among
employees. The company surveyed all of the 200 employees at the downtown offices. Employees responded
as to whether or not they own a car and to the location of the home where they live. The results are shown in
the table below.

Which of the following statements about a randomly chosen person from these 200 employees is true?

If the person owns a car, he or she is more likely to live elsewhere in the city than to live in the downtown
A
area in the city.

If the person does not own a car, he or she is more likely to live outside the city than to live in the city
B
(downtown area or elsewhere).

The person is more likely to own a car if he or she lives in the city (downtown area or elsewhere) than if he
C
or she lives outside the city.

D The person is more likely to live in the downtown area in the city than elsewhere in the city.

E The person is more likely to own a car than not to own a car.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 10 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

12. An experiment was conducted to investigate the relationship between the dose of a pain medication and the
number of hours of pain relief. Twenty individuals with chronic pain were randomly assigned to one of five
doses—0.0, 0.5, 1.0, 1.5, 2.0—in milligrams (mg) of medication. The results are shown in the scatterplot
below.

The data were used to fit a least-squares regression line to predict the number of hours of pain relief for a
given dose. Which of the following would be revealed by a plot of the residuals of the regression versus the
dose?

A The sum of the residuals is less than 0.

B The sum of the residuals is greater than 0.

C There are outliers associated with the lower doses.

D The variation in the hours of pain relief is not the same across the doses.

E There is a positive linear relationship between the residuals and the dose.

13. A 90 percent confidence interval for the slope of a regression line is determined to be (-0.181, 1.529).
Which of the following statements must be true?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 11 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

A The correlation coefficient of the data is positive.

B The sum of the residuals for the data based on the regression line is positive.

C A scatterplot of the data would show a linear pattern.

D The slope of the sample regression line is 1.348.

E The slope of the sample regression line is 0.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 12 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

Data were collected on the fiber diameter and the fleece weight of wool taken from a sample of sheep. The data
14.
are shown in the following graphs. Graph is a scatterplot of fleece weight versus fiber diameter with the
respective least-squares regression line shown. Graph is the associated plot of the residuals versus the predicted
values.

One point is circled on graph . Five points labeled A, B, C, D, and E are identified on graph . Which point on
graph represents the residual for the circled point on graph ?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 13 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

A A

B B

C C

D D

E E

15. A scatterplot of student height, in inches, versus corresponding arm span length, in inches, is shown below.
One of the points in the graph is labeled A.

If the point labeled A is removed, which of the following statements would be true?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 14 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

A The slope of the least squares regression line is unchanged and the correlation coefficient increases.

B The slope of the least squares regression line is unchanged and the correlation coefficient decreases.

C The slope of the least squares regression line increases and the correlation coefficient increases.

D The slope of the least squares regression line increases and the correlation coefficient decreases.

E The slope of the least squares regression line decreases and the correlation coefficient increases.

16. At a large airport, data were recorded for one month on how many baggage items were unloaded from each
flight upon arrival as well as the time required to deliver all the baggage items on the flight to the baggage
claim area. A scatterplot of the two variables indicated a strong, positive linear association between the
variables. Which of the following statements is a correct interpretation of the word “strong” in the
description of the association?

A least-squares model predicts that the more baggage items that are unloaded from a flight, the greater the
A
time required to deliver the items to the baggage claim area.

The actual time required to deliver all the items to the baggage claim area based on the number of items
B
unloaded will be very close to the time predicted by a least-squares model.

The time required to deliver an item to the baggage claim area is relatively constant, regardless of the
C
number of baggage items unloaded from a flight.

The variability in the time required to deliver all items to the baggage claim area is about the same for all
D
flights, regardless of the number of items unloaded from a flight.

The time required to unload baggage items from a flight is related to the time required to deliver the items to
E
the baggage claim area.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 15 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

17. Three brands of candy pieces—X, Y, and Z—are made in many colors. Shaela bought one bag of each
brand and counted the number of pieces of each color. The graph below shows the relative frequency
distribution of colors for each bag.

Which of the following statements must be true?

A For Brand X, there were more green candy pieces than red candy pieces in the bag.

B For Brand Y, there were more red candy pieces than green candy pieces in the bag.

C There were more green candy pieces in the Brand X bag than were in the Brand Z bag.

D There were the same number of blue candy pieces in the Brand X bag as were in the Brand Y bag.

The number of blue candy pieces in the Brand Z bag was equal to the sum of the number of blue candy
E
pieces in the other two bags.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 16 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

18. As part of a community service program, students in three middle school grades (grade 6, grade 7, grade 8)
each chose to participate in one of three school-sponsored volunteer activities. The graph below shows the
distribution for each class for the three activities.

Based on the graph, which statement must be true?

A Of all the students who chose activity B, the greatest number of students were in grade 6.

B Grade 7 and grade 8 had the same number of students who did not choose activity A.

C The grade with the greatest percentage of students who chose activity C was grade 8.

D For students in grade 7, the number who chose activity C was greater than the number who chose activity B.

E For students in grade 8, the number who chose activity A was greater than the number who chose activity B.

The height and age of each child in a random sample of children was recorded. The value of the correlation
19.
coefficient between height and age for the children in the sample was . Based on the least-squares
regression line created from the data to predict the height of a child based on age, which of the following is
a correct statement?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 17 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

A On average, the height of a child is of the age of the child.

B The least-squares regression line of height versus age will have a slope of .

C The proportion of the variation in height that is explained by a regression on age is .

D The least-squares regression line will correctly predict height based on age of the time.

E The least-squares regression line will correctly predict height based on age of the time.

20. A roadrunner is a desert bird that tends to run instead of fly. While running, the roadrunner uses its tail as a
balance. A sample of 10 roadrunners was taken, and the birds’ total length, in centimeters (cm), and tail
length, in cm, were recorded. The output shown in the table is from a least-squares regression to predict tail
length given total length.

Suppose a roadrunner has a total length of 59.0 cm and tail length of 31.1 cm. Based on the residual, does
the regression model overestimate or underestimate the tail length of the roadrunner?

A Underestimate, because the residual is positive.

B Underestimate, because the residual is negative.

C Overestimate, because the residual is positive.

D Overestimate, because the residual is negative.

E Neither, because the residual is 0.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 18 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

21. A factory has two machines, A and B, making the same part for refrigerators. The number of defective parts
produced by each machine during the first hour of operation was recorded on 19 randomly selected days.
The scatterplot below shows the number of defective parts produced by each machine on the selected days.

Which statement gives the best comparison between the number of defective parts produced by the
machines during the first hour of operation on the 19 days?

A Machine A always produced the same number of defective parts as machine B.

B Machine A always produced fewer defective parts than machine B.

C Machine A always produced more defective parts than machine B.

D Machine A usually, but not always, produced fewer defective parts than machine B.

E Machine A usually, but not always, produced more defective parts than machine B.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 19 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

22. In a recent survey, high school students and their parents were asked to rate 60 recently released movies.
The ratings were on a scale from 1 to 9, where 1 was “horrible” and 9 was “excellent”. For each movie, the
average rating by the students and the average rating by their parents was calculated and the scatterplot
below was constructed.The horizontal axis represents the student rating, and the vertical axis represents the
parent rating.Thus, an individual data point would represent the rating of a single movie.

Which of the following statements is justified by the scatterplot?

The movies that the students liked the best also tended to be the movies that the parents liked the best, but
A
the students tended to give lower scores.

The movies that the students liked the best also tended to be the movies that the parents liked the best, but
B
the students tended to give higher scores.

The movies that the students liked the best also tended to be the movies that the parents liked the best, but
C
each group tended to give the same scores.

The movies that the students liked the best tended to be the movies that the parents liked the least, but the
D
students tended to give lower scores.

The movies that the students liked the best tended to be the movies that the parents liked the least, but the
E
students tended to give higher scores.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 20 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

23. An agriculturalist working with Australian pine trees wanted to investigate the relationship between the age
and the height of the Australian pine. A random sample of Australian pine trees was selected, and the age, in
years, and the height, in meters, was recorded for each tree in the sample. Based on the recorded data, the
agriculturalist created the following regression equation to predict the height, in meters, of the Australian
pine based on the age, in years, of the tree.
predicted height = 0.29 + 0.48(age)
Which of the following is the best interpretation of the slope of the regression line?

A The height increases, on average, by 1 meter each 0.48 year.

B The height increases, on average, by 0.48 meter each year.

C The height increases, on average, by 0.29 meter each year.

D The height increases, on average, by 0.29 meter each 0.48 year.

E The difference between the actual height and the predicted height is, on average, 0.48 meter for each year.

24. The computer output below shows the result of a linear regression analysis for predicting the concentration
of zinc, in parts per million (ppm), from the concentration of lead, in ppm, found in fish from a certain river.

Which of the following statements is a correct interpretation of the value 19.0 in the output?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 21 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

On average there is a predicted increase of 19.0 ppm in concentration of lead for every increase of 1 ppm in
A
concentration of zinc found in the fish.

On average there is a predicted increase of 19.0 ppm in concentration of zinc for every increase of 1 ppm in
B
concentration of lead found in the fish.

C The predicted concentration of zinc is 19.0 ppm in fish with no concentration of lead.

D The predicted concentration of lead is 19.0 ppm in fish with no concentration of zinc.

Approximately 19% of the variability in zinc concentration is predicted by its linear relationship with lead
E
concentration.

25. There is a linear relationship between the number of chirps made by the striped ground cricket and the air
temperature. A least squares fit of some data collected by a biologist gives the model
ŷ = 25.2 + 3.3x 9 < x < 25,
where x is the number of chirps per minute and ŷ is the estimated temperature in degrees Fahrenheit. What
is the estimated increase in temperature that corresponds to an increase of 5 chirps per minute?

A 3.3 ° F

B 16.5 ° F

C 25.2 ° F

D 28.5 ° F

E 41.7 ° F

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 22 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

26. Exercise physiologists are investigating the relationship between lean body mass (in kilograms) and the
resting metabolic rate (in calories per day) in sedentary males.

Based on the computer output above, which of the following is the best interpretation of the value of the
slope of the regression line?

For each additional kilogram of lean body mass, the resting metabolic rate increases on average by 22.563
A
calories per day.

For each additional kilogram of lean body mass, the resting metabolic rate increases on average by 264.0
B
calories per day.

For each additional kilogram of lean body mass, the resting metabolic rate increases on average by 144.9
C
calories per day.

For each additional calorie per day for the resting metabolic rate, the lean body mass increases on average
D
by 22.563 kilograms.

For each additional calorie per day for the resting metabolic rate, the lean body mass increases on average
E
by 264.0 kilograms.

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 23 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

27. A random sample of people were asked whether color was a consideration in buying a new car. They
were also asked to identify one additional feature that was important. The responses are shown in the table.
Color Consideration

Yes No Maybe Total

Comfort

Cost

Performance

Reliability

Safety

Total
Which of the following is closest to the proportion of people who responded no to color consideration and
who identified safety as the additional feature that was important?

28. Which of the following scatterplots could represent a data set with a correlation coefficient of r = -1?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 24 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 25 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

29. A veterinarian collected data on the weights of 1,000 cats and dogs treated at a veterinary clinic. The weight
of each animal was classified as either healthy, underweight, or overweight. The data are summarized in the
table.

Based on the data in the table, which of the following is the most appropriate type of graph to visually show
whether a relationship exists between the type of animal and the weight classification?

A Back-to-back stemplots

B Scatterplot

C Side-by-side boxplots

D Segmented bar chart

E Dotplot

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 26 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

30. Researchers conducted a telephone survey of 427 adults living in a large city. The adults were asked
whether they planned to purchase a smart watch in the next year. The table shows the responses categorized
by the region of the city in which the residents live.

Which of the following graphical displays is most appropriate for comparing the proportions of those
surveyed who plan to purchase a smart watch within the four regions?

A A scatterplot

B A boxplot

C A segmented bar chart

D A back-to-back stemplot

E A dotplot

31. Consider n pairs of numbers (x1,y1), (x2,y2), ..., and (xn, yn). The mean and standard deviation of the x-values
are x̄ =5 and sx = 4, respectively. The mean and standard deviation of the y-values are ȳ = 10 and
sy = 10
respectively. Of the following, which could be the least squares regression line?

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your
school’s participation in the program is prohibited.
Page 27 of 28
AP Statistics Test Booklet

2Exploring Two-Variable Data

A ŷ = -5.0 + 3.0x

B ŷ = 3.0x

C ŷ = 5.0 + 2.5x

D ŷ = 8.5 + 0.3x

E ŷ = 10.0 + 0.4x

Copyright © 2021. The College Board. These materials are part of a College Board program. Use or distribution of these materials online or in print beyond your school’s participation in
the program is prohibited.
Page 28 of 28

You might also like