AP Stat 2019 Practice
AP Stat 2019 Practice
AP Statistics
®
Practice Exam
© 2019 The College Board. College Board, Advanced Placement, AP, AP Central, and the acorn logo are
registered trademarks of the College Board. Visit the College Board on the web: collegeboard.org.
AP Central is the official online home for the AP Program: apcentral.collegeboard.org.
Contents
Exam Instructions
Scoring Worksheet
© 2019 The College Board. College Board, Advanced Placement Program, AP, SAT and the acorn logo are
registered trademarks of the College Board. All other products and services may be trademarks of their
respective owners. Permission to use copyrighted College Board materials may be requested online at:
www.collegeboard.org/request-form.
AP Statistics Exam
Regularly Scheduled Exam Date: Thursday afternoon, May 16, 2019
Late-Testing Exam Date: Wednesday morning, May 22, 2019
Before Distributing Exams: Check that the title on all exam covers is Statistics. If there are any exam booklets
with a different title, contact the AP coordinator immediately.
Students are expected to bring graphing calculators with statistical capabilities to the
AP Statistics Exam
AP Statistics Exam. Nongraphing scientific calculators are permitted as long as they
have the required computational capabilities. Before starting the exam administration,
make sure each student has a graphing calculator from the approved list on page 53 of
the 2018-19 AP Coordinator’s Manual or a scientific calculator. It is up to the student
to determine if a nongraphing scientific calculator has the required computational
capabilities. If a student does not have a graphing calculator from the approved list or an
appropriate scientific calculator, you may provide one from your supply. See pages 50–53
of the AP Coordinator’s Manual for more information. If the student does not want to use
the calculator you provide, or does not want to use a calculator at all, he or she must hand
copy, date, and sign the release statement on page 52 of the AP Coordinator’s Manual.
Students may have no more than two calculators on their desks. Calculators may not be
shared. Calculator memories do not need to be cleared before or after the exam. Students
with Hewlett-Packard 48–50 Series and Casio FX-9860 graphing calculators may use cards
designed for use with these calculators. Proctors should make sure infrared ports (Hewlett-
Packard) are not facing each other. Since graphing calculators can be used to store
data, including text, proctors should monitor that students are using their calculators
appropriately. Attempts by students to use the calculator to remove exam questions
and/or answers from the room may result in the cancellation of AP Exam scores.
Make sure you begin the exam at the designated time. Remember, you must complete a
seating chart for this exam. See pages 295–296 for a seating chart template and instructions.
See the 2018-19 AP Coordinator’s Manual for exam seating requirements (pages 56–59).
If you are giving the alternate exam for late testing, say:
It is Wednesday morning, May 22, and you will be taking the AP Statistics Exam.
Look at your exam packet and confirm that the exam title is “AP Statistics.”
Raise your hand if your exam packet contains any title other than “AP Statistics,”
and I will help you.
Once you confirm that all students have the correct exams, say:
In a moment, you will open the exam packet. By opening this packet, you
agree to all of the AP Program’s policies and procedures outlined in the
2018-19 Bulletin for AP Students and Parents.
You may now remove the shrinkwrap from the outside only of your exam
packet. Do not open the Section I booklet; do not remove the shrinkwrap from
the Section II materials. Put the white seals and the shrinkwrapped Section II
booklet aside. . . .
Carefully remove the AP Exam label found near the top left of your exam
booklet cover. Place it on page 1 of your answer sheet on the light blue box near
the top right corner that reads “AP Exam Label.”. . .
If students accidentally place the exam label in the space for the number label or vice
versa, advise them to leave the labels in place. They should not try to remove the label;
their exam can still be processed correctly.
Listen carefully to all my instructions. I will give you time to complete each
step. Please look up after completing each step. Raise your hand if you have
any questions.
Give students enough time to complete each step. Don’t move on until all students are ready.
Check that students are marking their answers in pencil on their answer sheets and
that they have not opened their shrinkwrapped Section II booklets. Proctors should walk
around and make sure Hewlett-Packard calculators’ infrared ports are not facing each other
and that students are not sharing calculators.
back cover. When you have finished, place the booklet on your desk, faceup.
AP Statistics Exam
I will now collect your Section I booklet. . . .
Collect a Section I booklet from each student. Check that each student has signed the front
cover of the sealed Section I booklet.
When all Section I materials have been collected and accounted for and you are
ready for the break, say:
Please listen carefully to these instructions before we take a 10-minute break.
All items you placed under your chair at the beginning of this exam, including
your Student Pack, must stay there, and you are not permitted to open or
access them in any way. Leave your shrinkwrapped Section II packet on your
desk during the break. You are not allowed to consult teachers, other students,
notes, textbooks, or any other resources during the break. You may not make
phone calls, send text messages, use your calculators, check email, use a
social networking site, or access any electronic or communication device. You
may not leave the designated break area. Remember, you may never discuss
the multiple-choice exam content with anyone, and if you disclose the content
through any means, your AP Exam score will be canceled. Are there any
questions? . . .
You may begin your break. Testing will resume at .
Collect the Student Packs from students who are taking any other AP Exams this year.
Then say:
You should also make sure that Hewlett-Packard calculators’ infrared ports are not facing
each other and that students are not sharing calculators.
Then say:
Remain in your seat, without talking, while the exam materials are collected. . . .
Collect a Section II booklet from each student. Check for the following:
Exam booklet front cover: The student placed an AP number label on the shaded box
and printed their initials and today’s date.
Exam booklet back cover: The student completed the “Important Identification
Information” area.
When all exam materials have been collected and accounted for, return to students any
electronic devices you may have collected before the start of the exam.
AP Statistics Exam
If you are giving the regularly scheduled exam, say:
You may not discuss or share the free-response exam content with anyone
unless it is released on the College Board website in about two days. Your AP
Exam score results will be available online in July.
If you are giving the alternate exam for late testing, say:
None of the content in this exam may ever be discussed or shared in any way at
any time. Your AP Exam score results will be available online in July.
If any students completed the AP number card at the beginning of this exam, say:
Please remember to take your AP number card with you. You will need the
information on this card to view your scores and order AP score reporting
services online.
Then say:
You are now dismissed.
After-Exam Tasks
Be sure to give the completed seating chart to the AP coordinator. Schools must retain
seating charts for at least six months (unless the state or district requires that they be
retained for a longer period of time). Schools should not return any seating charts in their
exam shipments unless they are required as part of an Incident Report.
NOTE: If you administered exams to students with accommodations, review the 2018-19 AP
Coordinator’s Manual and the 2018-19 AP SSD Guidelines for information about completing
the Nonstandard Administration Report (NAR) form, and returning these exams.
The exam proctor should complete the following tasks if asked to do so by the
AP coordinator. Otherwise, the AP coordinator must complete these tasks:
Complete an Incident Report for any students who used extra paper for the free-response
section. (Incident Report forms are provided in the coordinator packets sent with
the exam shipments.) These forms must be completed with a No. 2 pencil. It is
best to complete a single Incident Report for multiple students per exam subject, per
administration (regular or late testing), as long as all required information is provided.
Include all exam booklets with extra sheets of paper in an Incident Report return
envelope (see page 68 of the 2018-19 AP Coordinator’s Manual for complete details).
Return all exam materials to secure storage until they are shipped back to the
AP Program. (See page 27 of the 2018-19 AP Coordinator’s Manual for more information
about secure storage.) Before storing materials, check the “School Use Only” section
on page 1 of the answer sheet and:
Fill in the appropriate section number circle in order to access a separate
AP Instructional Planning Report (for regularly scheduled exams only) or subject
score roster at the class section or teacher level. See “Post-Exam Activities” in the
2018-19 AP Coordinator’s Manual.
Check your list of students who are eligible for fee reductions and fill in the
appropriate circle on their registration answer sheets.
Instructions
At a Glance
Section I of this exam contains 40 multiple-choice questions. Fill in only the circles for
Total Time numbers 1 through 40 on your answer sheet.
1 hour and 30 minutes
Number of Questions Indicate all of your answers to the multiple-choice questions on the answer sheet. No
40 credit will be given for anything written in this exam booklet, but you may use the booklet
Percent of Total Score for notes or scratch work. After you have decided which of the suggested answers is best,
50% completely fill in the corresponding circle on the answer sheet. Give only one answer to
Writing Instrument each question. If you change an answer, be sure that the previous mark is erased
Pencil required completely. Here is a sample question and answer.
Electronic Device
Graphing calculator
expected
Use your time effectively, working as quickly as you can without losing accuracy. Do not
spend too much time on any one question. Go on to other questions and come back to
the ones you have not answered if you have time. It is not expected that everyone will
know the answers to all of the multiple-choice questions.
Your total score on the multiple-choice section is based only on the number of questions
answered correctly. Points are not deducted for incorrect answers or unanswered
questions.
Form I
Form Code 4PBP4-S
90
Formulas begin on page 3.
Questions begin on page 6.
Tables begin on page 42.
-2-
Formulas
xi
x
n
1 2
sx xi x
n 1
b1
xi x yi y
xi x 2
b0 y b1 x
1 x x yi y
r i s
n 1 sx y
sy
b1 r
sx
yi yˆi 2
sb n2
1
xi x 2
-3-
(II) Probability
P ( A B ) P ( A) P ( B ) P ( A B )
P ( A B)
P ( A B)
P ( B)
E ( X ) µ x xi pi
2
Var( X ) s 2x xi µ x pi
n
P ( X k ) p k (1 p)n k
k
µ x np
s x np(1 p)
µ pˆ p
p(1 p)
s pˆ
n
µx µ
s
sx
n
-4-
(III) Inferential Statistics
statistic parameter
Standardized test statistic:
standard deviation of statistic
Single-Sample
Standard Deviation
Statistic
of Statistic
s
Sample Mean n
p(1 p)
Sample Proportion n
Two-Sample
Standard Deviation
Statistic
of Statistic
Difference of s12 s 22
sample means
n1 n2
Difference of p1 (1 p1 ) p2 (1 p2 )
sample proportions
n1 n2
observed expected 2
Chi-square test statistic expected
-5-
STATISTICS
SECTION I
Time—1 hour and 30 minutes
Number of questions—40
Percent of total score—50
Directions: Solve each of the following problems, using the available space for scratch work. Decide which is the
best of the choices given and fill in the corresponding circle on the answer sheet. No credit will be given for
anything written in the test book. Do not spend too much time on any one problem.
1. A group of students played a game in which they earned points for answering questions correctly. The following
dotplot shows the total number of points earned by each student.
Which of the following is the best description of the distribution of points earned?
(A) Approximately normal
(B) Bimodal without a gap
(C) Bimodal with a gap
(D) Skewed to the right without a gap
(E) Skewed to the right with a gap
One person from those who responded will be selected at random. Which of the following is closest to the
probability that the person selected will be someone who responded no, given that the person selected is age
55 or older?
(A) 0.350
(B) 0.427
(C) 0.462
(D) 0.757
(E) 0.818
One point is circled on graph 1. Five points labeled A, B, C, D, and E are identified on graph 2. Which point on
graph 2 represents the residual for the circled point on graph 1 ?
(A) A
(B) B
(C) C
(D) D
(E) E
Which of the following shapes is NOT represented by one of the four distributions?
(A) Uniform
(B) Bimodal
(C) Skewed to the left
(D) Skewed to the right
(E) Symmetric and unimodal
Color Consideration
Yes No Maybe Total
Comfort 40 96 12 148
Cost 108 68 8 184
Performance 62 62 12 136
Reliability 128 116 4 248
Safety 152 192 32 376
Total 490 534 68 1,092
Which of the following is closest to the proportion of people who responded no to color consideration and who
identified safety as the additional feature that was important?
(A) 0.18
(B) 0.34
(C) 0.36
(D) 0.49
(E) 0.51
Based on the boxplots, which statement provides the best comparison of the two locations?
(A) The number of single-day rentals is greater for location A than for location B.
(B) The number of single-day rentals is less for location A than for location B.
(C) Compared with location A, the miles driven for location B display more variability, and the median is
greater.
(D) Compared with location A, the miles driven for location B display less variability, and the median is greater.
(E) Compared with location A, the miles driven for location B display less variability, and the median is about
the same.
10. The distribution of number of hours worked by volunteers last year at a large hospital is approximately normal
with mean 80 and standard deviation 7. Volunteers in the top 20 percent of hours worked will receive a
certificate of merit. If a volunteer from last year is selected at random, which of the following is closest to the
probability that the volunteer selected will receive a certificate of merit given that the number of hours the
volunteer worked is less than 90 ?
(A) 0.077
(B) 0.123
(C) 0.134
(D) 0.618
(E) 0.923
The values of 60, 62, and 84 were common to both samples. The three values are identified as outliers with
respect to the age-group 20 years to 30 years because they are either 1.5 times the interquartile range (IQR)
greater than the upper quartile or 1.5 times the IQR less than the lower quartile.
Using the same method for identifying outliers, which of the three values are identified as outliers for the
age-group 40 years to 50 years?
(A) None of the three values is identified as an outlier.
(B) Only 60 is identified as an outlier.
(C) Only 60 and 62 are identified as outliers.
(D) Only 60 and 84 are identified as outliers.
(E) The three values are all identified as outliers.
Which of the following graphs is a sampling distribution of the sample mean x for samples of size 40 taken
from the population?
(A) (B)
(C) (D)
(E)
18. According to data from the United States Elections Project, only 36 percent of eligible voters voted in the
2014 elections. For random samples of size 40, which of the following best describes the sampling distribution
of p̂, the sample proportion of people who voted in the 2014 elections?
(A) The sampling distribution is skewed to the left, with mean 0.36 and standard deviation 0.076.
(B) The sampling distribution is skewed to the right, with mean 0.64 and standard deviation 0.006.
(C) The sampling distribution is approximately normal, with mean 0.36 and standard deviation 0.076.
(D) The sampling distribution is approximately normal, with mean 0.36 and standard deviation 0.006.
(E) The sampling distribution is approximately normal, with mean 0.64 and standard deviation 0.076.
The graph shows the percent of surveys returned and not returned for each delivery method.
Which statement about delivery method and rate of survey return is supported by the graph?
(A) There is a positive association between delivery method and rate of return.
(B) There is a negative association between delivery method and rate of return.
(C) The number of surveys given using the Dining Hall delivery method was less than the number given using
either of the other delivery methods.
(D) The Psychology delivery method displays the most symmetric results; the other delivery methods display
skewed results.
(E) The In Class delivery method had the greatest rate of return, and the Dining Hall delivery method had the
least rate of return.
22. At a certain restaurant, the distribution of wait times between ordering a meal and receiving the meal has mean
11.4 minutes and standard deviation 2.6 minutes. The restaurant manager wants to find the probability that the
mean wait time will be greater than 12.0 minutes for a random sample of 84 customers. Assuming the wait times
among customers are independent, which of the following describes the sampling distribution of the sample
mean wait time for random samples of size 84 ?
(A) Approximately normal with mean 11.4 minutes and standard deviation 2.6 minutes
2.6
(B) Approximately normal with mean 11.4 minutes and standard deviation minute
84
(C) Approximately normal with mean 12.0 minutes and standard deviation 2.6 minutes
(D) Binomial with mean 84(0.41) minutes and standard deviation 84(0.41)(0.59) minutes
(E) Binomial with mean 84(0.5) minutes and standard deviation 84(0.5)(0.5) minutes
Based on the histogram, which of the following is closest to the interquartile range, in inches, of the distribution?
(A) 2
(B) 5
(C) 9
(D) 12
(E) 15
Based on the probability distribution, what is the mean number of people in passengers cars on the highway?
(A) 0.28
(B) 0.56
(C) 1.7
(D) 2
(E) 3
Toss number 1 2 3 4
Face color yellow yellow red blue
What is the probability that a player will toss the die at least 2 times before blue lands faceup?
(A) 0.1406
(B) 0.4219
(C) 0.4375
(D) 0.5625
(E) 0.5781
H 0 : m = 64
H a : m > 64
31. The director of a marketing department wants to estimate the proportion of people who purchase a certain
product online. The director originally planned to obtain a random sample of 2,500 people who purchased the
product. However, because of budget concerns, the sample size will be reduced to 1,500 people. Which of the
following describes the effect of reducing the number of people in the sample?
(A) The variance of the sample will increase.
(B) The variance of the population will decrease.
(C) The variance of the sampling distribution of the estimator will increase.
(D) The variance of the sampling distribution of the estimator will decrease.
(E) The variance of the sampling distribution of the estimator will remain the same.
Ê (0.32)(0.68)ˆ
(B) 0.32 ± 1.645 Á
Ë 1,005 ¯˜
(0.32)(0.68)
(C) 0.32 ± 2.575
1,005
(0.32)(0.68)
(D) 0.32 ± 1.960
1,005
(0.32)(0.68)
(E) 0.32 ± 1.645
1,005
Which of the following is a 95 percent confidence interval for the slope of the population regression line?
(A) (0.001, 2.107 )
(B) (0.396, 1.712 )
(C) (0.423, 1.685)
(D) (0.732, 1.376 )
(E) (53.07, 70.99 )
Based on the residual plot, which condition appears to have been violated?
(A) The errors are independent.
(B) The sum of the residuals is 0.
(C) The expected value of the errors is 0.
(D) There is a linear relationship between the response variable and the explanatory variable.
(E) The variance of the response variable is constant for all values of the explanatory variable.
(A) 1 - 0.0627
(B) 1 - 2 (0.0627)
(C) 1 - ( 12 )(0.0627)
(D) 2 (0.0627 )
1
(E) (0.0627)
2
(0.37)(0.63) (0.88)(0.12)
(B) -
2,002 3,748
(0.37)(0.63) (0.88)(0.12)
(C) +
2,002 3,748
(0.70)(0.30) (0.70)(0.30)
(D) +
2,002 3,748
(0.37)(0.63) (0.88)(0.12)
(E) +
2,002 3,748
What is the contribution to the chi-square test statistic for men who selected business networking as the most
important factor?
(A) 0.5
(B) 5
(C) 7.5
(D) 30
(E) 45
END OF SECTION I
IF YOU FINISH BEFORE TIME IS CALLED, YOU MAY
CHECK YOUR WORK ON THIS SECTION.
-41-
Probability
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
– 3.4 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0002
– 3.3 .0005 .0005 .0005 .0004 .0004 .0004 .0004 .0004 .0004 .0003
– 3.2 .0007 .0007 .0006 .0006 .0006 .0006 .0006 .0005 .0005 .0005
– 3.1 .0010 .0009 .0009 .0009 .0008 .0008 .0008 .0008 .0007 .0007
– 3.0 .0013 .0013 .0013 .0012 .0012 .0011 .0011 .0011 .0010 .0010
– 2.9 .0019 .0018 .0018 .0017 .0016 .0016 .0015 .0015 .0014 .0014
– 2.8 .0026 .0025 .0024 .0023 .0023 .0022 .0021 .0021 .0020 .0019
– 2.7 .0035 .0034 .0033 .0032 .0031 .0030 .0029 .0028 .0027 .0026
– 2.6 .0047 .0045 .0044 .0043 .0041 .0040 .0039 .0038 .0037 .0036
– 2.5 .0062 .0060 .0059 .0057 .0055 .0054 .0052 .0051 .0049 .0048
– 2.4 .0082 .0080 .0078 .0075 .0073 .0071 .0069 .0068 .0066 .0064
– 2.3 .0107 .0104 .0102 .0099 .0096 .0094 .0091 .0089 .0087 .0084
– 2.2 .0139 .0136 .0132 .0129 .0125 .0122 .0119 .0116 .0113 .0110
– 2.1 .0179 .0174 .0170 .0166 .0162 .0158 .0154 .0150 .0146 .0143
– 2.0 .0228 .0222 .0217 .0212 .0207 .0202 .0197 .0192 .0188 .0183
– 1.9 .0287 .0281 .0274 .0268 .0262 .0256 .0250 .0244 .0239 .0233
– 1.8 .0359 .0351 .0344 .0336 .0329 .0322 .0314 .0307 .0301 .0294
– 1.7 .0446 .0436 .0427 .0418 .0409 .0401 .0392 .0384 .0375 .0367
– 1.6 .0548 .0537 .0526 .0516 .0505 .0495 .0485 .0475 .0465 .0455
– 1.5 .0668 .0655 .0643 .0630 .0618 .0606 .0594 .0582 .0571 .0559
– 1.4 .0808 .0793 .0778 .0764 .0749 .0735 .0721 .0708 .0694 .0681
– 1.3 .0968 .0951 .0934 .0918 .0901 .0885 .0869 .0853 .0838 .0823
– 1.2 .1151 .1131 .1112 .1093 .1075 .1056 .1038 .1020 .1003 .0985
– 1.1 .1357 .1335 .1314 .1292 .1271 .1251 .1230 .1210 .1190 .1170
– 1.0 .1587 .1562 .1539 .1515 .1492 .1469 .1446 .1423 .1401 .1379
– 0.9 .1841 .1814 .1788 .1762 .1736 .1711 .1685 .1660 .1635 .1611
– 0.8 .2119 .2090 .2061 .2033 .2005 .1977 .1949 .1922 .1894 .1867
– 0.7 .2420 .2389 .2358 .2327 .2296 .2266 .2236 .2206 .2177 .2148
– 0.6 .2743 .2709 .2676 .2643 .2611 .2578 .2546 .2514 .2483 .2451
– 0.5 .3085 .3050 .3015 .2981 .2946 .2912 .2877 .2843 .2810 .2776
– 0.4 .3446 .3409 .3372 .3336 .3300 .3264 .3228 .3192 .3156 .3121
– 0.3 .3821 .3783 .3745 .3707 .3669 .3632 .3594 .3557 .3520 .3483
– 0.2 .4207 .4168 .4129 .4090 .4052 .4013 .3974 .3936 .3897 .3859
– 0.1 .4602 .4562 .4522 .4483 .4443 .4404 .4364 .4325 .4286 .4247
– 0.0 .5000 .4960 .4920 .4880 .4840 .4801 .4761 .4721 .4681 .4641
-42-
Probability
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
0.0 .5000 .5040 .5080 .5120 .5160 .5199 .5239 .5279 .5319 .5359
0.1 .5398 .5438 .5478 .5517 .5557 .5596 .5636 .5675 .5714 .5753
0.2 .5793 .5832 .5871 .5910 .5948 .5987 .6026 .6064 .6103 .6141
0.3 .6179 .6217 .6255 .6293 .6331 .6368 .6406 .6443 .6480 .6517
0.4 .6554 .6591 .6628 .6664 .6700 .6736 .6772 .6808 .6844 .6879
0.5 .6915 .6950 .6985 .7019 .7054 .7088 .7123 .7157 .7190 .7224
0.6 .7257 .7291 .7324 .7357 .7389 .7422 .7454 .7486 .7517 .7549
0.7 .7580 .7611 .7642 .7673 .7704 .7734 .7764 .7794 .7823 .7852
0.8 .7881 .7910 .7939 .7967 .7995 .8023 .8051 .8078 .8106 .8133
0.9 .8159 .8186 .8212 .8238 .8264 .8289 .8315 .8340 .8365 .8389
1.0 .8413 .8438 .8461 .8485 .8508 8531 .8554 .8577 .8599 .8621
1.1 .8643 .8665 .8686 .8708 .8729 .8749 .8770 .8790 .8810 .8830
1.2 .8849 .8869 .8888 .8907 .8925 .8944 .8962 .8980 .8997 .9015
1.3 .9032 .9049 .9066 .9082 .9099 .9115 .9131 .9147 .9162 .9177
1.4 .9192 .9207 .9222 .9236 .9251 .9265 .9279 .9292 .9306 .9319
1.5 .9332 .9345 .9357 .9370 .9382 .9394 .9406 .9418 .9429 .9441
1.6 .9452 .9463 .9474 .9484 .9495 .9505 .9515 .9525 .9535 .9545
1.7 .9554 .9564 .9573 .9582 .9591 .9599 .9608 .9616 .9625 .9633
1.8 .9641 .9649 .9656 .9664 .9671 .9678 .9686 .9693 .9699 .9706
1.9 .9713 .9719 .9726 .9732 .9738 .9744 .9750 .9756 .9761 .9767
2.0 .9772 .9778 .9783 .9788 .9793 9798 .9803 .9808 .9812 .9817
2.1 .9821 .9826 .9830 .9834 .9838 .9842 .9846 .9850 .9854 .9857
2.2 .9861 .9864 .9868 .9871 .9875 .9878 .9881 .9884 .9887 .9890
2.3 .9893 .9896 .9898 .9901 .9904 .9906 .9909 .9911 .9913 .9916
2.4 .9918 .9920 .9922 .9925 .9927 .9929 .9931 .9932 .9934 .9936
2.5 .9938 .9940 .9941 .9943 .9945 .9946 .9948 .9949 .9951 .9952
2.6 .9953 .9955 .9956 .9957 .9959 .9960 .9961 .9962 .9963 .9964
2.7 .9965 .9966 .9967 .9968 .9969 .9970 .9971 .9972 .9973 .9974
2.8 .9974 .9975 .9976 .9977 .9977 .9978 .9979 .9979 .9980 .9981
2.9 .9981 .9982 .9982 .9983 .9984 .9984 .9985 .9985 .9986 .9986
3.0 .9987 .9987 .9987 .9988 .9988 9989 .9989 .9989 .9990 .9990
3.1 .9990 .9991 .9991 .9991 .9992 .9992 .9992 .9992 .9993 .9993
3.2 .9993 .9993 .9994 .9994 .9994 .9994 .9994 .9995 .9995 .9995
3.3 .9995 .9995 .9995 .9996 .9996 .9996 .9996 .9996 .9996 .9997
3.4 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9998
-43-
Table entry for p and
C is the point t* with Probability p
probability p lying
above it and
probability C lying
between t * and t*.
t*
Tail probability p
df .25 .20 .15 .10 .05 .025 .02 .01 .005 .0025 .001 .0005
1 1.000 1.376 1.963 3.078 6.314 12.71 15.89 31.82 63.66 127.3 318.3 636.6
2 .816 1.061 1.386 1.886 2.920 4.303 4.849 6.965 9.925 14.09 22.33 31.60
3 .765 .978 1.250 1.638 2.353 3.182 3.482 4.541 5.841 7.453 10.21 12.92
4 .741 .941 1.190 1.533 2.132 2.776 2.999 3.747 4.604 5.598 7.173 8.610
5 .727 .920 1.156 1.476 2.015 2.571 2.757 3.365 4.032 4.773 5.893 6.869
6 .718 .906 1.134 1.440 1.943 2.447 2.612 3.143 3.707 4.317 5.208 5.959
7 .711 .896 1.119 1.415 1.895 2.365 2.517 2.998 3.499 4.029 4.785 5.408
8 .706 .889 1.108 1.397 1.860 2.306 2.449 2.896 3.355 3.833 4.501 5.041
9 .703 .883 1.100 1.383 1.833 2.262 2.398 2.821 3.250 3.690 4.297 4.781
10 .700 .879 1.093 1.372 1.812 2.228 2.359 2.764 3.169 3.581 4.144 4.587
11 .697 .876 1.088 1.363 1.796 2.201 2.328 2.718 3.106 3.497 4.025 4.437
12 .695 .873 1.083 1.356 1.782 2.179 2.303 2.681 3.055 3.428 3.930 4.318
13 .694 .870 1.079 1.350 1.771 2.160 2.282 2.650 3.012 3.372 3.852 4.221
14 .692 .868 1.076 1.345 1.761 2.145 2.264 2.624 2.977 3.326 3.787 4.140
15 .691 .866 1.074 1.341 1.753 2.131 2.249 2.602 2.947 3.286 3.733 4.073
16 .690 .865 1.071 1.337 1.746 2.120 2.235 2.583 2.921 3.252 3.686 4.015
17 .689 .863 1.069 1.333 1.740 2.110 2.224 2.567 2.898 3.222 3.646 3.965
18 .688 .862 1.067 1.330 1.734 2.101 2.214 2.552 2.878 3.197 3.611 3.922
19 .688 .861 1.066 1.328 1.729 2.093 2.205 2.539 2.861 3.174 3.579 3.883
20 .687 .860 1.064 1.325 1.725 2.086 2.197 2.528 2.845 3.153 3.552 3.850
21 .686 .859 1.063 1.323 1.721 2.080 2.189 2.518 2.831 3.135 3.527 3.819
22 .686 .858 1.061 1.321 1.717 2.074 2.183 2.508 2.819 3.119 3.505 3.792
23 .685 .858 1.060 1.319 1.714 2.069 2.177 2.500 2.807 3.104 3.485 3.768
24 .685 .857 1.059 1.318 1.711 2.064 2.172 2.492 2.797 3.091 3.467 3.745
25 .684 .856 1.058 1.316 1.708 2.060 2.167 2.485 2.787 3.078 3.450 3.725
26 .684 .856 1.058 1.315 1.706 2.056 2.162 2.479 2.779 3.067 3.435 3.707
27 .684 .855 1.057 1.314 1.703 2.052 2.158 2.473 2.771 3.057 3.421 3.690
28 .683 .855 1.056 1.313 1.701 2.048 2.154 2.467 2.763 3.047 3.408 3.674
29 .683 .854 1.055 1.311 1.699 2.045 2.150 2.462 2.756 3.038 3.396 3.659
30 .683 .854 1.055 1.310 1.697 2.042 2.147 2.457 2.750 3.030 3.385 3.646
40 .681 .851 1.050 1.303 1.684 2.021 2.123 2.423 2.704 2.971 3.307 3.551
50 .679 .849 1.047 1.299 1.676 2.009 2.109 2.403 2.678 2.937 3.261 3.496
60 .679 .848 1.045 1.296 1.671 2.000 2.099 2.390 2.660 2.915 3.232 3.460
80 .678 .846 1.043 1.292 1.664 1.990 2.088 2.374 2.639 2.887 3.195 3.416
100 .677 .845 1.042 1.290 1.660 1.984 2.081 2.364 2.626 2.871 3.174 3.390
1000 .675 .842 1.037 1.282 1.646 1.962 2.056 2.330 2.581 2.813 3.098 3.300
.674 .841 1.036 1.282 1.645 1.960 2.054 2.326 2.576 2.807 3.091 3.291
50% 60% 70% 80% 90% 95% 96% 98% 99% 99.5% 99.8% 99.9%
Confidence level C
-44-
Probability p
Table entry for p is the point
( 2 ) with probability p lying
above it.
(χ2 )
-45-
®
AP Statistics Exam
SECTION II: Free Response 2019
DO NOT OPEN THIS BOOKLET UNTIL YOU ARE TOLD TO DO SO.
At a Glance
Total Time
1 hour and 30 minutes
Number of Questions
6
Percent of Total Score
50%
Writing Instrument
Either pencil or pen with
black or dark blue ink
Electronic Device
Graphing calculator
expected
Part A
Number of Questions
5 Instructions
Suggested Time
1 hour and 5 minutes The questions for both Part A and Part B are printed in this booklet. You may use any
Percent of Section II Score blank space in the booklet to organize your answers and for scratch work, but you must
75% write your answers in the spaces provided for each answer. Pages containing statistical
tables and useful formulas are printed in this booklet.
Part B
You may wish to look over the questions before starting to work on them. It is not
Number of Questions
1
expected that everyone will be able to complete all parts of all questions. Show all your
Suggested Time work. Indicate clearly the methods you use because you will be scored on the correctness
25 minutes of your methods as well as the accuracy and completeness of your results and
Percent of Section II Score explanations. Correct answers without supporting work may not receive credit. Write your
25% solution to each part of each question in the space provided for that part. Write clearly
and legibly. Cross out any errors you make; erased or crossed-out work will not be scored.
Manage your time carefully. The proctor will announce the suggested time for Part A and
Part B, but you may proceed freely from one question to the next. You may review your
responses if you finish before the end of the exam is announced.
Form I
Form Code 4PBP4-S
90
Formulas begin on page 3.
Questions begin on page 6.
Tables begin on page 22.
-2-
Formulas
xi
x
n
1 2
sx xi x
n 1
b1
xi x yi y
xi x 2
b0 y b1 x
1 x x yi y
r i s
n 1 sx y
sy
b1 r
sx
yi yˆi 2
sb n2
1
xi x 2
-3-
(II) Probability
P ( A B ) P ( A) P ( B ) P ( A B )
P ( A B)
P ( A B)
P ( B)
E ( X ) µ x xi pi
2
Var( X ) s 2x xi µ x pi
n
P ( X k ) p k (1 p)n k
k
µ x np
s x np(1 p)
µ pˆ p
p(1 p)
s pˆ
n
µx µ
s
sx
n
-4-
(III) Inferential Statistics
statistic parameter
Standardized test statistic:
standard deviation of statistic
Single-Sample
Standard Deviation
Statistic
of Statistic
s
Sample Mean n
p(1 p)
Sample Proportion n
Two-Sample
Standard Deviation
Statistic
of Statistic
Difference of s12 s 22
sample means
n1 n2
Difference of p1 (1 p1 ) p2 (1 p2 )
sample proportions
n1 n2
observed expected 2
Chi-square test statistic expected
-5-
STATISTICS
SECTION II
Part A
Questions 1-5
Spend about 1 hour and 5 minutes on this part of the exam.
Percent of Section II score—75
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
1. Thirty-four college students were asked how much money they spent on textbooks for the current semester.
Their responses are shown in the following stemplot.
1 233455678
2 12345688999
3 122789
4 1457
5 13
6 2
7
8 1
Key: 1 2 = $120
(a) Describe a procedure for identifying potential outliers, and use the procedure to decide whether there are
outliers among the responses for the money spent on textbooks.
(a) Assume all conditions for inference are met. Construct and interpret a 95 percent confidence interval for the
slope of the least-squares regression line.
(ii) How many of the 60 sites had an exit time of 11:00 A.M. or later?
(c) Based on the histograms, what is a reasonable estimate of the median exit time for the random sample of
60 sites? Explain your reasoning.
The company will use four of each type of filter to conduct the investigation. A randomized block design will be
used.
(a) Assuming there is a difference in the effectiveness of the two filters, under what conditions will a
randomized block design be better for detecting the difference than a completely randomized design?
(c) Describe how to assign filters to wells to create a randomized block design.
(b) What is the probability that Sasha’s first delay due to traffic will occur after the fifth day of travel to work?
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
6. Researchers are studying two different designs of computer keyboards, J and K, to investigate the effectiveness
of the design on the speed of data entry. The researchers believe there is a tendency for people entering data with
keyboard J to have faster entry times compared with people entering data with keyboard K. Using the same set
of data for entry, the researchers randomly assigned 5 people to keyboard J and 6 people to keyboard K and
recorded the number of seconds each person took to enter the data. The following dotplots show the observed
entry times for the two keyboards.
(a) Explain why it is not appropriate to conduct a two-sample t-test for the difference in population means.
(b) Based on the dotplots, explain why it might be more appropriate to compare population medians instead of
population means.
Rank 1 2 3 4 5 6 7 8 9 10 11
Keyboard J K J K K
(ii) Use the completed table in (i) to calculate the sum of the ranks assigned to each keyboard.
Sum of ranks for J (SRJ ) :
The hypotheses for the Wilcoxon Rank Sum Test are as follows.
H0 : The median of the distribution of entry times for all users of keyboard J and the median of the
distribution of entry times for all users of keyboard K are the same.
Ha : The median of the distribution of entry times for all users of keyboard J is less than the median of
the distribution of entry times for all users of keyboard K.
n (n + 1)
The test statistic W for the test is W = SRJ - J J , where nJ is the number of observations for
2
keyboard J.
(d) Use the formula to calculate the test statistic W for the keyboard data.
(ii) Assign 5 ranks to keyboard J and 6 ranks to keyboard K so that W = 0. Show your assignments by
completing the following table.
Rank 1 2 3 4 5 6 7 8 9 10 11
Keyboard
END OF EXAM
-20-
Probability
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
– 3.4 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0002
– 3.3 .0005 .0005 .0005 .0004 .0004 .0004 .0004 .0004 .0004 .0003
– 3.2 .0007 .0007 .0006 .0006 .0006 .0006 .0006 .0005 .0005 .0005
– 3.1 .0010 .0009 .0009 .0009 .0008 .0008 .0008 .0008 .0007 .0007
– 3.0 .0013 .0013 .0013 .0012 .0012 .0011 .0011 .0011 .0010 .0010
– 2.9 .0019 .0018 .0018 .0017 .0016 .0016 .0015 .0015 .0014 .0014
– 2.8 .0026 .0025 .0024 .0023 .0023 .0022 .0021 .0021 .0020 .0019
– 2.7 .0035 .0034 .0033 .0032 .0031 .0030 .0029 .0028 .0027 .0026
– 2.6 .0047 .0045 .0044 .0043 .0041 .0040 .0039 .0038 .0037 .0036
– 2.5 .0062 .0060 .0059 .0057 .0055 .0054 .0052 .0051 .0049 .0048
– 2.4 .0082 .0080 .0078 .0075 .0073 .0071 .0069 .0068 .0066 .0064
– 2.3 .0107 .0104 .0102 .0099 .0096 .0094 .0091 .0089 .0087 .0084
– 2.2 .0139 .0136 .0132 .0129 .0125 .0122 .0119 .0116 .0113 .0110
– 2.1 .0179 .0174 .0170 .0166 .0162 .0158 .0154 .0150 .0146 .0143
– 2.0 .0228 .0222 .0217 .0212 .0207 .0202 .0197 .0192 .0188 .0183
– 1.9 .0287 .0281 .0274 .0268 .0262 .0256 .0250 .0244 .0239 .0233
– 1.8 .0359 .0351 .0344 .0336 .0329 .0322 .0314 .0307 .0301 .0294
– 1.7 .0446 .0436 .0427 .0418 .0409 .0401 .0392 .0384 .0375 .0367
– 1.6 .0548 .0537 .0526 .0516 .0505 .0495 .0485 .0475 .0465 .0455
– 1.5 .0668 .0655 .0643 .0630 .0618 .0606 .0594 .0582 .0571 .0559
– 1.4 .0808 .0793 .0778 .0764 .0749 .0735 .0721 .0708 .0694 .0681
– 1.3 .0968 .0951 .0934 .0918 .0901 .0885 .0869 .0853 .0838 .0823
– 1.2 .1151 .1131 .1112 .1093 .1075 .1056 .1038 .1020 .1003 .0985
– 1.1 .1357 .1335 .1314 .1292 .1271 .1251 .1230 .1210 .1190 .1170
– 1.0 .1587 .1562 .1539 .1515 .1492 .1469 .1446 .1423 .1401 .1379
– 0.9 .1841 .1814 .1788 .1762 .1736 .1711 .1685 .1660 .1635 .1611
– 0.8 .2119 .2090 .2061 .2033 .2005 .1977 .1949 .1922 .1894 .1867
– 0.7 .2420 .2389 .2358 .2327 .2296 .2266 .2236 .2206 .2177 .2148
– 0.6 .2743 .2709 .2676 .2643 .2611 .2578 .2546 .2514 .2483 .2451
– 0.5 .3085 .3050 .3015 .2981 .2946 .2912 .2877 .2843 .2810 .2776
– 0.4 .3446 .3409 .3372 .3336 .3300 .3264 .3228 .3192 .3156 .3121
– 0.3 .3821 .3783 .3745 .3707 .3669 .3632 .3594 .3557 .3520 .3483
– 0.2 .4207 .4168 .4129 .4090 .4052 .4013 .3974 .3936 .3897 .3859
– 0.1 .4602 .4562 .4522 .4483 .4443 .4404 .4364 .4325 .4286 .4247
– 0.0 .5000 .4960 .4920 .4880 .4840 .4801 .4761 .4721 .4681 .4641
-22-
Probability
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
0.0 .5000 .5040 .5080 .5120 .5160 .5199 .5239 .5279 .5319 .5359
0.1 .5398 .5438 .5478 .5517 .5557 .5596 .5636 .5675 .5714 .5753
0.2 .5793 .5832 .5871 .5910 .5948 .5987 .6026 .6064 .6103 .6141
0.3 .6179 .6217 .6255 .6293 .6331 .6368 .6406 .6443 .6480 .6517
0.4 .6554 .6591 .6628 .6664 .6700 .6736 .6772 .6808 .6844 .6879
0.5 .6915 .6950 .6985 .7019 .7054 .7088 .7123 .7157 .7190 .7224
0.6 .7257 .7291 .7324 .7357 .7389 .7422 .7454 .7486 .7517 .7549
0.7 .7580 .7611 .7642 .7673 .7704 .7734 .7764 .7794 .7823 .7852
0.8 .7881 .7910 .7939 .7967 .7995 .8023 .8051 .8078 .8106 .8133
0.9 .8159 .8186 .8212 .8238 .8264 .8289 .8315 .8340 .8365 .8389
1.0 .8413 .8438 .8461 .8485 .8508 8531 .8554 .8577 .8599 .8621
1.1 .8643 .8665 .8686 .8708 .8729 .8749 .8770 .8790 .8810 .8830
1.2 .8849 .8869 .8888 .8907 .8925 .8944 .8962 .8980 .8997 .9015
1.3 .9032 .9049 .9066 .9082 .9099 .9115 .9131 .9147 .9162 .9177
1.4 .9192 .9207 .9222 .9236 .9251 .9265 .9279 .9292 .9306 .9319
1.5 .9332 .9345 .9357 .9370 .9382 .9394 .9406 .9418 .9429 .9441
1.6 .9452 .9463 .9474 .9484 .9495 .9505 .9515 .9525 .9535 .9545
1.7 .9554 .9564 .9573 .9582 .9591 .9599 .9608 .9616 .9625 .9633
1.8 .9641 .9649 .9656 .9664 .9671 .9678 .9686 .9693 .9699 .9706
1.9 .9713 .9719 .9726 .9732 .9738 .9744 .9750 .9756 .9761 .9767
2.0 .9772 .9778 .9783 .9788 .9793 9798 .9803 .9808 .9812 .9817
2.1 .9821 .9826 .9830 .9834 .9838 .9842 .9846 .9850 .9854 .9857
2.2 .9861 .9864 .9868 .9871 .9875 .9878 .9881 .9884 .9887 .9890
2.3 .9893 .9896 .9898 .9901 .9904 .9906 .9909 .9911 .9913 .9916
2.4 .9918 .9920 .9922 .9925 .9927 .9929 .9931 .9932 .9934 .9936
2.5 .9938 .9940 .9941 .9943 .9945 .9946 .9948 .9949 .9951 .9952
2.6 .9953 .9955 .9956 .9957 .9959 .9960 .9961 .9962 .9963 .9964
2.7 .9965 .9966 .9967 .9968 .9969 .9970 .9971 .9972 .9973 .9974
2.8 .9974 .9975 .9976 .9977 .9977 .9978 .9979 .9979 .9980 .9981
2.9 .9981 .9982 .9982 .9983 .9984 .9984 .9985 .9985 .9986 .9986
3.0 .9987 .9987 .9987 .9988 .9988 9989 .9989 .9989 .9990 .9990
3.1 .9990 .9991 .9991 .9991 .9992 .9992 .9992 .9992 .9993 .9993
3.2 .9993 .9993 .9994 .9994 .9994 .9994 .9994 .9995 .9995 .9995
3.3 .9995 .9995 .9995 .9996 .9996 .9996 .9996 .9996 .9996 .9997
3.4 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9997 .9998
-23-
Table entry for p and
C is the point t* with Probability p
probability p lying
above it and
probability C lying
between t * and t*.
t*
Tail probability p
df .25 .20 .15 .10 .05 .025 .02 .01 .005 .0025 .001 .0005
1 1.000 1.376 1.963 3.078 6.314 12.71 15.89 31.82 63.66 127.3 318.3 636.6
2 .816 1.061 1.386 1.886 2.920 4.303 4.849 6.965 9.925 14.09 22.33 31.60
3 .765 .978 1.250 1.638 2.353 3.182 3.482 4.541 5.841 7.453 10.21 12.92
4 .741 .941 1.190 1.533 2.132 2.776 2.999 3.747 4.604 5.598 7.173 8.610
5 .727 .920 1.156 1.476 2.015 2.571 2.757 3.365 4.032 4.773 5.893 6.869
6 .718 .906 1.134 1.440 1.943 2.447 2.612 3.143 3.707 4.317 5.208 5.959
7 .711 .896 1.119 1.415 1.895 2.365 2.517 2.998 3.499 4.029 4.785 5.408
8 .706 .889 1.108 1.397 1.860 2.306 2.449 2.896 3.355 3.833 4.501 5.041
9 .703 .883 1.100 1.383 1.833 2.262 2.398 2.821 3.250 3.690 4.297 4.781
10 .700 .879 1.093 1.372 1.812 2.228 2.359 2.764 3.169 3.581 4.144 4.587
11 .697 .876 1.088 1.363 1.796 2.201 2.328 2.718 3.106 3.497 4.025 4.437
12 .695 .873 1.083 1.356 1.782 2.179 2.303 2.681 3.055 3.428 3.930 4.318
13 .694 .870 1.079 1.350 1.771 2.160 2.282 2.650 3.012 3.372 3.852 4.221
14 .692 .868 1.076 1.345 1.761 2.145 2.264 2.624 2.977 3.326 3.787 4.140
15 .691 .866 1.074 1.341 1.753 2.131 2.249 2.602 2.947 3.286 3.733 4.073
16 .690 .865 1.071 1.337 1.746 2.120 2.235 2.583 2.921 3.252 3.686 4.015
17 .689 .863 1.069 1.333 1.740 2.110 2.224 2.567 2.898 3.222 3.646 3.965
18 .688 .862 1.067 1.330 1.734 2.101 2.214 2.552 2.878 3.197 3.611 3.922
19 .688 .861 1.066 1.328 1.729 2.093 2.205 2.539 2.861 3.174 3.579 3.883
20 .687 .860 1.064 1.325 1.725 2.086 2.197 2.528 2.845 3.153 3.552 3.850
21 .686 .859 1.063 1.323 1.721 2.080 2.189 2.518 2.831 3.135 3.527 3.819
22 .686 .858 1.061 1.321 1.717 2.074 2.183 2.508 2.819 3.119 3.505 3.792
23 .685 .858 1.060 1.319 1.714 2.069 2.177 2.500 2.807 3.104 3.485 3.768
24 .685 .857 1.059 1.318 1.711 2.064 2.172 2.492 2.797 3.091 3.467 3.745
25 .684 .856 1.058 1.316 1.708 2.060 2.167 2.485 2.787 3.078 3.450 3.725
26 .684 .856 1.058 1.315 1.706 2.056 2.162 2.479 2.779 3.067 3.435 3.707
27 .684 .855 1.057 1.314 1.703 2.052 2.158 2.473 2.771 3.057 3.421 3.690
28 .683 .855 1.056 1.313 1.701 2.048 2.154 2.467 2.763 3.047 3.408 3.674
29 .683 .854 1.055 1.311 1.699 2.045 2.150 2.462 2.756 3.038 3.396 3.659
30 .683 .854 1.055 1.310 1.697 2.042 2.147 2.457 2.750 3.030 3.385 3.646
40 .681 .851 1.050 1.303 1.684 2.021 2.123 2.423 2.704 2.971 3.307 3.551
50 .679 .849 1.047 1.299 1.676 2.009 2.109 2.403 2.678 2.937 3.261 3.496
60 .679 .848 1.045 1.296 1.671 2.000 2.099 2.390 2.660 2.915 3.232 3.460
80 .678 .846 1.043 1.292 1.664 1.990 2.088 2.374 2.639 2.887 3.195 3.416
100 .677 .845 1.042 1.290 1.660 1.984 2.081 2.364 2.626 2.871 3.174 3.390
1000 .675 .842 1.037 1.282 1.646 1.962 2.056 2.330 2.581 2.813 3.098 3.300
.674 .841 1.036 1.282 1.645 1.960 2.054 2.326 2.576 2.807 3.091 3.291
50% 60% 70% 80% 90% 95% 96% 98% 99% 99.5% 99.8% 99.9%
Confidence level C
-24-
Probability p
Table entry for p is the point
( 2 ) with probability p lying
above it.
(χ2 )
-25-
Answer Key for AP Statistics
Practice Exam, Section I
Question 1
Describing the
2.A UNC-1.H Distribution of a
Quantitative Variable
(A) Incorrect. The distribution is not approximately normal, since the
distribution is neither mound shaped nor symmetric.
(B) Incorrect. It is true that the distribution is bimodal. However, there
are no observed data values between 1 and 8, so there is a gap
displayed in the distribution.
(C) Correct. The distribution is bimodal, with one mode at 10 and
another mode at 17. Also, there are no observed data values between
1 and 8, so there is a gap displayed in the distribution.
(D) Incorrect. The distribution is not skewed to the right. A distribution
is skewed to the right when the right tail is longer than the left.
However, there are no observed data values between 1 and 8, so
there is a gap displayed in the distribution.
(E) Incorrect. It is correct that there is a gap in the distribution.
However, the distribution is not skewed to the right. A distribution is
skewed to the right when the right tail is longer than the left tail.
Question 2
Describing the
2.A UNC-1.H Distribution of a
Quantitative Variable
(A) Correct. The only shape listed that is not represented by one of the
distributions is a uniform shape. The shape of the weight distribution
is bimodal. The shape of the pH distribution is skewed to the right.
The shape of the flexibility rating distribution is skewed to the left.
The shape of the octane rating distribution is symmetric and
unimodal.
(B) Incorrect. The shape of the weight distribution is bimodal.
(C) Incorrect. The shape of the flexibility rating distribution is skewed to
the left.
(D) Incorrect. The shape of the pH distribution is skewed to the right.
(E) Incorrect. The shape of the octane rating distribution is symmetric
and unimodal.
Question 5
The Normal
3.A VAR-2.B
Distribution
(A) Incorrect. This is an age that is close to the age of a tortoise at the
10th percentile, not the 90th percentile, of the distribution.
(B) Incorrect. This is an age that is close to the age of a tortoise at the
85th percentile, not the 90th percentile, of the distribution.
(C) Correct. The value of approximately 119.22, found using
technology, is the value that has 90 percent of the area to the left of
it in the normal distribution with mean 100 and standard deviation
15. Of the values listed, 120 is the tortoise age that is closest to
119.22.
(D) Incorrect. This is an age that is close to the age of a tortoise at the
95th percentile, not the 90th percentile, of the distribution.
(E) Incorrect. This is an age that is close to the age of a tortoise at the
98th percentile, not the 90th percentile, of the distribution.
Question 7
Comparing
2.D UNC-1.N Distributions of a
Quantitative Variable
(A) Incorrect. Boxplots provide information on the proportion of values
between certain measures in a distribution, but they give no
information about the number of rentals for the locations.
(B) Incorrect. Boxplots provide information on the proportion of values
between certain measures in a distribution, but they give no
information about the number of rentals for the locations.
(C) Correct. There is more variability in the miles driven for location B
than for location A since the interquartile range is greater for B than
for A (120 > 50 ) and the range of values for B is greater than the
range of values for A. Also, the median number of miles driven is
greater for location B than for location A ( 80 > 50 ) .
(D) Incorrect. It is true that the median is greater for B than for A.
However, the miles driven for location B display more variability, not
less variability.
(E) Incorrect. The miles driven for location B display more variability,
not less variability, and the median is not about the same for B as it is
for A.
Question 8
The Normal
3.A VAR-2.B
Distribution
(A) Incorrect. The z -score for the Ohio weight should be positive, so the
number of standard deviations should be above the mean, not below
the mean.
(B) Correct. The number of standard deviations from the mean is given
x−µ
by z = . For the farm in Iowa, the z -score is 1.645, the value
σ
of x is 1.39, the value of µ is 1.26, and the value of σ is unknown.
1.39 − 1.26
Thus, 1.645 = , and solving for σ yields approximately
σ
0.079. For the farm in Ohio, the value of σ is 0.01 greater than the
value of σ for Iowa, so σ = 0.079 + 0.01 = 0.089. The z -score for
1.39 − 1.26
Ohio is=equal to z ≈ 1.46, so the weight with respect
0.089
to the Ohio distribution is 1.46 standard deviations above the mean.
(C) Incorrect. The z -score for the Ohio weight was incorrectly
calculated by using a standard deviation of 0.079; 0.089 should
have been used.
(D) Incorrect. The z -score for the Ohio weight was incorrectly
calculated by using a standard deviation of 0.069; 0.089 should
have been used. Also, the number of standard deviations should be
above the mean, not below the mean.
(E) Incorrect. The z -score for the Ohio weight was incorrectly
calculated by using a standard deviation of 0.069; 0.089 should
have been used.
Question 10
The Normal
3.A VAR-6.B
Distribution, Revisited
(A) Incorrect. This is the probability that the number of hours worked by
a volunteer selected at random is greater than 90 in a normal
distribution with mean 80 and standard deviation 7, not the
probability that the volunteer selected will receive the certificate of
merit given that the number of hours the volunteer worked is less
than 90.
(B) Incorrect. This is the probability that a volunteer selected at random
will have worked between 85.89 hours and 90 hours, not the
probability that the volunteer selected will receive the certificate of
merit given that the number of hours the volunteer worked is less
than 90.
(C) Correct. If X represents the number of hours worked, then the
value of X for which 20 percent of the hours worked are greater
than X in a normal distribution with mean 80 and standard
deviation 7 can be found using technology to be approximately
85.89. Then the probability that the volunteer selected will receive a
certificate of merit given that the number of hours the volunteer
worked is less than 90 is given by
P ( 85.89 < X < 90 )
P ( X > 85.89 | X < 90) = . Technology can be
P ( X < 90)
used to find that P ( 85.89 < X < 90 ) ≈ 0.1235 and that
P ( X < 90) ≈ 0.924 in a normal distribution with mean 80 and
standard deviation 7, so
P (85.89 < X < 90) 0.1235
P ( X > 85.89
= | X < 90) ≈ ≈ 0.134.
P ( X < 90) 0.9234
(D) Incorrect. This is approximately equal to dividing the probability
that a volunteer selected at random will have worked greater than 90
hours by the probability that a volunteer selected at random will have
worked between 85.89 hours and 90 hours.
(E) Incorrect. This is the probability that a volunteer selected at random
will have worked less than 90 hours, not the probability that the
volunteer selected will receive the certificate of merit given that the
number of hours the volunteer worked is less than 90.
Question 11
Describing the
2.A UNC-1.H Distribution of a
Quantitative Variable
(A) Incorrect. One of the three values ( 60 ) is an outlier.
(B) Correct. The interquartile range is 76 − 70 =6 for the age-group
40 to 50, and 1.5 times the interquartile range is (1.5 )( 6 ) = 9.
Then Q1 − 9 = 70 − 9 = 61, and Q3 + 9 = 76 + 9 = 85. Of the
numbers 60, 62, and 84, only 60 is less than 61 or greater than
85, so 60 is the only outlier.
(C) Incorrect. It is true that the value 60 is an outlier. However, the
value 62 is not an outlier because 62 is not less than
Q1 − 1.5 ( IQR ) or greater than Q3 + 1.5 ( IQR ) .
(D) Incorrect. It is true that the value 60 is an outlier. However, the
value 84 is not an outlier because 84 is not less than Q1 − 1.5 ( IQR )
or greater than Q3 + 1.5 ( IQR ) .
(E) Incorrect. Only one of the three values ( 60 ) is an outlier.
Question 12
Sampling Distributions
4.B UNC-3.Q
for Sample Means
(A) Incorrect. This sampling distribution has the same shape as the
population distribution (left-skewed). Because the sample size is
sufficiently large, the sampling distribution of the sample mean
should be approximately normal.
(B) Correct. For samples of size 40, the sampling distribution of the
sample mean should be approximately normal, with a mean equal to
µ x= µ= 85 and standard deviation equal to
σ 18
σ= x = ≈ 2.85. This graph appears to be approximately
n 40
normal, centered at 85, and with a standard deviation of
approximately 2.85.
(C) Incorrect. It is correct that the sampling distribution of the sample
mean should be approximately normal with a mean of 85. However,
the standard deviation of the sampling distribution of the sample
σ 18
mean should be equal to σ = x = ≈ 2.85, and the standard
n 40
deviation in this graph appears to be much less than 2.85.
(D) Incorrect. It is correct that the sampling distribution of the sample
mean should be approximately normal. However, the sampling
distribution of the sample mean should be centered at the population
mean of 85, not centered at 66.
(E) Incorrect. Because the sample size is sufficiently large, the sampling
distribution of the sample mean should be approximately normal,
not right-skewed. Also, the sampling distribution of the sample mean
should be centered at the population mean of 85, not at
approximately 35.
Question 14
Introduction to
1.C VAR-3.A
Experimental Design
(A) Incorrect. Replication exists because there were 10 members
assigned to each exercise type, not because there are four types of
exercise.
(B) Incorrect. Replication exists because there were 10 members
assigned to each exercise type, not because the experiment was
conducted over a six-week period.
(C) Incorrect. The response variable is the change in maximal oxygen
consumption measured, not the type of exercise.
(D) Correct. The values for the explanatory variable (exercise) are the
treatments, and these values are strength training, flexibility training,
aerobics, and jogging.
(E) Incorrect. An experimental unit is the smallest unit to which a
treatment is applied. Each of the 40 members who participated is an
experimental unit, not the four different types of exercise.
Question 17
Combining Random
3.B VAR-5.E
Variables
(A) Incorrect. This value was calculated by using a normal distribution
with a correct mean of −15 but using a standard deviation that was
incorrectly calculated by subtracting the standard deviations of Sean
and Evan.
(B) Incorrect. This value was calculated by using a normal distribution
with a correct mean of −15 but by incorrectly using Evan's standard
deviation.
(C) Incorrect. This value was calculated by using a normal distribution
with a correct mean of −15 but using a standard deviation that was
incorrectly calculated as 252 − 152 =
20.
(D) Correct. Let S and E represent Sean's weekly income and Evan's
weekly income, respectively. Because S and E are both
approximately normal and independent, the distribution of S − E
will be approximately normal with mean
xS − xE = 225 − 240 = −15 and standard deviation
σ S2 − E = σ S2 + σ E2 =
252 + 152 = 850. The probability that
Sean's income is greater than Evan's income is P ( S − E > 0 ) in a
normal distribution with mean −15 and standard deviation 850,
which can be found using technology to be approximately 0.303.
(E) Incorrect. This value was calculated by using a normal distribution
with a correct mean of −15 but using a standard deviation that was
incorrectly calculated as 25 + 15 = 40.
Question 18
Sampling Distributions
3.C UNC-3.L|UNC-3.K
for Sample Proportions
(A) Incorrect. It is correct that the mean is 0.36 and the standard
deviation is 0.076. However, the sampling distribution of the sample
proportion is approximately normal because the sample size is large
enough.
(B) Incorrect. The sampling distribution of the sample proportion is
approximately normal because the sample size is large enough. Also,
the mean and standard deviation are not correct. The mean of the
sampling distribution of the sample proportion is given by µ pˆ = p,
p (1 − p )
and the standard deviation is given by σ pˆ = .
n
(C) Correct. The sampling distribution of the sample proportion is
approximately normal because the sample size is large enough
( np 40
= = ( 0.36 ) 14.4 and n (1 − p ) = 40 (1 − 0.36 ) = 25.6, each
of which is greater than 10). The mean of the sampling distribution
of p̂ is µ p̂= p= 0.36, and the standard deviation of the sampling
p (1 − p ) ( 0.36 )( 0.64 )
=
distribution of p̂ is σ p̂ = ≈ 0.076.
n 40
(D) Incorrect. It is correct that the sampling distribution is
approximately normal and the mean is 0.36. However, the standard
deviation is incorrect. The standard deviation of the sampling
p (1 − p )
distribution of the sample proportion is given by σ pˆ = .
n
(E) Incorrect. It is correct that the sampling distribution is
approximately normal and the standard deviation is 0.076.
However, the mean is incorrect. The mean of the sampling
distribution of the sample proportion is given by µ p̂ = p.
Question 19
Inference and
4.B VAR-3.E
Experiments
(A) Incorrect. The 25 student athletes who received the beetroot juice
are the athletes in the treatment group, but the results of the study
can be generalized to the population from which the sample was
selected.
(B) Incorrect. The 50 student athletes in the sample are the athletes used
in the experiment, but the results of the study can be generalized to
the population from which the sample was selected.
(C) Correct. The largest population to which the results can be
generalized is the population from which the sample was selected,
which is all student athletes at the college.
(D) Incorrect. The results of the study can only be generalized to the
population from which the sample was selected, which only includes
student athletes at the college, not other students at the college who
are not athletes.
(E) Incorrect. The results of the study can only be generalized to the
population from which the sample was selected, which only includes
student athletes at the college and does not include other people who
exercise but are not from the college.
Question 20
Representing Two
2.D UNC-1.P
Categorical Variables
(A) Incorrect. Association cannot be determined from the bar graph.
(B) Incorrect. Association cannot be determined from the bar graph.
(C) Incorrect. The graph shows the percents of returned surveys, but the
numbers cannot be determined unless the total number of surveys is
known.
(D) Incorrect. Symmetric and skewed results have no meaning in the
context of the bar graph.
(E) Correct. According to the graph, the rate of return for the Dining
Hall delivery method was approximately 33 percent, for the
Psychology delivery method was approximately 48 percent, and for
the In Class delivery method was approximately 58 percent. The In
Class delivery method had the greatest rate of return, and the Dining
Hall delivery method had the least rate of return.
Question 21
Least Squares
2.A DAT-1.G
Regression
(A) Incorrect. This incorrectly describes the meaning of the correlation
coefficient r ; the correlation coefficient is a measure of the strength
of the linear association between age and height and does not give
the relationship between an individual age and height.
(B) Incorrect. The correlation coefficient r is not equal to the slope of
the regression line; the correlation coefficient is a measure of the
strength of the linear association between age and height.
(C) Correct. The coefficient of determination, r 2 , is the proportion of
the variation in height that is explained by the least-squares
regression line. The value of the coefficient of determination is
=r 2 (=0.8 )2 0.64, so the proportion of the variation in height that
is explained by a regression on age is 0.64.
(D) Incorrect. The correlation coefficient r does not give a probability of
predicting the height; the correlation coefficient is a measure of the
strength of the linear association between age and height.
(E) Incorrect. The square of the correlation coefficient, r 2 , does not
give a probability of predicting the height; the coefficient of
( )
determination r 2 is the proportion of the variation in the
response variable explained by the least-squares regression line.
Question 22
Sampling Distributions
3.C UNC-3.R|UNC-3.Q
for Sample Means
(A) Incorrect. It is correct that the sampling distribution of the sample
mean is approximately normal and that the mean is 11.4. However,
the standard deviation is incorrect. The standard deviation is given
by the formula σ x =
σ
.
n
(B) Correct. The distribution of wait times is approximately normal
because the sample size of 84 is greater than 30. The mean of the
sampling distribution of the sample mean is µ x= µ= 11.4, and the
standard deviation of the sampling distribution of the sample mean
is σ
σ 2.6
= x = .
n 84
(C) Incorrect. It is correct that the sampling distribution of the sample
mean is approximately normal. However, the value of 12.0 is the
value for which the manager wishes to calculate a probability; it is
not equal to the mean of the sampling distribution. The mean of the
sampling distribution of the sample mean is µ x = µ . Also, the value
of 2.6 is not the correct standard deviation for the sampling
distribution of the sample mean. The standard deviation of the
sampling distribution is given by σ x =
σ
.
n
(D) Incorrect. The distribution of the sample mean is not binomial. The
mean and standard deviation are not correct since they are calculated
using formulas for the binomial distribution.
(E) Incorrect. The distribution of the sample mean is not binomial. The
mean and standard deviation are not correct since they are calculated
using formulas for the binomial distribution.
Question 23
Question 27
Introduction to
1.B VAR-3.B
Experimental Design
(A) Incorrect. It is not a requirement that the number of subjects in each
block in a randomized block design be different. The number of
subjects in each block can be equal or different.
(B) Correct. A feature of a well-designed experiment is randomization,
which reduces the chance of bias in experimental groups.
Randomization can be achieved in an experiment by randomly
assigning treatments to subjects within each block.
(C) Incorrect. Blocking by age-group does not mean that there cannot be
a control group.
(D) Incorrect. There is no matching between groups in this experiment.
The subjects in one group and the subjects in the other group are
different and not paired in any way.
(E) Incorrect. In a randomized block design, subjects within each of the
blocks are randomly assigned to the two treatments.
Question 28
Sampling Distributions
3.B UNC-3.K
for Sample Proportions
(A) Incorrect. There is no variance associated with a single sample
proportion.
(B) Incorrect. There is no variance associated with a single population
proportion.
(C) Correct. The variance of the sampling distribution of the sample
p (1 − p )
proportion is given by σ 2p̂ = . If the value of n is
n
decreased, the value of the fraction will increase. Therefore, the
variance of the sampling distribution of the sample proportion will
increase.
(D) Incorrect. As sample size decreases, the variance of the sampling
distribution of the sample proportion will increase, not decrease.
(E) Incorrect. The variance of the sampling distribution of the sample
proportion will change as the value of n changes in the formula
p (1 − p )
σ 2p̂ = .
n
Question 32
given by p̂c
test is =
n1 p̂1 + n2 p̂2
=
40 ( 1440=
) + 40 ( 1040 ) 0.3. The
n1 + n2 40 + 40
test statistic is equal to
14 10
( pˆ1 − p̂2 ) − 0 −
= z = 40 40 ≈ 0.976.
p̂c (1 − p̂c )
1
+
n1 n2
1 3
10
1−
3 1
10 40 40
+
1
( )
The corresponding p -value, found using technology, is
approximately 0.1645, which is very large, so there is not
convincing statistical evidence at any reasonable significance level.
Question 35
Combining Random
3.B VAR-5.E
Variables
(A) Incorrect. It is true that the mean is 34 seconds. It is not true,
however, that the variables are independent, since X and Y
represent the running times before and after training for the same
student, and it is not true that the standard deviation is 10 seconds.
(B) Incorrect. It is true that the mean is 34 seconds. It is not true,
however, that the variables are independent, since X and Y
represent the running times before and after training for the same
student, and it is not true that the standard deviation is 50 seconds.
(C) Incorrect. It is true that the variables X and Y are not independent,
since X and Y represent the running times before and after
training for the same student. There is, however, enough information
to calculate the mean, but there is not enough information provided
to calculate the standard deviation.
(D) Correct. The random variables X and Y represent the running
times before and after training for the same student, so the variables
are dependent, not independent. The mean of X − Y is
µ X −Y = µ X − µY = 402 − 368 = 34 seconds. If X and Y are
independent, the variance σ X2 −Y of X − Y is equal to σ X2 + σ Y2 .
Since X and Y are not independent, the variance and hence the
standard deviation cannot be determined with the given information.
(E) Incorrect. It is true that the variables X and Y are not independent,
since X and Y represent the running times before and after
training for the same student, and it is true that there is not enough
information to calculate the standard deviation. There is, however,
enough information to calculate the mean.
Question 38
Question 1
Intent of Question
The primary goals of this question were to assess a student’s ability to (1) describe a procedure for identifying a
potential outlier, (2) apply the outlier identification procedure to data presented in a stemplot and (3) describe
the distribution of data presented in a stemplot.
Solution
Part (a):
Outliers are identified as any observation outside of the interval bounded by Q1 − 1.5 ( IQR ) and
Q3 + 1.5 ( IQR ) . The IQR is Q3= − Q1 390 –= 180 $210. The interval lower bound is
180 − 1.5 ( 210 ) = −$135 and the interval upper bound is 390 + 1.5 ( 210 ) =$705. Since $810 is outside of
this interval, it is an outlier. No other observation is outside the interval.
Part (b):
The distribution of the amount of money students spent on textbooks is unimodal and skewed to the right.
The sample median is between $280 and $290. Money spent on textbooks ranges from about $120 to $810
for this sample of students, with fifty percent of the amounts between about $180 and $390. The largest
observation is a possible outlier.
Question 1 (continued)
Scoring
Parts (a) and (b) are scored as essentially correct (E), partially correct (P), or incorrect (I).
Essentially correct (E) if the response includes the following two components:
1. Describes a reasonable procedure for identifying potential outliers.
2. Correctly applies the described procedure for detecting potential outliers to the data presented in
the stemplot AND clearly identifies potential outliers.
Partially correct (P) if the response includes only one of the two components.
Incorrect (I) if the response does not meet the requirements for E or P.
Notes:
• Responses that that satisfy component 1 include, but are not limited to, checking for observations
that are more than (1.5 )( IQR ) above the upper quartile or more than (1.5 )( IQR ) below the lower
quartile, checking for observations that are more than two (or three) standard deviations away from
the center of the data (sample mean or sample median), looking for one or more large gaps
between any extreme observation and the rest of the data.
• Using the values in the stemplot, the sample mean is $308.82 and the sample standard deviation is
$155.71. The sample median is between $280 and $290. There are no observations more than two
standard deviations below the sample mean or sample median. The largest observation is a
potential outlier because it is more than two (and three) standard deviations above the sample mean
(or sample median).
• If the response does not have a clearly described procedure but shows correct work for the IQR or
standard deviation approach, credit is given for component 2.
• Responses that interpret values in the stemplot as purchase amounts that are rounded down, should
not be penalized. For example, the second largest observation may be interpreted as a purchase
amount between $620 and $629.
• If the values in the stemplot are interpreted as being rounded down, the largest observation is an
outlier because it is more than two (and three) standard deviations above the sample mean (or
sample median). The second largest observation is a potential outlier based on the two standard
deviation criterion because ( sample mean ) + 2 ( standard deviations ) = $620.24 and the second
largest observation is interpreted as a purchase amount between $620 and $629.
• If the procedure checks for a gap, it must identify the largest observation as a potential outlier and
it must not identify any other observation as a potential outlier.
Question 1 (continued)
Essentially correct (E) if the response includes reasonable comments on the following four components.
1. The shape of the distribution is skewed to the right.
2. The center of the distribution is around $300. (Any value between $280 and $320 is acceptable.)
3. The spread of the distribution.
4. The response includes context.
Partially correct (P) if the response includes only three of the four components.
Incorrect (I) if the response does not meet the requirements for E or P.
Notes:
• Because part (a) addresses possible outliers, the response to part (b) is not required to address
potential outliers.
• Responses that satisfy component 3 include, but are not limited to, referring to the range ($690),
referring to the standard deviation (any value between $150 and $160 is acceptable), referring to
the interquartile range (any value between $200 and $240 is acceptable), or stating that most or all
purchase amounts are between two reasonable amounts (simply stating the values without linking
them as endpoints of an interval would not receive credit).
• Responses that interpret values in the stemplot as purchase amounts that are rounded down, should
not be penalized; for example, stating that the mean amount of money spent on textbooks is
between $308.82 and $317.82.
Question 1 (continued)
4 Complete Response
3 Substantial Response
2 Developing Response
1 Minimal Response
Question 2
Intent of Question
The primary goal of this question were to assess a student’s ability to (1) construct and interpret a confidence
interval for the slope of a regression line; and (2) determine if the confidence interval contradicts or supports a
prior belief/claim.
Solution:
(a) The 95 percent confidence interval for the slope is computed as follows:
−2.158 ± t18,0.975 × (0.149) = − 2.158 ± (2.101) × (0.149)
= (−2.471, − 1.845)
We are 95% confident that the slope of the true regression line is between –2.471 and –1.845 thousands
of dollars per mile. This implies that for each additional mile that a three-bedroom house is away from the
city center, the selling price of the house is expected to decline between $1,845 and $2,471.
(b) Because the confidence interval contains −2, corresponding to a $2,000 decrease, it is a plausible value for
the slope of the regression line. Consequently, the data do not contradict the agent’s belief that the selling
prices of three bedroom houses decrease about $2,000 for every one-mile increase in the distance of the
house from the city center.
Scoring
This question is scored in three sections. Section 1 consists of computing the 95% confidence interval for the
slope of the true regression line in part (a). Section 2 consists of the interpretation of the confidence interval in
part (a). Section 3 consists of the response to part (b). Sections 1, 2, and 3 are each scored as
essentially correct (E), partially correct (P), or incorrect (I).
Question 2 (continued)
Essentially correct (E) if the response satisfies the following four components:
1. Uses a 95% level of confidence
2. Provides a correct statement about potential values for the population or expected slope of the
regression line.
3. Uses the end points of the confidence interval.
4. Presents the interpretation in context.
Partially correct (P) if the response satisfies only three of the four components.
Incorrect (I) if the response does not meet the criteria for E or P.
Essentially correct (E) if the response satisfies the following two components:
1. Provides a correct conclusion about whether the confidence interval contradicts the agent’s belief
based on the results from part (a). If the correct confidence interval is reported in part (a), then the
correct conclusion is that the confidence interval does not contradict (or supports) the agents belief,
but it should not indicate that it proves the agent’s belief.
2. Links the conclusion to the confidence interval based on where −2,000, a decrease of 2,000, or −2,
is relative to the interval.
Partially correct (P) if the response satisfies only one of the two components.
Incorrect (I) if the response does not meet the criteria for E or P.
Question 2 (continued)
4 Complete Response
3 Substantial Response
2 Developing Response
1 Minimal Response
Question 3
Intent of Question
The primary goals of this question were to assess a student’s ability to (1) extract information about counts from
a histogram; (2) use histograms to compare distributions; (3) estimate a median from the combined information
in two histograms.
Solution
Part (a):
(i) Three sites were vacated before 8:30 A.M. This is the sum of the counts represented by the two leftmost
bars of the histogram of exit times for campsites without young children. No campers with small
children vacated their campsites before 8:30 A.M.
(ii) Eight sites were vacated at 11:00 A.M. or later. This is the sum of the counts represented by the two
rightmost bars on each histogram.
Part (b):
The distribution of exit times for campers without young children is skewed to the left while the distribution
for campers with young children is roughly symmetric. Both exit time distributions appear to be unimodal.
The distribution of exit times is more spread out for campers without young children; the largest possible
range is around 210 minutes compared to a largest possible range of around 105 minutes for campers with
young children. Campers without young children tend to leave the campground earlier than campers with
young children; the median exit time for campers without young children is between 60 and 75 minutes after
9:00 A.M. which is less than the median exit time for campers with young children, which is between 90
and 105 minutes after 9:00 A.M.
Part (c)
There are a total of 60 exit times, so the median falls between the 30th and 31st exit times. Since the 30th and
31st exit times fall between 10:15 and 10:30, any time between 10:15 A.M. and 10:30 A.M. provides a
reasonable estimate of the median exit time.
Scoring
Parts (a), (b) and (c) are each scored as essentially correct (E), partially correct (P), or incorrect (I).
Essentially correct (E) if the response satisfies the following two components:
1. Correctly states that 3 sites were vacated before 8:30 A.M. in part (i).
2. Correctly states that 8 sites were vacated at 11:00 A.M. or later in part (ii)
Partially correct (P) if the response contains only one of the two components.
OR
Correctly estimates the counts at each site separately, but does not combine them for a total count.
Incorrect (I) if the response does not meet the criteria for E or P.
Question 3 (continued)
Essentially correct (E) if the response satisfies the following four components:
1. Correct comparison of the centers of the two distributions
2. Correct comparison of the spread of the two distributions
3. Correct comparison of the shapes of the two distributions
4. Includes context
Partially correct (P) if the response satisfies only two or three of the four components.
Incorrect (I) if the response does not meet the criteria for E or P.
Essentially correct (E) if the response satisfies the following two components:
1. Reports a specific time or range of times between 10:15 A.M. and 10:30 A.M. (between 75 and 90
minutes after 9:00 A.M)
2. Gives a reasonable justification.
Partially correct (P) if the response satisfies only one of the two components
OR
if the response correctly estimates median exit times for both sets of sites (between 10:00 A.M and 10:15
A.M., or 60-75 minutes, for campers without young children, and between 10:30 A.M and 10:45 A.M., or
90-105 minutes, for campers with young children )
Question 3 (continued)
4 Complete Response
3 Substantial Response
2 Developing Response
1 Minimal Response
Question 4
Intent of Question
The primary goals of this question were to assess a student’s ability to (1) identify conditions under which a
randomized block design would be better than a completely randomized design for detecting a difference in
mean responses for two treatments, and (2) describe how an experiment can be conducted as a randomized block
design.
Solution
Part (a):
A randomized block experiment will be better for detecting a difference between the abilities of the new
and old filters to reduce arsenic concentrations when (1) there is at least one feature of the wells that enable
the researchers to create blocks such that each block consists of wells with similar arsenic concentrations
and (2) the average arsenic concentration varies among blocks. This would occur, for example, when well
water arsenic concentrations at all four wells near the burned forest are higher than the arsenic
concentration at any well near the granite ledge.
Part (b):
To take advantage of the differences among wells, well water arsenic concentrations must be similar for
wells with each block, but average arsenic concentrations should vary among blocks. A randomized block
experiment with two blocks could be conducted by including wells 1, 2, 3, and 4, on the granite ledge side
of the field in one block and including wells 5, 6, 7, and 8, on the burned forest side of the field in the other
block.
Part (c)
Within each block, the new filter should be randomly assigned to the same number of wells as the old
filter. For the blocks identified in part (b), this can be accomplished by writing well numbers 1, 2, 3, and 4
on four slips of paper, placing the four slips into a hat, and mixing them. New filters would be used for the
wells on the first two slips drawn from the hat and old filters would be used for the other two wells.
Similarly, well numbers 5, 6, 7, and 8 can be written on four slips of paper that are put into a hat and
mixed. New filters would be used for the wells on the first two slips drawn from the hat and old filters
would be used for the other two wells.
Question 4 (continued)
Scoring
Parts (a), (b) and (c) are each scored as essentially correct (E), partially correct (P), or incorrect (I).
Essentially correct (E) if the response satisfies the following three components:
1. Well water arsenic concentrations are similar for all wells within the same block.
2. Average well water arsenic concentrations vary substantially from block to block.
3. Statements are presented in the context of arsenic levels and wells.
Partially correct (P) if the response includes only two of the three components.
Essentially correct (E) if the response satisfies the following two components. Blocks are formed so that
1. Each block contains wells in locations with similar features, consistent with the scenario identified
in the response,
2. Each block contains the same number of wells (either two or four).
Partially correct (P) if the response satisfies only one of the two components.
Incorrect (I) it the response does not meet the criteria for E or P.
Notes:
• Suppose the response to part (a) indicates that arsenic concentrations might be similar for wells on
the granite ledge side of the field, and arsenic concentrations might be similar for wells of the
burned forest side of the field, but arsenic concentrations might be quite different on those two sides
of the field. Then, the two components required for E are satisfied by including wells 1, 2, 3, 4, in
one block and including wells 5, 6, 7 and 8 in the other block. The two components required for E
are also satisfied with four blocks with wells 1 and 2 in one block, wells 3 and 4 in a second block,
wells 5 and 6 in a third block, and wells 7 and 8 in a fourth block. For this response to part (a),
component 1 is not satisfied with four blocks consisting of wells 1 and 5, 2 and 6, 3 and 7, 4 and 8.
• If the response to part (a) uses a scenario in which arsenic levels decrease as one moves from the
left toward the right side of the field, for example, then components 1 and 2 are satisfied with four
blocks consisting of wells 1 and 5, 2 and 6, 3 and 7, 4 and 8. For this response to part (a), however,
component 1 is not satisfied with four blocks consisting of wells 1 and 2, 3 and 4, 5 and 6, 7 and 8.
• If no conditions are identified in the response to part (a) and no conditions are given in the response
to part (b), use the granite ledge and burned forest sides of the field as the conditions for creating
blocks in scoring the response to part (b).
• If additional or alternative conditions are identified in part (b), use those conditions in scoring the
response to part (b).
Question 4 (continued)
Essentially correct (E) if the response satisfies the following three components:
1. An indication that types of filters are randomly assigned to wells.
2. Within each block, each type of filter is assigned to half of the wells.
3. Describes how to correctly implement the random assignment process.
Partially correct (P) if the response satisfies only two of the three components.
Incorrect (I) if the response does not meet the criteria for E or P.
Notes:
• Sufficient detail must be provided to enable a knowledgeable statistics user to implement the
randomization method. Some additional acceptable methods are:
o Using a random number generator to select half of the wells in a block for which the newer
filters will be used.
o For blocks consisting of just two wells, tossing a coin.
• Each well must have a 50% chance of using a newer filter and a 50% chance of using an older filter,
and half of the wells within the block must be assigned to each filter.
• Responses to part (c) for designs that do not have an even number of wells in each block are scored
no higher than P.
Question 4 (continued)
4 Complete Response
3 Substantial Response
2 Developing Response
1 Minimal Response
Question 5
Intent of Question
The primary goals of this question were to assess a student’s ability to (1) recognize a situation in which a
binomial probability distribution may be applied; (2) discuss conditions for accurately using a normal
approximation to a binomial probability; (3) compute appropriate binomial probabilities.
Solution
Part (a):
The probability that Sasha will experience at least three days with traffic delays in the next 21 days that she
travels to work is
Part (b):
The probability that Sasha’s first traffic delay will occur after the fifth day is the probability that no traffic
delay occurs on any of the first five days:
5
P (No delay during first 5 =
days) (0.8)
= 0.32768.
Part (c):
No. The sampling distribution of the sample proportion is right skewed. It may not be well approximated by
a normal distribution because the expected number of traffic delays in 21= = 4.2 , is
days, np (21)(0.20)
less than 10.
Question 5 (continued)
Scoring
Parts (a), (b) and (c) are each scored as essentially correct (E), partially correct (P), or incorrect (I).
Essentially correct (E) if the response satisfies the following three components:
1. Clearly indicates a binomial distribution with n = 21 and p = 0.2.
2. Indicates the correct boundary value and direction of the event.
3. Reports the correct probability.
Partially correct (P) if the response satisfies component 1 and it does not satisfy one or both of the other two
components
OR
the response does not satisfy component 1 and both of the other two components are satisfied.
Incorrect (I) if the response does not meet the criteria for E or P.
Notes:
• The response B(21, 0.2) satisfies component 1.
• Component 1 and 2 are satisfied by displaying the correct formula for computing the binomial
probability using the correct values for n and p, e.g.
21 21 21
1 − (0.2)0 (0.8)21 + (0.2)1 (0.8)20 + (0.2)2 (0.8)19
0 1 2
Only component 1 is satisfied if the correct binomial distribution is used in an incorrect probability
formula, e.g.
21 3 18
3 (0.2) (0.8) .
• For component 2, the boundary value and direction may be described in words, e.g. P(at least three
delays in the next 21 days).
• Component 2 may be satisfied by displaying a bar graph of a binomial distribution with the
appropriate bars shaded.
• The response of 1 − binomcdf ( n = 21, p =0.2, upper bound = 2 ) ≈ 0.8213 is scored E since n, p
and the boundary value are clearly identified.
The response of 1 − binomcdf ( n = 21, p =0.2, 2 ) ≈ 0.8213 is scored P since n, p are clearly
identified and the boundary value is not identified.
The response of 1 − binomcdf ( 21,0.2, 2 ) ≈ 0.8213 is scored I.
Question 5 (continued)
• A normal approximation to the binomial is not appropriate since np =21 × 0.2 =4.2 < 5.
A response using the normal approximation can score at most P. To score P, the response must
include all of the following:
o An indication that the probability calculated is a normal approximation for the binomial
probability
o A correct mean and standard deviation based on the binomial parameters
o Clear indication of boundary and direction with a z-score or diagram
o The probability computed correctly
An example of a response, which meets these four criteria, is
3 − np 3 − (21)(0.2)
P Z ≥ P Z ≥
= ≈ 0.744 and the binomial
np (1 − p ) (21)(0.2)(0.8)
distribution is mentioned.
Essentially correct (E) if the response satisfies the following two components:
1. The correct probability
2. An appropriate justification.
Partially correct (P) if the response satisfies only one of the two components
OR
if the response correctly computes the probability that the first traffic delay is on day 6,
(0.8)(0.8)(0.8)(0.8)(0.8)(0.2) = 0.065536, with supporting work.
Incorrect (I) if the response does not meet the criteria for E or P.
Notes:
• Component 2 is satisfied if the response uses a binomial distribution with n = 5 and p = 0.2 to
compute the probability of no successes in n = 5 trials; or uses direct probability rules to obtain
(0.8)(0.8)(0.8)(0.8)(0.8) = 0.32768.
• Responses that find the probability of the first delay occurring on day 6 or more, with an incorrect
upper bound for the sum, should be scored P.
For example, 0.85 ( 0.2 ) + 0.86 ( 0.2 ) + ... + 0.820 ( 0.2 ) .
• Responses that find the probability of traffic delays on each of the first five days,
( 0.2 )( 0.2 )( 0.2 )( 0.2 )( 0.2 ) = 0.00032, should be scored P.
• Responses using a normal approximation to the binomial distribution should be scored I.
Question 5 (continued)
Essentially correct (E) if the response contains the following three components:
1. Calculates
= np (= 21)( 0.2 ) 4.2
2. Indicates that np is less than 10 (or less than 5).
3. Reaches the conclusion that the condition is not met.
Partially correct (P) if the response satisfies only two of the three components;
OR
if the response satisfies component 3 and indicates that the sample size is too small with respect to a
commonly used standard (for example, n < 30 ), but does not establish a link to the expected count
condition.
Incorrect (I) if the response does not meet the criteria for E or P.
Note: A response that simply indicates that the sample size is too small for the distribution of the sample
proportion to be approximately normal is scored I.
Question 5 (continued)
4 Complete Response
3 Substantial Response
2 Developing Response
1 Minimal Response
Question 6
Intent of Question
The primary goals of this question were to assess a student’s ability to (1) analyze results from a
completely randomized experiment; (2) identify a situation in which it may be better to compare
medians instead of means; (3) develop a test procedure for comparing medians; and (4) interpret
results of the test.
Solution
Part (a):
Because the samples are small and the dot plots suggest potential outliers for both keyboards, it is
not appropriate to perform a two-sample t-test for the difference between the population means.
Part (b):
It is more appropriate to compare population medians because the dot plots indicate that the
distributions of data entry times have outliers to the left for both keyboards. When outliers are
present, the median provides a better indicator of the typical values than the mean. For each
keyboard, the mean of the population of data entry times will be pulled away from the typical
values toward the extreme values in the left tail of the distribution.
Part (c):
(i) The completed table rankings for the combined set of 11 data entry times is shown below
Rank 1 2 3 4 5 6 7 8 9 10 11
Keyboard J K J J J J K K K K K
Time 158 184 240 248 251 261 267 279 280 284 305
Part (d):
nJ (nJ + 1) (5)(5 + 1)
W =
SRJ − 19 −
= 4
=
2 2
Question 6 (continued)
Part (e):
nJ (nJ + 1)
(i) To obtain W = 0, we must have=0 SRJ − which implies that
2
nJ (nJ + 1) (5)(5 + 1)
=SRJ = = 15 .
2 2
(ii) All five ranks for keyboard J must be smaller than any of the ranks for keyboard K.
Rank 1 2 3 4 5 6 7 8 9 10 11
Keyboard J J J J J K K K K K K
Part (f):
For the one-sided alternative that the median of the distribution of data entry times for keyboard J
is smaller than the median of the distribution of data entry times for keyboard K, the p-value is the
proportion of possible arrangements with W values of 4 or less. From the graph, the p-value is
12
= 0.026. Because the p-value is smaller than the α = 0.05 significance level, there is
462
sufficient evidence to reject the null hypothesis in favor of the alternative that the median of the
distribution of data entry times for keyboard J is smaller than the median of the distribution of data
entry times for keyboard K.
Scoring
This question is scored in four sections. Section 1 consists of parts (a) and (b), section 2 consists of
parts (c) and (d), section 3 consists of part (e), and section 4 consists of part (f). Sections 1, 2, 3 and 4
are each scored as essentially correct (E), partially correct (P), or incorrect (I).
Essentially correct (E) if the response includes the following two components:
1. The response to part (a) correctly argues that a two-sample t-test is not appropriate because
the sample sizes are small and the dot plots show potential outliers (or left skewed
distributions).
2. The response to part (b) uses the outliers (skewness) in the entry time distributions to argue
that the medians are better indicators of typical entry times than the means.
Partially correct (P) if the response includes only one of the two components.
Incorrect (I) if the response does not meet the criteria for E or P.
Question 6 (continued)
Essentially correct (E) if the response includes the following three components:
1. The table of rankings in part (c-i) is completed correctly.
2. The sum of ranks in part (c-ii) is computed correctly for both keyboards, using the ranks
reported in the table in part (c-i).
3. The value of W is calculated correctly in part (d), based on the value of SRJ in part (c-ii)
and the correct value of nJ .
Partially correct (P) if the response includes only two of the three components.
Incorrect (I) if the response does not meet the criteria for E or P.
Essentially correct (E) if the response includes the following two components:
1. The response to part (e-i) correctly finds the value of SRJ when W = 0, using the value of
nJ from the response to part (d).
2. The correct assignment of ranks is displayed in the response to part (e-ii) to give W = 0 or
the value of SRJ shown in part (e-i).
Partially correct (P) if the response satisfies only one of the two components.
Incorrect (I) if the response does not meet the criteria for E or P.
Essentially correct if the response in part (f) includes the following four components:
1. The critical region is identified as the possible values of W that are less than or equal to the
value of W reported in part (d). This can be done by marking on the graph shown in the
stem prior to part (e).
2. A correct p-value is reported for the indicated critical region.
3. A correct conclusion is reached based on a reasonable reported p-value.
4. The conclusion is presented in the context of comparing the medians of the data entry time
distributions for keyboards J and K.
Partially correct if the response satisfies only two or three of the four components.
Question 6 (continued)
Each essentially correct (E) section counts as 1 point, and each partially correct (P) section counts as ½
point.
4 Complete Response
3 Substantial Response
2 Developing Response
1 Minimal Response
If a response is between two scores (for example, 2½ points), use a holistic approach to decide whether
to score up or down, depending on the strength of the response and quality of the communication.
× 1.2500 =
Number Correct Weighted Section I Score
(out of 40) (Do not round)
Question 1 × 1.8750 =
(out of 4) (Do not round)
Question 2 × 1.8750 =
(out of 4) (Do not round)
Question 3 × 1.8750 =
(out of 4) (Do not round)
Question 4 × 1.8750 =
(out of 4) (Do not round)
Question 5 × 1.8750 =
(out of 4) (Do not round)
Question 6 × 3.1250 =
(out of 4) (Do not round)
Sum =
Weighted
Section II
Score
(Do not round)
Composite Score
+ =
Weighted Weighted Composite Score
Section I Score Section II Score (Round to nearest
whole number)
Multiple-Choice Questions
Free-Response Questions