Cobb 1997
Cobb 1997
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=maa.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
Mathematical Association of America is collaborating with JSTOR to digitize, preserve and extend access to
The American Mathematical Monthly.
http://www.jstor.org
Mathematics, Statistics, and Teaching
GeorgeW. Cobband David S Moore
.
Example
1. Themystery
ofAndover.The finite sequence(3, 5, 23, 37, 6, 8, 20, 22, 1, 3)
shows a distinctive pattern when plotted (Figure 1) but the numbers and the
pattern have no meaningor interest until we know their context.They are in fact
monthly totals of people formally accused of witchcraft in Essex County,
Massachusetts,beginningin February,1692. The plot shows two waves of accusa-
tions, separatedby a low point in the summerof 1692. The pattern becomes still
more meaningful when we know that the first hanging of a convicted witch
(BridgetBishop) took place June 10, 1692:it is not hard to imagine the sobering
effect of that first execution in the small community of Salem Village (now
Danvers).But why the second wave of accusations?It turns out that the accusa-
tions in the first wave were directed against residents of Salem Village, Salem
Town, and all but one of the half-dozenimmediatelyadjacenttowns;in the second
wave the majorityof the accusationswere directed against residents of the one
other adjacenttown, Andover.Our sources[3, 4] do not providemuch explanation
for what happenedin Andover,but the pattern,togetherwith what we knowof the
context,tells at least part of a story and raises some interestingquestions.
= 30- / \
U I \
0 , I
8Y I \
o # 1
20- / \ <
S I I / \
O-
l
Jan Apr Jul Oct
Month
Figure1. Numbersof people accusedof witchcraftin Essex County,MA, 1692.
Although this first example has almost no mathematical content, its interplay
between pattern and context is typical of the interpretive part of statistical
thinking. For a more familiar example of a very different sort, consider testing that
two normal distributions have equal means.
Example 2a. A model for comparing normal means. Consider the standard model
involving two sets of independent, identically distributed (iid) random variables:
(l) (2)
Design --> Data--> Patterns
Model(s)--> Methods--> Results-->Intrepretation
(3) (4) (5)
Group1 _ Treatment1
, 10 patients Calcium X
Random Compare
Allocation X BloodPressure
Group2 Treatment2 /
11 patients Placebo
Taken together, the last two examplesoffer what we consider two of the most
importantlessons for mathematicianswho teach statistics:one, the conclusions
from a study depend cruciallyon how the data were produced, and twoSthe
standardmathematicalmodels ignore data production.
Statisticalideas for producingdata to answer specific questions are the most
influential contributionsof statistics to human knowledge. Badly designed data
productionis the most common serious flaw in statisticalstudies. Well designed
data productionallows us to apply standardmethods of analysisand reach clear
conclusions. Professional statisticians are paid for their expertise in designing
studies;if the studyis well designed(and no unanticipateddisasteroccurred),you
don't need a professionalto do the analysis.In other words, the design of data
production is really important. If you just say s4SupposeX1 to Xn are iid
observations,'vyou aren'tteachingstatistics.
2.2. Data analysis:explorationand description Data analysisis the contemporary
form of 44descriptivestatistics,S'
powered by more numerous and more elaborate
descriptivetools, but especially by a philosophy due in large measure to John
Tukeyof Bell Labs and Princeton.The philosophyis capturedin the now-common
name, exploratozy data analysas,or EDA. The goal of EDA is to see what the data
in hand say, on the analogyof an explorerenteringunknownlands. We put aside
(but not forever) the issue of whether these data represent any larger universe.
20 -
° 10-
* 4
ct
.S
ut
U o-
C,
a)
-10
Placebo Calcium
* F
10
c
J - 11
r * r
* CT
ao
X O-
* * X
_S_
-10 -
* I I I
-1 0 -1
Normalscores
For the calcium data, the pattern is reasonablylinear, although the vertical
jump before the three right-mostpoints shows observedresidualsthat are larger
than predictedby the normalmodel, a patternconsistentwith the unequalspreads
in the boxplots.
Mathematicallystructuredinstruction,which tends to emphasizehow methods
follow from models, often provides only the most general warnings about the
realities of practice.Statisticsin practiceresembles a dialog between models and
data. Models for the process that producedour data do indeed play a centralrole
in statistical inference. The mathematicalexplorationof properties and conse-
quences of models is therefore important(as it is in economics and physics).But
the data are also allowed to criticize and even falsify proposed models. In the
calcium examples, the exploratoryanalysiswarns us not to rely heavily on the
assumptionof equal variances,and to use a modified t-test that estimatesseparate
variancesfor the two groups.We can modifyBox'sdictuminto a practicalversion
of the statementthat statisticsis not just mathematics:Mathematicaltheoremsare
true, statisticalmethodsare sometimeseffectivewhenused withskill.
Wide availabilityof cheap computing,especially graphics,has combinedwith
the desire to "let the data speak"to generate an abundanceof new tools: at the
low end we have the stemplots and boxplots of Example 2c, but there are also
model-freescatterplotsmoothers,resistantregressionalgorithms,clever ideas for
displayof high-dimensionaldata on two-dimensionalscreens, and many still more
advanced diagnostic tools for specific situations. Standard statistical software
implementsmuch of this. The books [7] and [9], by Bell Labs scientistsinfluenced
by Tukey,present much of the basic graphicalmaterial.The softwarepackagesS
and S-PLUS,which originatedat Bell Labs, implementmore of the new graphics
and also implementseveralnew classes of models. See [8]for detailed discussionof
the latter.
160 -
,,, 120-
o
o
O "
.° "
> 80 -
o w |
.t
n 40 - *-
-
*""::
.
| i I I {
o 7.5 15 22.s 30
Urban population (millions)
Figure 7. Scatterplot of dormitory population versus urban population for the 50 U.S. states.
2.5-
*VT
ce
.
g 2.0 -
4 * RI
.=
= * *
.
u 1.5- * u
° *
ct
= * . =
:,,1 O- * * : C
Wo + e
Q wv *. . Q Q
..
; * ;
v . v
AK v
o-
25 50 75 loo
Percentage of population in urban areas
Figure 8. Scatterplot of the dorms-and-cities data after adjusting for the "lurking variable"population.
25 -
20 -
= 15 -
10 -
l___ lull * ll * - - - - |
l w g
O-
0 100 200 300 400 500 600
Survivaltime,days
2
CT
d. -
.-
Ct
i -1
-2
-3
100 200 300 400 500 600
Survivaltime (days)
817
the searchfor meaning.This surpriseis so welcome that it carriesa dangerof
pushingthe pendulumtoo far the other way. Some studentsmay drift into a
complacentconvictionthat any story about the data that fits the patterns
with coherenceand plausibilitymustbe true. The timingis rightfor a dose of
design and skepticism.
3.4. Teach design as the bridgebetweendata analysis and inference. An introduc-
tion to design for data productionfits naturallybetween exploratoryanalysisand
inference: sound design is what makes inference possible. Waiting to introduce
probabilitydistributionsuntil after the basics of design has a number of advan-
tages. For one thing, this order helps make clear that the justification for
probability models must come from the randomness in the data production
process, and so providessome protectionagainstunthinkingadoptionof probabil-
ity models. For another, learning about data productionintroduces students to
essential conceptslike populationand sample,parameterand statistic,before they
encounterthe samplingdistribution,which is conceptuallydifficultall by itself.
The single most importantpoint for studentsto understandis why randomized
comparativeexperimentsare the gold standardfor evidence of causation.A rich
source of true-lifecautionarytales is the book [6], edited by the physiciansBunker
and Barnes and the statistician Mosteller, which contains striking examples of
medical treatments that became standardin the days before medicine adopted
randomized comparative experiments, and were found to be worthless when
subjectedto propertesting.
There is of course more to the statistical side of designing experimentsand
sample surveysthan "randomize."The designs used in practice are often quite
complex, and must balance efficiency with the need for informationof vaxying
precision about many factors and their interactions.Simple designs randomized
experimentscomparingtwo or several treatments,simple random samples from
one or several populations-illustrate the most importantideas and support the
inference taughtin a first statisticscourse.You must talk about these designs,but
need not go farther.Some other importantmaterial,for example,proceduresfor
developingand testing surveyquestionsand for trainingand supervisinginterview-
ers, is not usually presented in statistics courses. Statistics students should be
aware that these practicalskills do matter, and that data productioncan go awry
even when we startwith a sound statisticaldesign.How muchtime to spend here is
a matter of your judgmentof the needs of your audience.
3.5. Inference:two barriers to understanding.Section 2.3 has described briefly
how inferenceworks.Because the details are in practiceautomated,we would like
students to put most of their effort into graspingthe ideas. They are not easy to
grasp.The first barrieris the notion of a samplingdistribution.Choose a simple
setting, such as using the proportionp of a sample of workerswho are unem-
ployed to estimate the proportionp of unemployedworkersin an entire popula-
tion. Physicalexamples (samplingbeads from a box), computersimulations,and
encouragingthought experimentsall help convey the idea of many samples with
many values of p. Keep asking, "What would happen if I did this many times?"
That questionis the key to the logic of standardstatisticalinference.
Once the idea of a sampling distributionbegins to settle, the tools of data
analysishelp us take the next steps. Faced with any distribution,we ask about
shape, center, and spread.The shape of the samplingdistributionof p is approxi-
matelynormal.The mean is equal to the unknownpopulationproportionp. This
says that p as an estimatorof p has no bias, or systematicerror.The precisionof
Department
of Mathematics,Statistics Departmentof Statistics
and ComputerScience PurdueUniversity
MountHolyokeCollege WestLafayette,IN 47907
SouthHadZey,
M4 01075 [email protected]
[email protected]
823