Purposive Sampling Patton 1990
Purposive Sampling Patton 1990
Purposive Sampling Patton 1990
169
PURPOSEFUL SAMPLING
Perhaps nothing better captures the difference between quantitative
and qualitative methods than the different logics that undergird
sampling approaches. Qualitative inquiry typically focuses in depth on
relatively small samples, even single cases (n = 1), selected purposefully.
Quantitative methods typically depend on larger samples selected
randomly. Not only are the techniques for sampling different, but the
very logic of each approach is unique because the purpose of each
strategy is different.
The logic and power of probability sampling depends on selecting a
truly random and statistically representative sample that will permit
confident generalization from the sample to a larger population. The
purpose is generalization.
The logic and power of purposeful sampling lies in selecting in
formation-rich cases for study in depth. Information-rich cases are those
from which one can learn a great deal about issues of central importance to the purpose of the research, thus the term purposeful sampling.
For example, if the purpose of an evaluation is to increase the effectiveness of a program in reaching lower-socioeconomic groups, one
may learn a great deal more by focusing in depth on understanding the
needs, interests, and incentives of a small number of carefully selected
poor families than by gathering standardized information from a large,
statistically representative sample of the whole program. The purpose
of purposeful sampling is to select information-rich cases whose study
will illuminate the questions under study.
There are several different strategies for purposefully selecting
information-rich cases. The logic of each strategy serves a particular
evaluation purpose.
(1) Extreme or deviant case sampling. This approach focuses on cases
that are rich in information because they are unusual or special in
some way. Unusual or special cases may be particularly troublesome or
especially enlightening, such as outstanding successes or notable
failures. If, for example, the evaluation was aimed at gathering data
help a national program reach more clients, one might compare a few
project sites that have long waiting lists with those that have short
waiting lists. If staff morale was an issue, one might study and
compare high-morale programs to low-morale programs.
170
171
deviate from the norm. Observing the reactions to someone eating like a
pig in a restaurant and then interviewing people about what they saw
and how they felt would be an example of studying a deviant sample to
illuminate the ordinary.
The Peters and Waterman (1982) best-selling study of "America's
best run companies," In Search of Excellence, exemplifies the logic of
purposeful, extreme group sampling. Their study was based on a sample
of 62 companies "never intended to be perfectly representative of U.S.
industry as a whole ... [but] a list of companies considered to be
innovative and excellent by an informed group of observers of the
business scene" (Peters and Waterman, 1982: 19).
Another excellent example of extreme group sampling is Angela
Browne's (1987) study, When Battered Women Kill. She conducted in-depth
studies of the most extreme cases of domestic violence to elucidate the
phenomenon of battering and abuse. The extreme nature of the cases
presented are what render them so powerful. Browne's book is an
exemplar of qualitative inquiry using purposeful sampling for applied
research.
(2) Intensity sampling. Intensity sampling involves the same logic as
extreme case sampling but with less emphasis on the extremes. An
intensity sample consists of information-rich cases that manifest the
phenomenon of interest intensely (but not extremely). Extreme or
deviant cases may be so unusual as to distort the manifestation of the
phenomenon of interest. Using the logic of intensity sampling, one seeks
excellent or rich examples of the phenomenon of interest, but not
unusual cases.
Heuristic research uses intensity sampling. Heuristic research draws
explicitly on the intense personal experiences of the researcher, for
example, experiences with loneliness or jealousy Coresearchers who
have experienced these phenomena intensely also participate in the
study (see Chapter 3). The heuristic researcher is not typically seeking
pathological or extreme manifestations of loneliness, jealousy, or
whatever phenomenon is of interest. Such extreme cases might not lend
themselves to the reflective process of heuristic inquiry. On the other
hand, if the experience of the heuristic researcher and his or her
coresearchers is quite mild, there won't be much to study. Thus the
researcher seeks a sample of sufficient intensity to elucidate the phenomenon of interest.
The same logic applies in a program evaluation. Extreme successes
or unusual failures may be discredited as being too extreme or un-
172
usual for gaining information. Therefore, the evaluator may select cases
that manifest sufficient intensity to illuminate the nature of success or
failure, but not at the extreme.
Intensity sampling involves some prior information and considerable
judgment. The researcher must do some exploratory work to determine
the nature of the variation in the situation under study One can then
sample intense examples of the phenomenon of interest.
(3) Maximum Variation sampling. This strategy for purposeful sampling aims at capturing and describing the central themes or principal
outcomes that cut across a great deal of participant or program
variation. For small samples a great deal of heterogeneity can be a
problem because individual cases are so different from each other. The
maximum variation sampling strategy turns that apparent weakness into
a strength by applying the following logic: Any common patterns that
emerge from great variation are of particular interest and value in
capturing the core experiences and central, shared aspects or impacts of
a program.
How does one maximize variation in a small sample? One begins by
identifying diverse characteristics or criteria for constructing the sample.
Suppose a statewide program has project sites spread around the state,
some in rural areas, some in urban areas, and some in suburban areas.
The evaluation lacks sufficient resources to randomly select enough
project sites to generalize across the state. The evaluator can at least be
sure that the geographical variation among sites is represented in the
study.
When selecting a small sample of great diversity, the data collection
and analysis will yield two kinds of findings: (1) high-quality, detailed
descriptions of each case, which are useful for documenting uniqueness,
and (Z) important shared patterns that cut across cases and derive their
significance from having emerged out of heterogeneity.
The same strategy can be used within a single program in selecting
individuals for study. By including in the sample individuals the
evaluator determines have had quite different experiences, it is possible
to more thoroughly describe the variation in the group and to
understand variations in experiences while also investigating core
elements and shared outcomes. The evaluator using a maximum
variation sampling strategy would not be attempting to generalize
findings to all people or all groups but would be looking for information
that elucidates programmatic variation and significant common patterns
within that variation.
173
174
village illuminates key issues that must be considered in any development project aimed at this kind of village.
Decision makers may have made their peace with the fact that there
will always be some poor programs and some excellent programs, but
the programs they really want more information about are those run-ofthe-mill programs that are "hard to get a handle on." It is important,
when using this strategy, to attempt to get broad consensus about which
programs are "typical." If a number of such programs are identified,
only a few can be studied, and there is no other basis for selecting
among them purposefully, then it is possible to randomly select from
among all "typical" programs identified to select those few typical cases
that actually will be included in the study.
(6) Stratified purposeful sampling. It is also clearly possible to combine
a typical case sampling strategy with others, essentially taking a stratified
purposeful sample of above average, average, and below average cases.
This is less than a full maximum variation sample. The purpose of a
stratified purposeful sample is to capture major variations rather than to
identify a common core, although the latter may also emerge in the
analysis. Each of the strata would constitute a fairly homogeneous
sample. This strategy differs from stratified random sampling in that the
sample sizes are likely to be too small for generalization or statistical
representativeness.
(7) Critical case sampling. Another strategy for selecting purposeful
samples is to look for critical cases. Critical cases are those that can
make a point quite dramatically or are, for some reason, particularly
important in the scheme of things. A clue to the existence of a critical
case is a statement to the effect that "if it happens there, it will happen
anywhere," or, vice versa, "if it doesn't happen there, it won't happen
anywhere." The focus of the data gathering in this instance is on
understanding what is happening in that critical case. Another clue to
the existence of a critical case is a key informant observation to the
effect that "if that group is having problems, then we can be sure all the
groups are having problems."
Looking for the critical case is particularly important where resources
may limit the evaluation to the study of only a single site. Under such
conditions it makes strategic sense to pick the site that would yield the
most information and have the greatest impact on the development of
knowledge. While studying one or a few critical cases does not
technically permit broad generalizations to all possible cases,
175
176
need, and have not been able to make the changes they desire"
(Grinder et al., 1977: 109). If Milton Erickson couldn't help, no one
could help. He was able to demonstrate that anyone could be hypnotized.
(S) Snowball or chain sampling. This is an approach for locating
information-rich key informants or critical cases. The process begins
?
by asking well-situated people: "Who knows a lot about ____
Who should I talk to?" By asking a number of people who else to talk
with, the snowball gets bigger and bigger as you accumulate new
information-rich cases. In most programs or systems, a few key names
or incidents are mentioned repeatedly. Those people or events recommended as valuable by a number of different informants take on
special importance. The chain of recommended informants will typically diverge initially as many possible sources are recommended, then
converge as a few key names get mentioned over and over.
The Peters and Waterman (1982) study In Search of Excellence began
with snowball sampling, asking a broad group of knowledgeable people
to identify well-run companies. Another excellent and well-known
example was Rosabeth Moss Kanter's (1983) study of innovation
reported in The Change Masters. Her book focused on ten cure case
studies. She began her search for the "best" or "most innovative"
companies by getting the views of corporate experts in human resource fields. Nominations for cases to study snowballed from there
and then converged into a small number of core cases nominated by a
number of different informants.
(9) Criterion sampling. The logic of criterion sampling is to review
and study all cases that meet some predetermined criterion of importance. This approach is common in quality assurance efforts. For
example, the expected range of participation in a mental health outpatient program might be 4 to 26 weeks. All cases that exceed 28
weeks are reviewed and studied to find out what is happening and to
make sure the case is being appropriately handled.
Critical incidents can be a source of criterion sampling. For example, all incidents of client abuse in a program may be objects of indepth evaluation in a quality assurance effort. All farmer mental health
clients who commit suicide within three months of release may
constitute a sample for in-depth, qualitative study. In a school setting,
all students who are absent more than half the time may merit the indepth attention of a qualitative case study. The point of criterion
sampling is to be sure to understand cases that are likely to be
177
178
179
180
population on the basis of 10 cases from each program site, they will
be able to tell legislators that the stories they are reporting were
randomly selected in advance of knowledge of how the outcomes would appear
and that the information collected was comprehensive. The credibility
of systematic and randomly selected case examples is considerably
greater than the personal, ad hoc selection of cases to report after the
factthat is, after outcomes are known.
It is critical to understand, however, that this is a purposeful random
sample, not a representative random sample. The purpose of a small random
sample is credibility, not representativeness. A small, purposeful random
sample aims to reduce suspicion about why certain cases were selected
for study, but such a sample still does not permit statistical
generalizations.
(14) Sampling politically important cases. Evaluation is inherently and
inevitably political to some extent (see Palumbo, 1987; Patton, 1986,
1987b; Turpin, 1989). A variation of the critical case sampling strategy
involves selecting (or sometimes avoiding) a politically sensitive site or
unit of analysis. For example, a statewide program may have a local
site in the district of a state legislator who is particularly influential. By
studying carefully the program in that district, evaluation data may be
more likely to attract attention and get used. This does not mean that
the evaluator then undertakes to make that site look either good or
bad, depending on the politics of the moment. This is simply an
additional sampling strategy for trying to increase the usefulness and
utilization of information where resources permit the study of only a
limited number of cases.
The same (broadly speaking) political perspective may inform case
sampling in applied or even basic research studies. A political scientist
or historian might select the Watergate or Iran-Contra scandals for
study not only because of the insights they provide about the American system of government but because of the likely attention such a
study would attract. A sociologist's study of a riot or a psychologist's
study of a famous suicide would likely involve some attention during
sampling to the political importance of the case.
(15) Convenience sampling. Finally, there is the strategy of sampling
by convenience: doing what's fast and convenient. This is probably
the most common sampling strategyand the least desirable. Too
often evaluators using qualitative methods think that, because the
sample size they can study is too small to permit generalizations, it
doesn't matter how cases are picked, so they might as well pick ones
181
that are easy to access and inexpensive to study. While convenience and
cost are real considerations, they should be the last factors to be taken into
account after strategically deliberating on how to get the most information of greatest utility from the limited number of cases to be
sampled. Purposeful, strategic sampling can yield crucial information
about critical cases. Convenience sampling is neither purposeful nor strategic.
Information-Rich Cases
Table 5.5 summarizes the 15 purposeful sampling strategies discussed above, plus a 16th approachcombination or mixed purposeful sampling. For example, an extreme group or maximum
heterogeneity approach may yield an initial potential sample size that is
still larger than the study can handle. The final selection, then, may be
made randomlya combination approach. Thus these approaches are
not mutually exclusive. Each approach serves a somewhat different
purpose. Because research and evaluations often serve multiple
purposes, more than one qualitative sampling strategy may be necessary. In long-term fieldwork all of these strategies maybe used at some
point.
These are not the only ways of sampling qualitatively. The underlying principle that is common to all these strategies is selecting
information-rich cases. These are cases from which one can learn a
great deal about matters of importance. They are cases worthy of indepth study.
In the process of developing the research design, the evaluator or
researcher is trying to consider and anticipate the kinds of arguments
that will lend credibility to the study as well as the kinds of arguments
that might be used to attack the findings. Reasons for site selections or
individual case sampling need to be carefully articulated and made
explicit. Moreover, it is important to be open and clear about the
study's limitations, including how any particular purposeful sampling
strategy may lead to distortion in the findingsthat is, to anticipate
criticisms that will be made of a particular sampling strategy.
Having weighed the evidence and considered the alternatives,
evaluators and primary stakeholders make their sampling decisions,
sometimes painfully, but always with the recognition that there are no
perfect designs. The sampling strategy must be selected to fit the
purpose of the study, the resources available, the questions being
182
Type
Purpose
2. intensity sampling
183
Purpose
9. criterion sampling
asked, and the constraints being faced. This holds true for sampling strategy
as well as sample size.
SAMPLE SIZE
Qualitative inquiry is rife with ambiguities. There are purposeful
strategies instead of methodological rules. There are inquiry approaches instead of statistical formulas. Qualitative inquiry seems to
work best for people with a high tolerance for ambiguity. (And we're
still only discussing design. It gets worse when we get to analysis.)
184
185
a 95% level of confidence. (See Fitzgibbon and Morris, 1987: 163, for
a table on determining sample size from a given population.)
The logic of purposeful sampling is quite different from the logic
of probability sampling. The problem is, however, that the utility and
credibility of small purposeful samples are often judged on the basis of
the logic, purpose, and recommended sample sizes of probability
sampling. What should happen is that purposeful samples be judged on
the basis of the purpose and rationale of each study and the sampling
strategy used to achieve the study's purpose. The sample, like all other
aspects of qualitative inquiry, must be judged in contextthe same
principle that undergirds analysis and presentation of qualitative data.
Random probability samples cannot accomplish what in-depth,
purposeful samples accomplish, and vice versa.
Piaget contributed a major breakthrough to our understanding of
how children think by observing his own two children at length and in
great depth. Freud established the field of psychoanalysis based on
fewer than ten client cases. Bandler and Grinder (1975a, 1975b)
founded neurolinguistic programming (NLP) by studying three renowned and highly effective therapists: Milton Erickson, Fritz Perls,
and Virginia Satin Peters and Waterman (1982) formulated their widely
followed eight principles for organizational excellence by studying 62
companies, a very small sample of the thousands of companies one
might study.
The validity, meaningfulness, and insights generated from qualitative
inquiry have more to do with the information-richness of the cases selected
and the observational/analytical capabilities of the researcher than with
sample size.
This issue of sample size is a lot like the problem students have
when they are assigned an essay to write.
Student: "How long does the paper have to be?"
Instructor: "Long enough to cover the assignment."
Student: "But how many pages?"
Instructor: "Enough pages to do justice to the subjectno more, no less."
186