STT 215 Exam 1 Study Guide
STT 215 Exam 1 Study Guide
Chapter 4: How are data gathered? What does good design look like?
Observational studies vs. Experiments
Observational studies are prone to lurking variables because there is no randomization.
Sample surveys can use simple random sampling, cluster random sampling, and/or stratified random samples. A
sampling frame provides the list of all possible subjects. Under SRS, each subject in the sampling frame has the
same chance of being selected. Under cluster random sampling, clusters (naturally existing groups) are selected
and then all subjects in those clusters are surveyed. Finally, under stratified sampling, strata are determined using
subjects’ characteristics and then subjects are randomly selected from each strata.
Observational studies can be retrospective, prospective, or cross-sectional (at a given point in time). One special
case of a retrospective study is the case-control study, where equal numbers of subjects with a condition and
without the condition are asked about their previous experiences with the explanatory variable (e.g., lung cancer
and smoking).
When sampling for surveys it is important to guard against biases, including: sampling bias (due to undercoverage
or volunteer samples), response bias, and nonresponse bias. In contrast to sample surveys, a census attempts to
survey every subject in the population.
Experiments, when well-designed, can help us establish causality between the response and explanatory variables.
Good design includes drawing a representative sample, using random assignment for the experimental treatment
and control treatment, and avoiding bias through double-blinding.
Multi-factor experiments have at least two explanatory variables and are interested in the effects of each
explanatory variable as well their interaction. Blocking and matched-pair, and crossover designs are other
experimental techniques.