Notes3.1 TPS6up
Notes3.1 TPS6up
Section 3.1
Scatterplots and
Correlation
Scatterplots and Correlation
LEARNING TARGETS
By the end of this section, you should be able to:
DISTINGUISH between explanatory and response variables for
quantitative data.
MAKE a scatterplot to display the relationship between two
quantitative variables.
DESCRIBE the direction, form, and strength of a relationship displayed
in a scatterplot and identify unusual features.
INTERPRET the correlation.
UNDERSTAND the basic properties of correlation, including how the
correlation is influenced by unusual points.
DISTINGUISH correlation from causation.
A data set that describes the relationship between two variables is sometimes
called bivariate data.
Analysis of relationships between two variables builds on the same tools we used
to analyze one variable:
• Plot the data, then look for overall patterns and departures from those
patterns.
• Add numerical summaries.
• When there’s a regular overall pattern, use a simplified model to describe it.
Note: In many studies, the goal is to show that changes in one or more explanatory
variables actually cause changes in a response variable. However, other
explanatory-response relationships don’t involve direct causation.
No Association
Strength
Direction There is a moderately strong,
Form negative, curved relationship
Unusual features between the percent of
students in a state who take
the SAT and the mean SAT
math score.
Further, there are two distinct
clusters of states and two
unusual points that fall outside
the overall pattern.
CAUTION:
CAUTION:
1 x1 x y1 y x2 x y2 y xn x yn y
r ...
n 1 sx s y sx s y s x s y
1 xi x yi y
r
n 1 sx
sy
How correlation behaves is more important than the details of the formula.
Here are some important facts about r.
LEARNING TARGETS
After this section, you should be able to:
DISTINGUISH between explanatory and response variables for
quantitative data.
MAKE a scatterplot to display the relationship between two
quantitative variables.
DESCRIBE the direction, form, and strength of a relationship displayed
in a scatterplot and identify unusual features.
INTERPRET the correlation.
UNDERSTAND the basic properties of correlation, including how the
correlation is influenced by unusual points.
DISTINGUISH correlation from causation.