Lec 4
Lec 4
Exploring Associations
between Variables
Topics
• Use Technology to
Create a Scatterplot
• Use Scatterplots to
Investigate
Associations
Between Numerical
Variables
Visualizing Variability with a Scatterplot
Scatterplot
• The primary tool for examining relationships
between two numerical variables.
• Each point in the scatterplot represents one
observation.
• Usually created using technology such as a
computer software program or a graphing
calculator.
Median Age of Marriage for Women
Notice that as r increases, there is less vertical variation in the data (the
trend is stronger).
Computing the Correlation Coefficient
Background:
Data are converted to z-scores which are multiplied
together. These products are then added and the
resulting sum is divided by n − 1.
In practice: The correlation coefficient is found
using technology.
Example 7
The table below shows the heights and weights for
6 women. Compute and interpret r, the correlation
coefficient.
Height 61 62 63 64 66 68
Weight 104 110 141 125 170 160
Stat Crunch Output (1 of 2)
• Simple linear regression results:
Dependent Variable: Weight
Independent Variable: Height
Weight = −442.88235 + 9.0294118 Height
Sample size: 6
R ( correlation coefficient ) = 0.88093363
R-sq = 0.77604407
Page 1 of the output has a lot of information, but
we can see r = 0.881. Since r is close to 1, we
would say there is a strong linear association
between height and weight.
Stat Crunch Output (2 of 2)
• Use Technology to
Write the
Regression Equation
• Use the Regression
Equation to Make
Appropriate
Predictions
Regression Line