AP Statistics Chapter 3

This chapter discusses relationships between variables through scatterplots and correlation. A scatterplot shows the relationship between an explanatory variable on the x-axis and a response variable on the y-axis. The form, direction, and strength of the relationship are examined. Correlation measures the strength and direction of the linear relationship between two quantitative variables on a scale from -1 to 1. Regression finds the least squares regression line that best models the relationship between an explanatory and response variable to predict y-values. The coefficient of determination and residuals are used to assess how well the regression line fits the data. Outliers and lurking variables can influence correlation and regression analyses.

Uploaded by

jose mendoza

AP Statistics Chapter 3 Examining Relationships

3.1: Scatterplots and Correlation

Explanatory and Response Variables
A response variable measures an outcome of a study. An explanatory variable attempts to
explain the observed outcomes. The explanatory variable is sometimes referred to as the
independent variable and is typically symbolized by the variable x. The response variable is
sometimes referred to as the dependent variable and is typically symbolized by the variable y.
Scatterplot
A scatterplot shows the relationship between two quantitative variables measured on the same
individuals. The values of the explanatory variable appear on the horizontal axis, and the values
of the response variable appear on the vertical axis. If there is no clear explanatory/response
relationship between the two variables, then either variable can be placed on either axis. Each
individual in the data set appears as a single point in the plot fixed by the values of both variables
for that individual.
Examining a Scatterplot
In any graph of data, look for patterns and deviations from the pattern. Describe the overall
pattern of a scatterplot by the form, direction and strength of the relationship.
Form can be described as linear or curved.
Direction can be described as positive, negative, or neither.
Strength can be described as weak, moderate, or strong.
A deviation from the overall pattern of a scatterplot is called an outlier.

Association
Two variables are positively associated if, as one increases, the other tends to increase.
Two variables are negatively associated if, as one increases, the other tends to decrease.
Correlation
Correlation measures the strength and direction of the linear relationship between two
quantitative variables. Correlation is usually represented by the letter r.
Facts about Correlation
1. When calculating correlation, it makes no difference which variable is x and which is y.
2. Correlation is only calculated for quantitative variables, not categorical.
3. The value of r does not change if the units of x and/or y are changed.
4. Positive r indicates a positive association between x and y. Negative r indicates a negative
association.
5. Correlation is always a number between -1 and +1. Values close to +1 or -1 indicate that
the points lie close to a line. The extreme values of +1 and -1 are only achieved when the
points are perfectly linear.
6. Correlation measures the strength of a linear relationship between two variables, not
curved relationships.
7. Correlation, like the mean and standard deviation, is nonresistant. Recall that this means
that it is greatly affected by outliers.
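
Several of the facts above can be checked numerically. The following is a minimal sketch, assuming the usual textbook definition of r (the sum of the products of the standardized x- and y-values, divided by n - 1). The data set and the names `correlation`, `hours`, and `score` are made up for illustration.

```python
from statistics import mean, stdev


def correlation(xs, ys):
    """Sample correlation r: sum of products of standardized values over n - 1."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)  # sample standard deviations
    products = (((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys))
    return sum(products) / (len(xs) - 1)


# Hypothetical data: hours studied (x) and quiz score (y).
hours = [1, 2, 3, 4, 5]
score = [2, 4, 5, 4, 5]

r = correlation(hours, score)
r_swapped = correlation(score, hours)                    # fact 1: swap x and y
r_minutes = correlation([60 * h for h in hours], score)  # fact 3: change units of x
r_line = correlation(hours, [3 * h + 1 for h in hours])  # fact 5: perfectly linear data
r_outlier = correlation(hours + [10], score + [0])       # fact 7: add one outlier

print(round(r, 3), round(r_swapped, 3), round(r_minutes, 3))
print(round(r_line, 3), round(r_outlier, 3))
```

For this made-up data r is about 0.775, and it is unchanged by swapping the variables or converting hours to minutes. Adding the single point (10, 0) makes r negative, which illustrates how strongly one outlier can affect the correlation.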

3.2: Least-Squares Regression

Regression Line
A regression line is a straight line that describes how a response variable y changes as an
explanatory variable x changes. The line is often used to predict values of y for given values of x.
Regression, unlike correlation, requires an explanatory/response relationship. In other words,
when x and y are reversed, the regression line changes. Recall that correlation is the same no
matter which variable is x and which is y.
Least-Squares Regression Line
The least-squares regression line is the line that makes the sum of the squares of the vertical
distances from the data points to the line as small as possible.
Equation of the Least-Squares Regression Line
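To find the equation of the regression line in the form $\hat{y} = a + bx$, where a is the
y-intercept and b is the slope, use the following equations:

$b = r \dfrac{s_y}{s_x}$ and $a = \bar{y} - b\bar{x}$

Here r is the correlation, $s_x$ and $s_y$ are the standard deviations of x and y, and
$\bar{x}$ and $\bar{y}$ are their means.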
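
As a rough illustration of the slope and intercept formulas (slope = r times s_y over s_x, intercept = mean of y minus slope times mean of x), the sketch below computes both for a small made-up data set. The function name `least_squares_line` and the data are assumptions for this example only.

```python
from statistics import mean, stdev


def least_squares_line(xs, ys):
    """Return (a, b) for y-hat = a + b*x, using b = r*s_y/s_x and a = y-bar - b*x-bar."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    n = len(xs)
    r = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / ((n - 1) * s_x * s_y)
    b = r * s_y / s_x
    a = y_bar - b * x_bar
    return a, b


# Hypothetical data.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

a, b = least_squares_line(xs, ys)          # regression of y on x
a_rev, b_rev = least_squares_line(ys, xs)  # regression of x on y: a different line

print(f"y-hat = {a:.2f} + {b:.2f}x")
print(f"x-hat = {a_rev:.2f} + {b_rev:.2f}y")
```

Here the regression of y on x has intercept 2.2 and slope 0.6, while reversing the roles of the variables gives intercept -1 and slope 1. This is the sense in which the regression line, unlike correlation, depends on which variable is treated as explanatory.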

The Role of r-squared (Coefficient of Determination)

The square of the correlation coefficient, or r-squared, is the fraction of the variation in the
values of y that is explained by the least-squares regression of y on x. So if r-squared for
the regression between x and y is 0.73, we can say that x accounts for 73% of the variation in y.
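
One way to see what "variation accounted for" means is to compare the squared deviations of y from its mean with the squared deviations of y from the regression line. The sketch below does this for a made-up data set and checks that the result matches the square of r. All names and numbers are illustrative assumptions.

```python
from statistics import mean

# Hypothetical data.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

x_bar, y_bar = mean(xs), mean(ys)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
s_xx = sum((x - x_bar) ** 2 for x in xs)
s_yy = sum((y - y_bar) ** 2 for y in ys)

b = s_xy / s_xx                  # algebraically the same slope as r * s_y / s_x
a = y_bar - b * x_bar
r = s_xy / (s_xx * s_yy) ** 0.5

sst = s_yy                                                 # total variation in y
sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))  # variation left unexplained
r_squared = 1 - sse / sst

print(round(r_squared, 3), round(r ** 2, 3))
```

For this data both quantities equal 0.6, so the line accounts for 60% of the variation in y.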
Residuals
A residual is the difference between an observed value of y and the value predicted by the
regression line. That is, residual = actual y - predicted y.
Residual Plot
A residual plot is a scatterplot of each x-value and its residual value. The residual plot is used to
determine whether a linear equation is a good model for a set of data, as follows:
If the residual plot exhibits randomness, then a line is a good model for the data (see left)
If the residual plot exhibits a pattern, then a line is NOT a good model for the data (right)
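
The sketch below computes residuals (actual y minus predicted y) for two made-up data sets: one roughly linear and one that follows a curve. The helper names `fit_line` and `residuals` are assumptions for this example. Plotting each list of residuals against x would give the residual plot.

```python
from statistics import mean


def fit_line(xs, ys):
    """Least-squares intercept a and slope b for y-hat = a + b*x."""
    x_bar, y_bar = mean(xs), mean(ys)
    b = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
         / sum((x - x_bar) ** 2 for x in xs))
    return y_bar - b * x_bar, b


def residuals(xs, ys):
    """Residual = actual y - predicted y, for each data point."""
    a, b = fit_line(xs, ys)
    return [y - (a + b * x) for x, y in zip(xs, ys)]


xs = [1, 2, 3, 4, 5]
roughly_linear = [2, 4, 5, 4, 5]   # hypothetical data
curved = [x ** 2 for x in xs]      # hypothetical data on a parabola

res_linear = residuals(xs, roughly_linear)
res_curved = residuals(xs, curved)

print([round(e, 2) for e in res_linear])
print([round(e, 2) for e in res_curved])
```

The residuals for the curved data are 2, -1, -2, -1, 2: positive at both ends and negative in the middle. That U-shaped pattern is the kind of signal that a line is not a good model. In both cases the residuals from a least-squares line sum to zero.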

Outliers and Influential Points

A point that lies outside the overall pattern of the other observations is considered an outlier. If
the removal of such a point has a large effect on the correlation and/or regression, that point is
considered an influential point.
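
A simple way to look for influential points is to remove each point in turn and see how much the slope of the regression line changes. The sketch below does this for a made-up data set in which one point, (10, 0), lies far from the others. The names `slope`, `points`, and `most_influential` are illustrative assumptions.

```python
from statistics import mean


def slope(points):
    """Slope of the least-squares line through a list of (x, y) pairs."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    x_bar, y_bar = mean(xs), mean(ys)
    return (sum((x - x_bar) * (y - y_bar) for x, y in points)
            / sum((x - x_bar) ** 2 for x in xs))


# Hypothetical data; the last point lies outside the pattern of the others.
points = [(1, 2), (2, 4), (3, 5), (4, 4), (5, 5), (10, 0)]
full_slope = slope(points)

# Change in slope when each point is left out, one at a time.
changes = []
for i in range(len(points)):
    remaining = points[:i] + points[i + 1:]
    changes.append(abs(slope(remaining) - full_slope))

most_influential = points[changes.index(max(changes))]

print(round(full_slope, 3), round(slope(points[:-1]), 3), most_influential)
```

With all six points the slope is negative. Without (10, 0) it is 0.6, so that single point reverses the direction of the fitted line.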

3.3: Correlation and Regression Wisdom

Extrapolation
Extrapolation is the use of a regression line or curve to predict y for values of x far outside
the range of x-values that was used to obtain the line or curve. Such predictions are often not accurate.
Example:
Looking at the end-of-year NASDAQ composite stock index from 1994 to 1999 gives the
appearance that the pattern will continue as shown on the graph below. However, the actual
values for the years that followed dropped off considerably (actual data shown by the two points below).
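
The danger of extrapolation can also be seen with a small made-up data set (not the NASDAQ figures, which are not reproduced here). The sketch below fits a line to five points that actually follow a curve, then compares the prediction error inside the range of the data with the error far outside it. All names and values are illustrative assumptions.

```python
from statistics import mean

# Hypothetical data that doubles at each step, observed only for x = 0..4.
xs = [0, 1, 2, 3, 4]
ys = [2 ** x for x in xs]

x_bar, y_bar = mean(xs), mean(ys)
b = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
     / sum((x - x_bar) ** 2 for x in xs))
a = y_bar - b * x_bar


def predict(x):
    """Prediction from the fitted line y-hat = a + b*x."""
    return a + b * x


inside_error = abs(predict(2) - 2 ** 2)     # x = 2 is inside the observed range
outside_error = abs(predict(10) - 2 ** 10)  # x = 10 is far outside it

print(round(inside_error, 1), round(outside_error, 1))
```

The line predicts 35 at x = 10, while the pattern that generated the data gives 1024. The fit looks tolerable where there is data and fails badly where there is none.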

Lurking Variable
A lurking variable is a variable which is not among the variables of a study and yet may
influence the interpretation of the relationships among those variables. For example, consider the
statistical relationship between ice cream sales and drowning deaths. These two variables have a
positive, and potentially statistically significant, correlation with each other. One might be
tempted to conclude then, that more ice cream sales cause more drowning deaths to occur. The
real cause of the corresponding increase in both of these variables is a lurking variable: warm
weather. People eat more ice cream and go swimming more when it is warm.
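
The ice cream example can be imitated with a small simulation. In the sketch below, daily temperature is generated first, and ice cream sales and drownings are each generated from temperature plus random noise. Neither one appears in the formula for the other. The coefficients, noise levels, and seed are arbitrary assumptions chosen for illustration, not real data.

```python
import random
from statistics import mean, stdev


def correlation(xs, ys):
    """Sample correlation r between two equal-length lists."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    total = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return total / ((len(xs) - 1) * s_x * s_y)


rng = random.Random(0)  # fixed seed so the run is repeatable

# Toy model: temperature is the lurking variable behind both responses.
temps = [rng.uniform(10, 35) for _ in range(500)]
ice_cream = [5 * t + rng.gauss(0, 10) for t in temps]
drownings = [0.2 * t + rng.gauss(0, 1) for t in temps]

r = correlation(ice_cream, drownings)
print(round(r, 2))
```

The two simulated variables come out strongly positively correlated even though, by construction, neither has any effect on the other.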
The Use of Averaged Data
When averaged data are used in place of the individual observations in a two-variable setting,
the result is usually a much stronger correlation. This can give the false impression that the
relationship between x and y is stronger than it actually is. Correlations based on averages or
other grouped data are usually too high when applied to individuals.
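
The effect of averaging can be illustrated with simulated data. In the sketch below, 50 individuals are generated at each of ten x-values, with a weak underlying trend and a lot of individual scatter. The correlation is then computed once from all 500 individuals and once from the ten group averages. The trend, noise level, and seed are arbitrary assumptions.

```python
import random
from statistics import mean, stdev


def correlation(xs, ys):
    """Sample correlation r between two equal-length lists."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    total = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return total / ((len(xs) - 1) * s_x * s_y)


rng = random.Random(1)  # fixed seed so the run is repeatable

ages = list(range(1, 11))  # hypothetical x-values shared by each group
individual_x, individual_y, mean_y_by_age = [], [], []
for age in ages:
    ys = [2 * age + rng.gauss(0, 10) for _ in range(50)]  # weak trend, large scatter
    individual_x += [age] * 50
    individual_y += ys
    mean_y_by_age.append(mean(ys))

r_individual = correlation(individual_x, individual_y)
r_averaged = correlation(ages, mean_y_by_age)

print(round(r_individual, 2), round(r_averaged, 2))
```

Averaging removes most of the individual scatter, so the correlation based on group means is much closer to 1 than the correlation among the individuals themselves.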
