0% found this document useful (0 votes)
10 views10 pages

Chap 11 Lecture

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views10 pages

Chap 11 Lecture

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

MATH 333 Probability & Statistics

Chapter 11 Lecture

Instructor: Kristin Carfora


11.2: Simple Linear Regression
In this chapter we want to determine if there is a significant linear
relationship between two quantitative variables, denoted X and Y . This is
called a linear correlation or association.
Data for the two variables is given in a table or as a set of ordered pairs
(x, y ). We use scatterplots, graphs of ordered pairs of data, to determine if
there is a visual relationship between the variables.
We can describe the strength of the linear relationship between two
variables as strong, moderate, or weak

2 / 10
11.2: Simple Linear Regression
Linear Regression is the process of analyzing the linear relationship
between two variables.
If x and y have a moderate/strong linear relationship, we can then find the
equation of the line that best fits the data (shown below). This line is called
the line of best fit, or the least squares regression line

The formulas that determine the


equation of the line minimize the
squares of the distances between the
coordinate points and the line

3 / 10
11.2: Simple Linear Regression
The equation of the least-squares regression line is given by: y = β1 x + β0
β1 = slope of the best fit line
β0 = y-intercept of the best fit line

The slope of a line is the change in y for each unit increase in x

The y-intercept of a line is the y-value of the function when x = 0

P P P
n( xy ) − ( x)( y )
β1 =
n( x 2 ) − ( x)2
P P

P P
y x
β0 = Ȳ − β1 X̄ where Ȳ = and X̄ =
n n
Note: n represents the number of coordinate points in your data set
4 / 10
Example
Consider the following data set, where X = number of absences in a math
class and Y = final grade. Find the equation of the least squares regression
line for X and Y. (There is more room on the next slide.)
X Y

2 91

3 90

6 80

8 72

9 70

10 58

5 / 10
Example
For the data set on the previous slide, find the equation of the least squares
regression line for X = number of absences and Y = final grade.

6 / 10
11.2: Simple Linear Regression
We can use the regression line to make predictions about the population.

Example: For the same data set, using the equation for the regression line
y = −3.78x + 101, answer the following:
Predict the final grade for a student who has 7 absences
How many absences would you expect a student to have had if their
final grade is 82%?

X Y
2 91
3 90
6 80
8 72
9 70
10 58

7 / 10
Example
Example: For the same data set, using the equation for the regression line
y = −3.78x + 101, answer the following:
Intepret the slope of the regression line
Interpret the y-intercept of the regression line. (Is this realistic?)

X Y
2 91
3 90
6 80
8 72
9 70
10 58

8 / 10
Extra Practice
Example: For the following data set, let X = patient’s age in years and Y =
systolic blood pressure. Find the equation of the line of best fit.

X Y XY X2

17 114

21 120

39 146

47 150

56 154

9 / 10
Extra Practice
Example: For the following data set, let X = patient’s age in years and Y =
systolic blood pressure. The regression equation is ŷ = 1.08x + 98.
X Y 1 Predict the blood pressure of a 50 year old.

17 114

21 120 2 What age would you expect a person to be if they have a


systolic blood pressure of 130?
39 146

47 150

56 154 3 Interpret the slope and y-intercept.

10 / 10

You might also like