Section 9.2, Linear Regression

This document discusses linear regression analysis. It defines the regression line as the best-fit line for predicting a dependent variable (y) from an independent variable (x). The regression line takes the form ŷ = mx + b, where m and b are calculated using formulas involving sums of the x and y values and their products. The residual is the difference between the observed and predicted y values. The coefficient of determination, r2, indicates what proportion of the variation in y is explained by the regression line. Examples are provided to demonstrate calculating the regression line and r2 from sample data. Limitations of linear regression like its reliance on a linear relationship and issues with extrapolation are also noted.

Uploaded by

Han Myo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

99 views

Section 9.2, Linear Regression

Uploaded by

Han Myo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Section 9.

2, Linear Regression

Our goal for this section will be to write the equation of the “best-fit” line through the points on
a scatter plot for paired data. This helps us to predict values of the response variable when the
explanatory variable is given.
The regression line is the best-fit line through the points in the data set. For an independent
variable x and dependent variable y, it has the form
ŷ = mx + b,
where ŷ is the predicted y-value for a given x-value,
P P P
n (xy) − x y
m= P P and
n (x2 ) − ( x)2
P P
y x
b = ȳ − mx̄ = −m .
n n
The line always passes through the point (x̄, ȳ).
The residual, d, is the difference of the observed y-value and the predicted y-value. d = (observed y-value)−
(predicted y-value). The regression line (found with these formulas) minimizes the sum of the squares
of the residuals.
The coefficient of determination, r2 , is the proportion of the variation that explained by the
regression line.
Examples
1. The number of officers on duty in a Boston city park and the number of muggings for that day
are:
Officers Muggings
10 5
15 2
16 1
1 9
4 7
6 8
18 1
12 5
14 3
7 6
Calculate the regression line for this data, and the residual for the first observation, (10, 5).
What percentage of variation is explained by the regression line?
P P P
FromPthe calculations we did in section 9.1, we found that x = 103, y = 47, xy = 343,
and x2 = 1347. So,
10 · 343 − 103 · 47
m= = −0.493 and
10 · 1347 − 1032
47 103
b= − (−0.493) · = 9.780.
10 10
Then, the equation of the regression line is ŷ = −0.493x + 9.780.
To find the residual, we need to find ŷ when x = 10, so ŷ = −0.493 · 10 + 9.780 = 4.848, so
d = 5−4.848 = 0.152. (Whenever possible, use the original numbers for m and b in calculations
instead of the rounded numbers).
In Section 9.1, we calculated that r = −0.969, so r2 = .939 and 93.9% of the variation is
explained by the regression line (and 6.1% is due to random and unexplained factors).
2. A study involved comparing the per capita income (in thousands of dollars) to the number
of medical doctors per 10,000 residents. Six small cities in Oregon had the observations:
Per capita income Doctors
8.6 9.6
9.3 18.5
10.1 20.9
8.0 10.2
8.3 11.4
8.7 13.1
The data has a correlation coefficient of r = 0.934. Calculate the regression line for this
data. What percentage of variation is explained by the regression line? Predict the number of
doctors per 10,000 residents in a town with a per capita income of $8500.
P P P P 2
Calculating from the data we see that x = 53, y = 83.7, (xy) = 755.89, and x =
471.04. Then,
6 · 755.89 − 53 · 83.7
m= = 5.756 and
6 · 471.04 − 532
83.7 53
b= − 5.756 · = −36.898.
6 6
The equation of the line is ŷ = 5.756x − 36.898.
The proportion of variation explained by the line is r2 = 0.9342 = 0.872, so 87.2% is explained
by the line.
A town with a per capita income of $8500 (x=8.5) will have approximately ŷ = 5.756 · 8.5 −
36.898 = 12.03 doctors per 10,000 residents
Some problems with Linear Regression:
• It works best to predict values when the relationship between variables is linear. If r is close
to zero, ŷ will not be a good predictor of y, in general.
• Extrapolation: The line is intended to predict values of y for values of x that are close to the
data. Using the line far outside that range may produce unrealistic forecasts.

Regression Analysis - VCE Further Mathematics
No ratings yet
Regression Analysis - VCE Further Mathematics
5 pages
Comm 215.MidtermReview
No ratings yet
Comm 215.MidtermReview
71 pages
DMJAP-LinearRegression-3
No ratings yet
DMJAP-LinearRegression-3
28 pages
Coefficient of Determination
No ratings yet
Coefficient of Determination
7 pages
Topic 8 - Regression Analysis
No ratings yet
Topic 8 - Regression Analysis
51 pages
Unit 2-Part 3-Linear Regression
No ratings yet
Unit 2-Part 3-Linear Regression
38 pages
Lecture 07 Regression
No ratings yet
Lecture 07 Regression
22 pages
Unit 07 Regression Correlation (1)
No ratings yet
Unit 07 Regression Correlation (1)
36 pages
STB1003_Unit-3 bsc
No ratings yet
STB1003_Unit-3 bsc
12 pages
Level II IFT Study Notes Quant R04 Introduction To Linear Regression
No ratings yet
Level II IFT Study Notes Quant R04 Introduction To Linear Regression
13 pages
Module 3 (Regression Line) and Module 4
No ratings yet
Module 3 (Regression Line) and Module 4
38 pages
linearregression-Rupak_(1)
No ratings yet
linearregression-Rupak_(1)
32 pages
Linear Regression II
No ratings yet
Linear Regression II
54 pages
Chap01-3 (Autosaved)
No ratings yet
Chap01-3 (Autosaved)
51 pages
4-Biol 605-Regression Models (1)
No ratings yet
4-Biol 605-Regression Models (1)
25 pages
Chapter 5 - Eng
No ratings yet
Chapter 5 - Eng
20 pages
Chapter 3 - Classical Simple Linear Regression
No ratings yet
Chapter 3 - Classical Simple Linear Regression
52 pages
ArunRangrej
No ratings yet
ArunRangrej
5 pages
Unit-2 Numericals
No ratings yet
Unit-2 Numericals
17 pages
Proycto Final Karla Tamayo Bioestadistica - Ingles.
No ratings yet
Proycto Final Karla Tamayo Bioestadistica - Ingles.
5 pages
Unit 3 Notes
100% (2)
Unit 3 Notes
32 pages
Regression Model
No ratings yet
Regression Model
26 pages
Mda-Session-7 Simple Linear Regression
No ratings yet
Mda-Session-7 Simple Linear Regression
75 pages
STAT22209 - Chapter 02-Regression Analyisis - 2022
No ratings yet
STAT22209 - Chapter 02-Regression Analyisis - 2022
41 pages
Regression
No ratings yet
Regression
6 pages
CO 4 Session 34 Linear Regression and Its Applications
No ratings yet
CO 4 Session 34 Linear Regression and Its Applications
21 pages
9 Regression (Statistics IEM 2-2)
No ratings yet
9 Regression (Statistics IEM 2-2)
32 pages
Econometrics For Finance
100% (1)
Econometrics For Finance
54 pages
Business Stat 10 12 .PDF
No ratings yet
Business Stat 10 12 .PDF
144 pages
2.04 Regression - Describing The Line
No ratings yet
2.04 Regression - Describing The Line
2 pages
Regression: - Regression: - Linear Regression: - Uses
No ratings yet
Regression: - Regression: - Linear Regression: - Uses
14 pages
Unit 3 notes
No ratings yet
Unit 3 notes
35 pages
Chapter 9
No ratings yet
Chapter 9
23 pages
13-52Statistics Ch 13 2024
No ratings yet
13-52Statistics Ch 13 2024
14 pages
Linear Regression Full Version
No ratings yet
Linear Regression Full Version
34 pages
1.5 Linear Regression Using Technology (Filled In)
No ratings yet
1.5 Linear Regression Using Technology (Filled In)
3 pages
Topics: Regression
No ratings yet
Topics: Regression
26 pages
Regression-Analysis
No ratings yet
Regression-Analysis
31 pages
Business Statistics by Gupta 365 379
No ratings yet
Business Statistics by Gupta 365 379
15 pages
5 - Part II - Regression Analysis w-notes(1)
No ratings yet
5 - Part II - Regression Analysis w-notes(1)
10 pages
Unit 2 Regression Analysis
No ratings yet
Unit 2 Regression Analysis
16 pages
Regression Basics: Predicting A DV With A Single IV
No ratings yet
Regression Basics: Predicting A DV With A Single IV
20 pages
Session 15 Regression and Correlation
No ratings yet
Session 15 Regression and Correlation
66 pages
A Tutorial On How To Run A Simple Linear Regression in Excel
No ratings yet
A Tutorial On How To Run A Simple Linear Regression in Excel
19 pages
Unit 9 Linear Regression: Structure
No ratings yet
Unit 9 Linear Regression: Structure
18 pages
Topic:-Regression: Name: - Teotia Nidhi Class: - M.SC Biotechnology
No ratings yet
Topic:-Regression: Name: - Teotia Nidhi Class: - M.SC Biotechnology
11 pages
Chapter 4 (Regression part)
No ratings yet
Chapter 4 (Regression part)
13 pages
Pradytha Galuh Putranti_2304220013_SSD_B ING-STAT (2)
No ratings yet
Pradytha Galuh Putranti_2304220013_SSD_B ING-STAT (2)
26 pages
Regression Analysis
No ratings yet
Regression Analysis
29 pages
Regression Analysis
No ratings yet
Regression Analysis
14 pages
Regression Equation: Independent Variable Predictor Variable Explanatory Variable Dependent Variable Response Variable
No ratings yet
Regression Equation: Independent Variable Predictor Variable Explanatory Variable Dependent Variable Response Variable
60 pages
Econometrics for Finace Lecture II-Session Three
No ratings yet
Econometrics for Finace Lecture II-Session Three
32 pages
Prediction Is A Key Task of Statistics
No ratings yet
Prediction Is A Key Task of Statistics
18 pages
Regression
No ratings yet
Regression
60 pages
Coding 2
No ratings yet
Coding 2
3 pages
Regression Analysis
No ratings yet
Regression Analysis
21 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
20 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Numerical Analysis II Essentials
From Everand
Numerical Analysis II Essentials
The Editors of REA
No ratings yet
Mathematical Functions
From Everand
Mathematical Functions
Oliver Linton
No ratings yet
Useful Formulae: Mathematical & Physical
From Everand
Useful Formulae: Mathematical & Physical
Matthew Watkins
No ratings yet
Work-Life Balance Programs To Improve Employee Performance
No ratings yet
Work-Life Balance Programs To Improve Employee Performance
329 pages
Vol 71, No 30 PDF
No ratings yet
Vol 71, No 30 PDF
72 pages
Civil Service Reform Strategic Action Plan (Eng) PDF
No ratings yet
Civil Service Reform Strategic Action Plan (Eng) PDF
38 pages
The Differences Between Administration and Management
No ratings yet
The Differences Between Administration and Management
14 pages
Engineering Equality: Government Launches Economic Policy
No ratings yet
Engineering Equality: Government Launches Economic Policy
15 pages
CB Bank Assignment
100% (1)
CB Bank Assignment
16 pages
Data Integration
No ratings yet
Data Integration
7 pages
One-Sample Hypothesis Test Examples: (Chapter 10)
No ratings yet
One-Sample Hypothesis Test Examples: (Chapter 10)
5 pages
I-Banking CKTA Persentation
No ratings yet
I-Banking CKTA Persentation
5 pages
Non Monetary Compensation
No ratings yet
Non Monetary Compensation
6 pages
AEC Scorecard 3
No ratings yet
AEC Scorecard 3
34 pages
Herzberg's Motivation Theory - Two Factor Theory
No ratings yet
Herzberg's Motivation Theory - Two Factor Theory
19 pages
CLOUD VISION TECHNOLOGY (Myanmar) PDF
50% (2)
CLOUD VISION TECHNOLOGY (Myanmar) PDF
8 pages
Chapter 11 IS-LM Model
100% (1)
Chapter 11 IS-LM Model
38 pages
Nutanix Spec Sheet July16
No ratings yet
Nutanix Spec Sheet July16
9 pages
CLOUD VISION TECHNOLOGY (Myanmar) PDF
50% (2)
CLOUD VISION TECHNOLOGY (Myanmar) PDF
8 pages
In Dept Myanmar
100% (1)
In Dept Myanmar
66 pages
A History of The Burma Socialist Party (1930-1964) PDF
No ratings yet
A History of The Burma Socialist Party (1930-1964) PDF
40 pages
Ge 10 Pretest
No ratings yet
Ge 10 Pretest
3 pages
Classification Pros Cons
No ratings yet
Classification Pros Cons
1 page
When To Choose UCF Over CCF
No ratings yet
When To Choose UCF Over CCF
6 pages
Pengaruh Kualitas Produk Dan Harga Terhadap Keputusan Pembelian Mobil Daihatsu Grand Max Pick Up
No ratings yet
Pengaruh Kualitas Produk Dan Harga Terhadap Keputusan Pembelian Mobil Daihatsu Grand Max Pick Up
12 pages
Recap
No ratings yet
Recap
75 pages
Syllabus Applied Microeconometrics 23fall
No ratings yet
Syllabus Applied Microeconometrics 23fall
5 pages
Testing The Assumptions of Linear Regression
100% (1)
Testing The Assumptions of Linear Regression
14 pages
Advanced Statistical Approaches To Quality: INSE 6220 - Week 4
No ratings yet
Advanced Statistical Approaches To Quality: INSE 6220 - Week 4
44 pages
Econometrics I AMU
No ratings yet
Econometrics I AMU
145 pages
Introduction To Econometrics, 5 Edition: Chapter 1: Simple Regression Analysis
No ratings yet
Introduction To Econometrics, 5 Edition: Chapter 1: Simple Regression Analysis
26 pages
Compensation For Sales Professionals
100% (1)
Compensation For Sales Professionals
12 pages
(eBook PDF) Statistics for Business Economics 13th Edition by Davidinstant download
100% (2)
(eBook PDF) Statistics for Business Economics 13th Edition by Davidinstant download
43 pages
Certificate in Business Statistics (VRQ) : Pearson LCCI
No ratings yet
Certificate in Business Statistics (VRQ) : Pearson LCCI
20 pages
How To Write A Systematic Review A Step by Step Guide
100% (2)
How To Write A Systematic Review A Step by Step Guide
6 pages
Section 1: Cross-Validation and Model Performance
No ratings yet
Section 1: Cross-Validation and Model Performance
33 pages
Sol Linear Regression by Hand
No ratings yet
Sol Linear Regression by Hand
3 pages
Repaso Final - Estadistica, Spring 2022 - WebAssign
No ratings yet
Repaso Final - Estadistica, Spring 2022 - WebAssign
20 pages
Steve Humble - Quantitative Analysis of Questionnaires - Techniques To Explore Structures and Relationships-Routledge - Taylor & Francis Group (2020)
No ratings yet
Steve Humble - Quantitative Analysis of Questionnaires - Techniques To Explore Structures and Relationships-Routledge - Taylor & Francis Group (2020)
234 pages
STATISTICS
No ratings yet
STATISTICS
48 pages
Type I & Type II Error
No ratings yet
Type I & Type II Error
19 pages
Statistics and Probability 3rd Quarter 3rd Assessment
No ratings yet
Statistics and Probability 3rd Quarter 3rd Assessment
6 pages
Lab 5 - Hypothesis Testing Using One Sample T-Test: Table 1
No ratings yet
Lab 5 - Hypothesis Testing Using One Sample T-Test: Table 1
7 pages
Pengaruh Teamwork Dan Komunikasi Internal Terhadap Kinerja Karyawan Pada PT. Penta Valent Denpasar
No ratings yet
Pengaruh Teamwork Dan Komunikasi Internal Terhadap Kinerja Karyawan Pada PT. Penta Valent Denpasar
8 pages
Doing Bayesian Data Analysis With JASP: Darrell A. Worthy
No ratings yet
Doing Bayesian Data Analysis With JASP: Darrell A. Worthy
76 pages
Bastidas HW 5 Chap 3
No ratings yet
Bastidas HW 5 Chap 3
7 pages
IM & EE-lecture ppt -CH-2
No ratings yet
IM & EE-lecture ppt -CH-2
29 pages
Regression Analysis
No ratings yet
Regression Analysis
34 pages
Qvive PMP Formulas PMBOK6 v1b
No ratings yet
Qvive PMP Formulas PMBOK6 v1b
1 page
Lab 01 - Scientific Method and Statistics (New Version)
0% (1)
Lab 01 - Scientific Method and Statistics (New Version)
25 pages

Section 9.2, Linear Regression

Uploaded by

Section 9.2, Linear Regression

Uploaded by

Section 9.

You might also like