Chapter 3 Part 2

Chapter 3 discusses additional considerations in regression modeling, including nonlinearity, correlation of error terms, non-constant variance, outliers, high-leverage points, and collinearity. It highlights the importance of checking for collinearity using pairwise correlation and variance inflation factor (VIF), and suggests remedies like dropping or combining variables. The chapter also compares linear regression with K-nearest neighbors (KNN) regression, noting that linear regression may outperform KNN in certain settings, especially with fewer observations per predictor.


Chapter 3

Additional topics
Other Considerations in the Regression Model
1. Nonlinearity of the response-predictor relationship
2. Correlation of error terms

An important assumption of linear regression is that the error terms are uncorrelated.
If this assumption is violated, the estimated standard errors will tend to underestimate the true standard errors, so confidence intervals will be narrower than they should be (a small simulation below illustrates this).
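As a rough illustration, here is a minimal simulation sketch, not from the text, in which the errors follow an AR(1) process; all names and parameter values are illustrative. The average standard error reported by ordinary least squares comes out noticeably smaller than the slope estimate's true sampling variability:

```python
import numpy as np

# Simulation sketch (illustrative values): generate data whose errors are
# autocorrelated, fit OLS many times, and compare the average reported
# standard error of the slope with the empirical spread of the estimates.
rng = np.random.default_rng(0)
n, reps, rho = 100, 2000, 0.9
x = np.linspace(0, 1, n)
X = np.column_stack([np.ones(n), x])

slopes, reported_se = [], []
for _ in range(reps):
    e = np.zeros(n)
    for t in range(1, n):                    # AR(1) errors: correlated over "time"
        e[t] = rho * e[t - 1] + rng.normal()
    y = 1.0 + 2.0 * x + e
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    sigma2 = resid @ resid / (n - 2)         # usual OLS residual variance estimate
    cov = sigma2 * np.linalg.inv(X.T @ X)    # usual OLS covariance of coefficients
    slopes.append(beta[1])
    reported_se.append(np.sqrt(cov[1, 1]))

print(np.mean(reported_se))  # average SE reported by OLS
print(np.std(slopes))        # true variability of the slope (noticeably larger)
```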
3. Non-constant variance of error terms
4. Outliers
5. High-leverage points
" (%! & %)̅ "
Leverage statistics: ℎ! = + ∑& "
# #$%(%# &%̅ )
The leverage statistic is always between 1/n and 1, and its average over all n observations equals (p+1)/n.
If an observation has a leverage statistic that greatly exceeds (p+1)/n, then we
suspect that the corresponding point has high leverage (a computational sketch follows).
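As a check on these facts, a minimal sketch assuming simulated data (the names and cutoff are illustrative, not from the text): with multiple predictors, the leverage statistics are the diagonal entries of the hat matrix $H = X(X^\top X)^{-1}X^\top$.

```python
import numpy as np

# Leverage statistics as the diagonal of the hat matrix H = X (X'X)^{-1} X'.
rng = np.random.default_rng(0)
n, p = 50, 2
X = np.column_stack([np.ones(n), rng.normal(size=(n, p))])  # intercept + p predictors

H = X @ np.linalg.inv(X.T @ X) @ X.T
leverage = np.diag(H)

print(leverage.min() >= 1 / n, leverage.max() <= 1.0)  # always between 1/n and 1
print(np.isclose(leverage.mean(), (p + 1) / n))        # average leverage is (p+1)/n
flagged = np.where(leverage > 3 * (p + 1) / n)[0]      # illustrative high-leverage cutoff
print(flagged)
```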
6. Collinearity
Two or more predictors are closely related to one another.
It reduces the accuracy of the estimates of the regression coefficients: the standard error of $\hat{\beta}_j$ grows.
As a result, the t-statistic declines, and we may fail to reject $H_0\!: \beta_j = 0$ for a predictor that actually matters.
• For detecting collinearity (see the sketch after this list):
Check the pairwise correlations of the predictors.
Use the variance inflation factor (VIF):
$\mathrm{VIF}(\hat{\beta}_j) = \dfrac{1}{1 - R^2_{X_j \mid X_{-j}}}$
where $R^2_{X_j \mid X_{-j}}$ is the $R^2$ from a regression of $X_j$ onto all of the other predictors.
• What should we do when collinearity exists?
(1) Drop one of the problematic variables.
(2) Combine the collinear variables into a single predictor (e.g., principal components regression).
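A minimal VIF sketch, assuming simulated data in which one predictor is nearly a linear combination of the others (the variable names are illustrative); it uses statsmodels' variance_inflation_factor, which implements the formula above:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

# Simulated predictors: x3 is nearly x1 + x2, so its VIF should be large.
rng = np.random.default_rng(1)
n = 200
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)
x3 = x1 + x2 + 0.05 * rng.normal(size=n)  # nearly collinear predictor
X = pd.DataFrame({"x1": x1, "x2": x2, "x3": x3})

# VIF regresses each predictor on all the others; the design matrix passed to
# statsmodels should include the intercept column.
exog = sm.add_constant(X)
vifs = {col: variance_inflation_factor(exog.values, i)
        for i, col in enumerate(exog.columns) if col != "const"}
print(vifs)  # x1, x2, and x3 should all show inflated values
```

A VIF near 1 indicates no collinearity; values well above that (a common rule of thumb is 5 or 10) suggest a problematic amount.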
3.5 Comparison of Linear Regression with K-Nearest Neighbors

K-nearest neighbors (KNN) regression:
$\hat{f}(x_0) = \dfrac{1}{K} \sum_{x_i \in \mathcal{N}_0} y_i$
where $\mathcal{N}_0$ is the set of the K training observations closest to the prediction point $x_0$.
• The optimal value for K will depend on the bias-variance
tradeoff.
A small value of K provides the most flexible fit, which will have low bias but high variance.
Large values of K provide a smoother, less variable fit, but may introduce bias by averaging over observations far from $x_0$ (see the sketch below).
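A minimal sketch of this tradeoff, assuming a simulated one-dimensional example (the data and parameter values are illustrative); it fits scikit-learn's KNeighborsRegressor with a small and a large K:

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

# Toy data: noisy sine curve in one dimension.
rng = np.random.default_rng(2)
x = np.sort(rng.uniform(-3, 3, size=100)).reshape(-1, 1)
y = np.sin(x).ravel() + 0.3 * rng.normal(size=100)

knn_flexible = KNeighborsRegressor(n_neighbors=1).fit(x, y)   # low bias, high variance
knn_smooth = KNeighborsRegressor(n_neighbors=25).fit(x, y)    # smoother, higher bias

grid = np.linspace(-3, 3, 200).reshape(-1, 1)
print(knn_flexible.predict(grid)[:5])  # jagged, tracks individual noisy points
print(knn_smooth.predict(grid)[:5])    # smooth, averages many neighbors
```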

• In what setting will least squares linear regression outperform KNN regression?
Curse of dimensionality: when the number of predictors is large, the K observations nearest to $x_0$ may in fact be far away, and KNN performs poorly.
Parametric approaches tend to outperform nonparametric approaches when there is a small number of observations per predictor.

• Even in problems in which the dimension is small, we may prefer linear regression to KNN from an interpretability standpoint.
