Regression Script

The document outlines the process of fitting a linear regression model using the mtcars dataset and performing various assumption checks. It covers linearity, normality of residuals, homoscedasticity, multicollinearity, and independence of residuals, providing interpretations for each check. The document emphasizes the importance of these assumptions in ensuring the validity of the regression model.

Uploaded by

rgulati005


# Load necessary libraries

library(ggplot2)
library(car) # for VIF
library(lmtest) # for Breusch-Pagan and Durbin-Watson tests
library(nortest) # for normality tests
library(ggfortify) # for diagnostic plots

# Load dataset
data(mtcars)

# Fit the linear regression model

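# The predictors below (wt, hp, disp, drat) are an illustrative choice from mtcars;
# substitute your own response, predictors, and data frame as needed.
model <- lm(mpg ~ wt + hp + disp + drat, data = mtcars)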
summary(model)

# Interpretation:
# The summary shows coefficient estimates, standard errors, p-values, and R-squared.
# Predictors with p < 0.05 are significantly associated with mpg, holding the other predictors fixed.
# R-squared is the proportion of the variation in mpg that the model explains.
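
# A minimal sketch of reading those quantities out of the summary object
# programmatically. The formula mpg ~ wt + hp + disp + drat is an assumption
# made for illustration; it is not specified by the original script.

```r
# Assumed illustrative model on the built-in mtcars data
fit <- lm(mpg ~ wt + hp + disp + drat, data = mtcars)
s <- summary(fit)

# Coefficient matrix with columns: Estimate, Std. Error, t value, Pr(>|t|)
coef_table <- s$coefficients
p_values <- coef_table[, "Pr(>|t|)"]

# Names of terms (including the intercept) whose p-value is below 0.05
significant <- names(p_values)[p_values < 0.05]

print(round(coef_table, 4))
cat("Terms with p < 0.05:", paste(significant, collapse = ", "), "\n")
cat("R-squared:", round(s$r.squared, 3),
    " Adjusted R-squared:", round(s$adj.r.squared, 3), "\n")
```

# Adjusted R-squared is usually the safer figure to compare across models
# with different numbers of predictors.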

# --- Assumption Checks with Interpretations ---

## 1. Linearity Assumption
plot(model, which = 1)

# Interpretation:
# The Residuals vs Fitted plot should show no distinct pattern.
# A random scatter suggests a linear relationship between predictors and the dependent variable.
# A curved trend suggests that linearity is violated; a funnel shape points to non-constant variance instead (see check 3).

## 2. Normality of Residuals
residual <- residuals(model)
shapiro.test(residual)
qqnorm(residual, col = "red")   # Q-Q plot of the residuals
qqline(residual, col = "green") # reference line through the quartiles

# Interpretation:
# Q-Q plot should show points roughly along the diagonal.
# A Shapiro-Wilk p-value > 0.05 means there is no significant evidence against normality.
# If p < 0.05, the residuals deviate significantly from a normal distribution.

# Plot the histogram

hist(residual,
     breaks = 10,
     col = "skyblue",
     main = "Histogram of Residuals",
     xlab = "Residuals",
     ylab = "Frequency")

# Shape:
# Look for a bell-shaped curve that is roughly symmetrical around zero.
# If the shape is skewed or multi-peaked, it suggests non-normality.
# Center:
# The bulk of residuals should be centered around zero.
# Spread:
# Residuals should be spread reasonably without extreme outliers.
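
# One way to make the bell-shape comparison easier is to draw the histogram on
# the density scale and overlay a normal curve with the residuals' own mean and
# standard deviation. This is a sketch only; the model formula is an assumed
# illustration, not part of the original script.

```r
# Assumed illustrative model on the built-in mtcars data
fit <- lm(mpg ~ wt + hp + disp + drat, data = mtcars)
res <- residuals(fit)

# freq = FALSE puts the y-axis on the density scale so the curve is comparable
hist(res,
     breaks = 10,
     freq = FALSE,
     col = "skyblue",
     main = "Histogram of Residuals with Normal Curve",
     xlab = "Residuals")

# Normal density using the sample mean and standard deviation of the residuals
curve(dnorm(x, mean = mean(res), sd = sd(res)),
      add = TRUE, col = "red", lwd = 2)
```

# With only 32 observations the bars will look ragged even when the residuals
# are close to normal, so read the overlay as a rough guide.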

## 3. Homoscedasticity (Constant Variance)

bptest(model)

# Interpretation:
# The Breusch-Pagan test checks for equal residual variance (homoscedasticity).
# p-value > 0.05 = no significant evidence of non-constant variance → assumption is plausible.
# p < 0.05 = heteroscedasticity (non-constant variance): coefficient estimates stay unbiased, but the usual standard errors and p-values become unreliable.
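
# To show what the test measures, the statistic can be sketched by hand in
# base R: regress the squared residuals on the predictors and use n * R-squared
# from that auxiliary regression. This is the studentized (Koenker) form, which
# should correspond to the default of bptest(). The model formula is an
# assumption made for illustration.

```r
# Assumed illustrative model on the built-in mtcars data
fit <- lm(mpg ~ wt + hp + disp + drat, data = mtcars)

# Auxiliary regression: squared residuals on the same predictors
aux_data <- cbind(mtcars, e2 = residuals(fit)^2)
aux <- lm(e2 ~ wt + hp + disp + drat, data = aux_data)

n <- nrow(aux_data)
k <- length(coef(aux)) - 1            # number of predictors, excluding intercept

bp_stat <- n * summary(aux)$r.squared # LM statistic
bp_p <- pchisq(bp_stat, df = k, lower.tail = FALSE)

cat("BP statistic:", round(bp_stat, 3), " df:", k,
    " p-value:", round(bp_p, 4), "\n")
```

# A large statistic means the predictors explain a noticeable share of the
# squared residuals, which is what non-constant variance looks like.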

# Generate the Scale-Location plot

plot(model, which = 3)

# --- INTERPRETATION GUIDE ---

# Scale-Location Plot (aka Spread-Location or √|Standardized Residuals| vs Fitted values):
# - X-axis: Fitted values (predicted mpg)
# - Y-axis: Square root of the absolute standardized residuals
# - Each point = one observation
# - Red smooth line helps identify any trend in the spread

# What you want to see:

# Points scattered randomly around the red line with no clear pattern.
# The red line is mostly flat.
# This suggests **homoscedasticity**: the variance of residuals is roughly constant.

# What you DON'T want to see:

# A funnel shape (spread increases or decreases across fitted values)
# A curved or sloped red line
# This suggests **heteroscedasticity**, meaning the model’s residuals have **non-constant variance**.

## 4. Multicollinearity
vif(model)
# Interpretation:
# VIF values above 5 (some say 10) indicate high multicollinearity.
# A high VIF means a predictor is largely explained by the other predictors, which inflates its standard error and makes its coefficient estimate unstable.
# Try removing or combining variables with high VIF.
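
# The VIF of a predictor is 1 / (1 - R-squared), where R-squared comes from
# regressing that predictor on all the others. A minimal base-R sketch of that
# definition follows; the predictor set is an assumed illustration, not taken
# from the original script.

```r
# Assumed illustrative predictors from the built-in mtcars data
predictors <- c("wt", "hp", "disp", "drat")

vif_manual <- sapply(predictors, function(p) {
  others <- setdiff(predictors, p)
  # Regress predictor p on the remaining predictors
  aux <- lm(reformulate(others, response = p), data = mtcars)
  1 / (1 - summary(aux)$r.squared)
})

print(round(vif_manual, 2))
```

# A VIF of 1 means the predictor is uncorrelated with the others; the value
# grows without bound as the auxiliary R-squared approaches 1.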

## 5. Independence of Residuals
dwtest(model)

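# Interpretation:
# The Durbin-Watson test detects first-order autocorrelation in residuals; it is only meaningful when the rows have a natural order (e.g. time).
# DW ≈ 2 and p > 0.05 = no significant evidence of autocorrelation.
# p < 0.05 = autocorrelation present, common in time series data. By default dwtest() tests for positive autocorrelation (alternative = "greater").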
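
# The DW statistic itself is simple to compute by hand: the sum of squared
# successive differences of the residuals divided by their sum of squares.
# A sketch under the assumption of an illustrative model (the formula is not
# from the original script):

```r
# Assumed illustrative model on the built-in mtcars data
fit <- lm(mpg ~ wt + hp + disp + drat, data = mtcars)
e <- residuals(fit)
n <- length(e)

# Durbin-Watson statistic: values range from 0 to 4
dw <- sum(diff(e)^2) / sum(e^2)

# Lag-1 autocorrelation of the residuals; DW is approximately 2 * (1 - r1)
r1 <- sum(e[-1] * e[-n]) / sum(e^2)

cat("DW statistic:", round(dw, 3), "\n")
cat("Lag-1 autocorrelation:", round(r1, 3),
    " 2 * (1 - r1):", round(2 * (1 - r1), 3), "\n")
```

# Values well below 2 point to positive autocorrelation and values well above
# 2 to negative autocorrelation. The rows of mtcars have no time order, so the
# number here only illustrates the calculation.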

## Optional: Residual Plot

plot(model$residuals, main = "Residuals", ylab = "Residuals", xlab = "Index")
abline(h = 0, col = "red")

# Interpretation:
# Residuals should appear randomly scattered around zero.
# Patterns
