0% found this document useful (0 votes)

2 views8 pages

Experiment No.8 - Fit Simple Linear Regression Models Using Built-In Functions.

This document outlines the process of fitting simple linear regression models using R, including definitions, equations, and practical applications. It provides step-by-step instructions for using built-in functions such as lm() and summary() to analyze datasets, along with examples using the mtcars dataset. Additionally, it includes practice questions to reinforce the concepts learned.

Uploaded by

Sanchita Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views8 pages

Experiment No.8 - Fit Simple Linear Regression Models Using Built-In Functions.

Uploaded by

Sanchita Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Experiment # 8

Aim: Fit simple linear regression models using built-in functions.

8.1. Definition:-
Linear Regression:-
A linear regression is a statistical model that analyses the relationship between a response
variable/dependent variable (often called y) and one predictor variables (often called x or
explanatory variables) and their interactions using a regression line.

Linear Regression Equation is y=ax+b

where a is slope and b is intercept.

Example: when we calculate the age of a child based on their height, we assumed how older
they are, the taller they will be.
In this particular example, you can calculate the height of a child if you know her/his age:
𝑯𝒆𝒊𝒈𝒉𝒕 = 𝒂 + 𝑨𝒈𝒆 × 𝒃
In this case, a and b are called the intercept and the slope, respectively. The slope measures the
change in height with respect to the age in months (or years). In general, for every month older
the child is, their height will increase with b.

R - Multiple Regression:-
Multiple regression is an extension of linear regression into relationship between more than
two variables. In simple linear relation we have one predictor and one response variable, but in
multiple regression we have more than one predictor variable and one response variable.

The general mathematical equation for multiple regression is –

y = a + b1x1 + b2x2 +...+bnxn
where
•y is the response variable.
•a, b1, b2,...,bn are the coefficients.
•x1, x2, ..., xn are the predictor variables.

Real world Applications: (Why are you studying this model)

1. Predicting house prices based on features like size, location, etc.
2. Estimating sales revenue based on advertising spend.
3. Analyzing relationships between biological or environmental variables.
4. Analyzing relationships between Mid-term marks and End-term marks.

8.2. Commands and calculation of R: Basic steps to perform Linear Regression in R

1. Use the lm() function to fit a linear model.
The model lm() determines the value of the coefficients using the input data. Next we
can predict the value of the response variable for a given set of predictor variables
using these coefficients.
The syntax is:
model <- lm(Y ~ X, data = dataset)
Here, Y is the dependent variable, X is the independent variable, and data is the data
frame containing the variable you want to study.

2. Check the Model Summary: The summary() function provides detailed information
on the model, including coefficients, R-squared, and p-values.

summary(model)
• Coefficients: The estimated values for the intercept and the slope.
• R-squared: A measure of how well the model explains the variance in the data.
• p-values: To test the significance of the coefficients.
• Residual standard error: A measure of the typical size of the residuals.

3. Plot the Model : A quick visualization of the model fit can be achieved using plot().

plot(dataset$X, dataset$Y)

abline(model, col = "blue")

abline() - Adds the regression

line to the plot.

8.3. Example of Simple Linear Regression in R:

Example no.1:-

Using an in-built dataset like mtcars in R is quite simple. Here’s a step-by-step guide on how
to use an in-built dataset in R: (Instead of using pre-loaded dataset we can also use our own
file, such as CSV file, dataframe etc.)
Step 1: Load the Dataset
For most in-built datasets, you don’t need to explicitly load them; they are pre-loaded with the
datasets package, which comes with base R. Simply type the dataset name to view it:

data(mtcars)
View(mtcars)

Step 2: Explore the Dataset

str(mtcars)
'data.frame': 32 obs. of 11 variables:
$ mpg : num 21 21 22.8 21.4 18.7 18.1 14.3 24.4 22.8 19.2 ...
$ cyl : num 6 6 4 6 8 6 8 4 4 6 ...
$ disp: num 160 160 108 258 360 ...
$ hp : num 110 110 93 110 175 105 245 62 95 123 ...
$ drat: num 3.9 3.9 3.85 3.08 3.15 2.76 3.21 3.69 3.92 3.92 ...
$ wt : num 2.62 2.88 2.32 3.21 3.44 ...
$ qsec: num 16.5 17 18.6 19.4 17 ...
$ vs : num 0 0 1 1 0 1 0 1 1 1 ...
$ am : num 1 1 1 0 0 0 0 0 0 0 ...
$ gear: num 4 4 4 3 3 3 3 4 4 4 ...
$ carb: num 4 4 1 1 2 1 4 2 2 4 ...

Step 3: Visualize the Data: Basic plots are useful for understanding the relationships in the
dataset. For example, with mtcars

Plotting example: scatter plot of mpg vs wt (weight)

plot(mtcars$wt, mtcars$mpg, main = "MPG vs Weight", xlab = "Weight (1000 lbs)",
ylab = "Miles Per Gallon",col=c("green","red"))

Step 4: Analyze the Data

Now that the dataset is loaded and explored, you can apply various statistical models or
functions. For example, performing a linear regression:
model <- lm(mpg ~ wt, data = mtcars)
summary(model)

Call:
lm(formula = mpg ~ wt, data = mtcars)

Residuals:
Min 1Q Median 3Q Max
-4.5432 -2.3647 -0.1252 1.4096 6.8727
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 37.2851 1.8776 19.858 < 2e-16 ***
wt -5.3445 0.5591 -9.559 1.29e-10 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 3.046 on 30 degrees of
freedom Multiple R-squared: 0.7528, Adjusted R-
squared: 0.7446
F-statistic: 91.38 on 1 and 30 DF, p-value: 1.294e-10

Add the regression line to the plot

> abline(model, col = "blue")
Example No. 2:-
x <- c(151,174,138,186,128,136,179,163,152,131)
y <- c(63,81,56,91,47,57,76,72,62,48)
relation=lm(x~y)
print(relation)
Call:
lm(formula = x ~ y)

Coefficients:
(Intercept) y
61.380 1.415
>print(summary(relation))

Call:
lm(formula = x ~ y)

Residuals:
Min 1Q Median 3Q Max
-6.0529 -2.4833 -0.0912 1.3774 10.0562

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 61.3803 7.2653 8.448 2.94e-05 ***
y 1.4153 0.1089 12.997 1.16e-06 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 4.712 on 8 degrees of freedom

Multiple R-squared: 0.9548, Adjusted R-squared: 0.9491
F-statistic: 168.9 on 1 and 8 DF, p-value: 1.164e-06
> cor(x,y)
[1] 0.9771296
Therefore the regression line is y=61.3803+1.4153x where slope is 1.4153 and intercept is 61.38
03

plot(y,x,main ="Height & Weight Regression",xlab ="Weight in Kg",ylab ="Height in

cm",col=c("red","green"))
abline(lm(x~y),col="blue")

8.4 Practice Questions

Predicting Fuel Efficiency with the mtcars Dataset
1.The mtcars dataset in R contains various attributes of different car models, such as miles per
gallon (mpg), horsepower (hp), weight (wt), and more. Your task is to predict the fuel efficiency
(mpg) of cars based on their weight (wt) and horsepower (hp).

2. Fit a linear model for inbuilt data-women like mtcars.

Note:-

• Standard Error: Measures the precision of the coefficient estimates. Smaller values suggest more
precise estimates.

• t value: A measure of how many standard errors the estimated coefficient is away from 0. Larger
values indicate that the predictor is more significant.

• Pr(>|t|): The p-value for testing the null hypothesis that the coefficient is zero. A small p-value
(typically < 0.05) indicates that the predictor is statistically significant.

• R-squared: A measure of how well the model fits the data. It indicates the proportion of variance in
the dependent variable explained by the independent variable(s).

• Residual Standard Error: The standard deviation of the residuals. Smaller values indicate better fit.
• F-statistic and p-value: Tests the overall significance of the model. A significant F-statistic (p-value <
0.05) indicates that at least one of the predictors is significantly related to the dependent variable.

M348 Applied Statistical Modelling - Linear Models
No ratings yet
M348 Applied Statistical Modelling - Linear Models
504 pages
R-Programming - Unit 5
No ratings yet
R-Programming - Unit 5
43 pages
During Ketu Mahadasha
No ratings yet
During Ketu Mahadasha
10 pages
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
No ratings yet
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
89 pages
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
From Everand
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
SUJAUL CHOWDHURY
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 03
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 03
16 pages
Business Analytics Unit - V Notes - 60637708 - 2025 - 05 - 15 - 02 - 16
No ratings yet
Business Analytics Unit - V Notes - 60637708 - 2025 - 05 - 15 - 02 - 16
37 pages
Adc Lab Manual PDF
75% (4)
Adc Lab Manual PDF
74 pages
R Stastics PDF
No ratings yet
R Stastics PDF
30 pages
R Module 11 - Statistics
No ratings yet
R Module 11 - Statistics
35 pages
Dar Lec10
No ratings yet
Dar Lec10
22 pages
Ajmal Steel - Product Catalog V2 1
No ratings yet
Ajmal Steel - Product Catalog V2 1
42 pages
NEB Class 12 Computer Recent Trends in Technology Notes
No ratings yet
NEB Class 12 Computer Recent Trends in Technology Notes
17 pages
Chapter 8 B - Trendlines and Regression Analysis
No ratings yet
Chapter 8 B - Trendlines and Regression Analysis
73 pages
SC&RP - Unit 5
No ratings yet
SC&RP - Unit 5
36 pages
Chapter4 Notes
No ratings yet
Chapter4 Notes
18 pages
Asset v1 - Indic AI+PR103+2020 - T3+type@asset+block@1 Running Linear Regression in R
No ratings yet
Asset v1 - Indic AI+PR103+2020 - T3+type@asset+block@1 Running Linear Regression in R
74 pages
Machine Learning-Lecture 1 (Student)
No ratings yet
Machine Learning-Lecture 1 (Student)
14 pages
Regression Analysis Assignment1111
No ratings yet
Regression Analysis Assignment1111
13 pages
Regression Analysis Using R
No ratings yet
Regression Analysis Using R
17 pages
Statistical Analysis
No ratings yet
Statistical Analysis
26 pages
Evans Analytics2e PPT 08
No ratings yet
Evans Analytics2e PPT 08
65 pages
Linear Regression
No ratings yet
Linear Regression
13 pages
WEEK
No ratings yet
WEEK
17 pages
Experiment 8
No ratings yet
Experiment 8
4 pages
6th Lecture Note 108335647 230518 203102
No ratings yet
6th Lecture Note 108335647 230518 203102
35 pages
21BCS5999 - Ankit Kumar (Assignment 2)
No ratings yet
21BCS5999 - Ankit Kumar (Assignment 2)
16 pages
R Workshop PART 2
No ratings yet
R Workshop PART 2
36 pages
R Unit 4th and 5th
No ratings yet
R Unit 4th and 5th
17 pages
Vemana Padyalu 2 05131155
No ratings yet
Vemana Padyalu 2 05131155
114 pages
Linear Model
No ratings yet
Linear Model
10 pages
DS Exp6
No ratings yet
DS Exp6
5 pages
WINSEM2024-25 CSE3506 ELA CH2024250502181 Reference Material III 21-12-2024 21NEW3
No ratings yet
WINSEM2024-25 CSE3506 ELA CH2024250502181 Reference Material III 21-12-2024 21NEW3
7 pages
Predictive Modeling-Handouts
No ratings yet
Predictive Modeling-Handouts
11 pages
Exam 1 Notes
No ratings yet
Exam 1 Notes
4 pages
Make Up Cat
No ratings yet
Make Up Cat
6 pages
Linear Regression With LM Function, Diagnostic Plots, Interaction Term, Non-Linear Transformation of The Predictors, Qualitative Predictors
100% (1)
Linear Regression With LM Function, Diagnostic Plots, Interaction Term, Non-Linear Transformation of The Predictors, Qualitative Predictors
15 pages
Final Cost Practical
No ratings yet
Final Cost Practical
29 pages
Unit-15 Data Analysis and R
No ratings yet
Unit-15 Data Analysis and R
12 pages
DTSL B2
No ratings yet
DTSL B2
4 pages
Lab 10 Forest Regression
No ratings yet
Lab 10 Forest Regression
5 pages
Ceccato CSA 5.5-20 (IVR) Spare Parts List EN 2200789700 Ed 04
No ratings yet
Ceccato CSA 5.5-20 (IVR) Spare Parts List EN 2200789700 Ed 04
38 pages
Unit5 R
No ratings yet
Unit5 R
5 pages
Lab 4
No ratings yet
Lab 4
7 pages
Intro To Stereo Logy Grain Size
No ratings yet
Intro To Stereo Logy Grain Size
64 pages
20BCE1205 Lab3
No ratings yet
20BCE1205 Lab3
9 pages
Linear Regression
No ratings yet
Linear Regression
17 pages
R Lab 4
No ratings yet
R Lab 4
7 pages
R Lab 3
No ratings yet
R Lab 3
7 pages
What Is 30c8 Steel
No ratings yet
What Is 30c8 Steel
3 pages
3010 Lab Model Diagnostic-1
No ratings yet
3010 Lab Model Diagnostic-1
4 pages
Classification of Things
No ratings yet
Classification of Things
6 pages
Updated Faculty List & Specialisationas On April 2018
No ratings yet
Updated Faculty List & Specialisationas On April 2018
1 page
Mtcars: Choosing The Most Related Variable (S) To The Response
No ratings yet
Mtcars: Choosing The Most Related Variable (S) To The Response
13 pages
Mindanao State University General Santos City: Simple Linear Regression
No ratings yet
Mindanao State University General Santos City: Simple Linear Regression
12 pages
Regression An Ova
No ratings yet
Regression An Ova
24 pages
Ramezankhani 2018
No ratings yet
Ramezankhani 2018
51 pages
RegrCorr PDF
No ratings yet
RegrCorr PDF
20 pages
Shivam Batra (19BPS1131) 21/01/2022: List
No ratings yet
Shivam Batra (19BPS1131) 21/01/2022: List
5 pages
Lab-3: Regression Analysis and Modeling Name: Uid No. Objective
No ratings yet
Lab-3: Regression Analysis and Modeling Name: Uid No. Objective
9 pages
R Studio Cheat Sheet
No ratings yet
R Studio Cheat Sheet
6 pages
Simple Regression Model Fitting
No ratings yet
Simple Regression Model Fitting
5 pages
Introduction to Applied Econometrics Analysis Using Stata
From Everand
Introduction to Applied Econometrics Analysis Using Stata
Justin Doran
5/5 (3)
Solution7 AMLP
No ratings yet
Solution7 AMLP
14 pages
Experiment 7 (I) : Artificial Intelligence & Machine Learning Lab
No ratings yet
Experiment 7 (I) : Artificial Intelligence & Machine Learning Lab
4 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
7 pages
BDA Exp7 Removed
No ratings yet
BDA Exp7 Removed
4 pages
Using R For Linear Regression
No ratings yet
Using R For Linear Regression
9 pages
Unit 4 (History of English II)
No ratings yet
Unit 4 (History of English II)
26 pages
How To Use "Qqplot": X: Independent Variable, Y: Dependent Variable
No ratings yet
How To Use "Qqplot": X: Independent Variable, Y: Dependent Variable
6 pages
In Class Exercise Linear Regression in R
No ratings yet
In Class Exercise Linear Regression in R
6 pages
Which Test When: 1 Exploratory Tests
No ratings yet
Which Test When: 1 Exploratory Tests
5 pages
Activity #1 - IE 503
No ratings yet
Activity #1 - IE 503
5 pages
P1 Further Differentiation
No ratings yet
P1 Further Differentiation
8 pages
Behavioural Sciences Syllabus
No ratings yet
Behavioural Sciences Syllabus
8 pages
Word How Yo Use
No ratings yet
Word How Yo Use
20 pages
Huawei Optixstar S800E Datasheet: Product Overview
No ratings yet
Huawei Optixstar S800E Datasheet: Product Overview
3 pages
2-Linear Regression and Correlation in R Commander-CV
No ratings yet
2-Linear Regression and Correlation in R Commander-CV
2 pages
Print Math 11 Mod 1
No ratings yet
Print Math 11 Mod 1
13 pages
Second Concept Paper
No ratings yet
Second Concept Paper
5 pages
MSDS Zetag 8125 PDF
No ratings yet
MSDS Zetag 8125 PDF
8 pages
What Makes A Retributive Theory of Justice Valuable - R - Askphilosophy
No ratings yet
What Makes A Retributive Theory of Justice Valuable - R - Askphilosophy
6 pages
Polidoros XL 50
No ratings yet
Polidoros XL 50
5 pages
CHITOCEL TDS EN 1100519 OENO Italy
No ratings yet
CHITOCEL TDS EN 1100519 OENO Italy
2 pages
A Study of Inbound Logistics Mode Based On JIT Production in Cruise Ship Construction
No ratings yet
A Study of Inbound Logistics Mode Based On JIT Production in Cruise Ship Construction
18 pages
Green Awareness Through Social Media
No ratings yet
Green Awareness Through Social Media
9 pages
Sally Ozonoff Autism Spectrum Disorder Sacramento
No ratings yet
Sally Ozonoff Autism Spectrum Disorder Sacramento
3 pages
Class Xii Neet, Test Planner For Pt-Iii
No ratings yet
Class Xii Neet, Test Planner For Pt-Iii
1 page
If I Were President I Would
No ratings yet
If I Were President I Would
3 pages
Supplier Problem Report: GAGNOLET Christophe
No ratings yet
Supplier Problem Report: GAGNOLET Christophe
3 pages

Experiment No.8 - Fit Simple Linear Regression Models Using Built-In Functions.

Uploaded by

Experiment No.8 - Fit Simple Linear Regression Models Using Built-In Functions.

Uploaded by

Experiment # 8

Aim: Fit simple linear regression models using built-in functions.

Linear Regression Equation is y=ax+b

where a is slope and b is intercept.

The general mathematical equation for multiple regression is –

Real world Applications: (Why are you studying this model)

8.2. Commands and calculation of R: Basic steps to perform Linear Regression in R

abline(model, col = "blue")

abline() - Adds the regression

8.3. Example of Simple Linear Regression in R:

Step 2: Explore the Dataset

Plotting example: scatter plot of mpg vs wt (weight)

Step 4: Analyze the Data

Add the regression line to the plot

Residual standard error: 4.712 on 8 degrees of freedom

plot(y,x,main ="Height & Weight Regression",xlab ="Weight in Kg",ylab ="Height in

8.4 Practice Questions

2. Fit a linear model for inbuilt data-women like mtcars.

You might also like