0% found this document useful (0 votes)

15 views

Lab 2

Uploaded by

thulasi.v

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views

Lab 2

Uploaded by

thulasi.v

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Analyzing the Impact of Caloric Intake on Weight Gain: A

Simple Linear Regression Approach

Thulasi-2348152

2023-11-10

Introduction
This report aims to analyze the relationship between the number of calories consumed and
the corresponding weight gained in grams. The dataset provided contains information on
calories consumed and weight gained over a period. We will employ simple linear
regression to understand and model this relationship.

Analysis
a. Steps Involved in Building a Simple Linear Regression Model
Data Collection: The dataset includes two variables - “Weight_gained_grams” and
“Calories_Consumed.” Data Exploration: Examine the dataset for any anomalies, missing
values, or patterns.
Data Visualization: Create visualizations such as scatter plots to visualize the relationship
between calories consumed and weight gained.
Correlation Analysis: Calculate the correlation coefficient to quantify the strength and
direction of the relationship.
Model Training: Split the dataset into training and testing sets. Train the model on the
training set. Model Evaluation: Evaluate the model’s performance on the testing set using
appropriate metrics.
library(readr)
data <- read_csv("C:/Users/Admin/Downloads/calories_consumed.csv")

## Rows: 14 Columns: 2
## ── Column specification
────────────────────────────────────────────────────────
## Delimiter: ","
## dbl (2): Weight_gained_grams, Calories_Consumed
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this
message.
# a. Scatter Diagram and Coefficient of Correlation
plot(data$Calories_Consumed, data$Weight_gained_grams, main="Scatter Plot",
xlab="Calories Consumed", ylab="Weight Gained (grams)")

correlation_coefficient <- cor(data$Calories_Consumed,

data$Weight_gained_grams)
cat("Correlation Coefficient:", correlation_coefficient, "\n")

## Correlation Coefficient: 0.946991

We get a correlation of 0.946991 which determines a strong correlation between calories

consumed and weight gained.
# b. Parameter Estimation and Regression Line
model <- lm(Weight_gained_grams ~ Calories_Consumed, data=data)
summary(model)

##
## Call:
## lm(formula = Weight_gained_grams ~ Calories_Consumed, data = data)
##
## Residuals:
## Min 1Q Median 3Q Max
## -158.67 -107.56 36.70 81.68 165.53
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -625.75236 100.82293 -6.206 4.54e-05 ***
## Calories_Consumed 0.42016 0.04115 10.211 2.86e-07 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 111.6 on 12 degrees of freedom
## Multiple R-squared: 0.8968, Adjusted R-squared: 0.8882
## F-statistic: 104.3 on 1 and 12 DF, p-value: 2.856e-07

The intercept is -625.75. This is the estimated value of “Weight_gained_grams” when

“Calories_Consumed” is zero. The coefficient for “Calories_Consumed” is 0.42016. This
implies that, on average, for each additional unit of calories consumed, the weight gained is
expected to increase by 0.42016 grams.
fit=fitted.values(model)
scatter.smooth(data$Weight_gained_grams,data$Calories_Consumed,col="darkgreen
")
abline(lm(data$Weight_gained_grams ~ data$Calories_Consumed))

residuals <- residuals(model)

cat("Residuals:\n", residuals, "\n\n")

## Residuals:
## 103.5174 -140.6079 97.21979 -98.59224 -124.6392 63.50174 165.5331 -
110.5453 49.31377 87.14147 24.09077 -22.54525 -158.6706 65.28245
# R-squared value
r_squared <- summary(model)$r.squared
cat("R-squared Value:", r_squared, "\n\n")

## R-squared Value: 0.896792

Different ways to check the significance if the estimated value via

model.
#check through scatter plot of y and fitted values
scatter.smooth(data$Weight_gained_grams,fit,col="red")#to check how close y
and y estimated is.

R=cor(data$Weight_gained_grams,fit)
R

## [1] 0.946991

R^2

## [1] 0.896792

From above we could say that our predicted value is significant as we have the same
correlation value.The R-squared value is approximately 0.8968, meaning that around
89.68% of the variability in weight gained is explained by the model. This is a relatively
high R-squared value, indicating a strong relationship.
summary(model)

The critical t-value for a two-tailed test with 12 degrees of freedom at a 0.025 significance
level is approximately 2.179.Since the t-value of 10.211 is much larger than 2.179, you
would reject the null hypothesis for the coefficient of “Calories_Consumed” in a two-tailed
test as well. The large t-value indicates that the effect of “Calories_Consumed” is statistically
significant, whether considering a positive or negative relationship.

Conclusion
This report has demonstrated the application of simple linear regression to understand the
relationship between calories consumed and weight gained. The analysis provides insights
into the predictive power of the model and assesses its quality of fit based on the given
dataset.

UNIVERSAL BANK CASE SOLUTION
No ratings yet
UNIVERSAL BANK CASE SOLUTION
9 pages
1 Module 3: Peer Reviewed Assignment
No ratings yet
1 Module 3: Peer Reviewed Assignment
22 pages
Solution 3.1
No ratings yet
Solution 3.1
4 pages
MKT20019 A3 Group Report
No ratings yet
MKT20019 A3 Group Report
32 pages
Assignment 1:: Intro To Machine Learning
No ratings yet
Assignment 1:: Intro To Machine Learning
6 pages
Diabetes Case Study - Jupyter Notebook
100% (1)
Diabetes Case Study - Jupyter Notebook
10 pages
Solution 1
No ratings yet
Solution 1
6 pages
ISYE 6501 Georgia Tech hmwk3.1b
No ratings yet
ISYE 6501 Georgia Tech hmwk3.1b
5 pages
Sajjad DS
100% (2)
Sajjad DS
97 pages
How To Perform Structural Equation Modeling (SEM) in R - AGRON INFO TECH
No ratings yet
How To Perform Structural Equation Modeling (SEM) in R - AGRON INFO TECH
15 pages
Modern Regression Homework 5-1
No ratings yet
Modern Regression Homework 5-1
8 pages
Lab 9 Report
No ratings yet
Lab 9 Report
5 pages
A Short Introduction To The Caret Package: Max Kuhn June 20, 2013
No ratings yet
A Short Introduction To The Caret Package: Max Kuhn June 20, 2013
10 pages
Ex Nested Resampling
No ratings yet
Ex Nested Resampling
4 pages
A Short Introduction To Caret
No ratings yet
A Short Introduction To Caret
10 pages
ML0101EN Reg Simple Linear Regression Co2 Py v1
No ratings yet
ML0101EN Reg Simple Linear Regression Co2 Py v1
4 pages
K-Means in Python - Solution
No ratings yet
K-Means in Python - Solution
6 pages
Week-2 NK
No ratings yet
Week-2 NK
12 pages
BES - R Lab 9
No ratings yet
BES - R Lab 9
7 pages
Algorithum-explanantion
No ratings yet
Algorithum-explanantion
6 pages
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
No ratings yet
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
5 pages
2 - T.test Activity
No ratings yet
2 - T.test Activity
13 pages
Guide
No ratings yet
Guide
24 pages
Map Assign 8
No ratings yet
Map Assign 8
7 pages
Practical Machine Learning
No ratings yet
Practical Machine Learning
11 pages
Model Fine-Tuning_ Hyperparameter Optimization
No ratings yet
Model Fine-Tuning_ Hyperparameter Optimization
9 pages
ML Interview Questions
No ratings yet
ML Interview Questions
10 pages
S-2
No ratings yet
S-2
10 pages
Results
No ratings yet
Results
4 pages
Poisson Regression - Stata Data Analysis Examples
No ratings yet
Poisson Regression - Stata Data Analysis Examples
12 pages
Data Mining Models and Evaluation Techniques
No ratings yet
Data Mining Models and Evaluation Techniques
59 pages
question
No ratings yet
question
7 pages
Linear regression
No ratings yet
Linear regression
1 page
DAY 7 SESSION 2 Cross Validation
No ratings yet
DAY 7 SESSION 2 Cross Validation
18 pages
Big Data Machine Learning
100% (1)
Big Data Machine Learning
6 pages
Logit Regression - R Data Analysis Examples
No ratings yet
Logit Regression - R Data Analysis Examples
12 pages
Lab(Revised)
No ratings yet
Lab(Revised)
4 pages
Zerox Ready
No ratings yet
Zerox Ready
21 pages
Ayuda Comandos Stata Meta
No ratings yet
Ayuda Comandos Stata Meta
42 pages
Cheat Sheet Final
100% (2)
Cheat Sheet Final
7 pages
DATT - Class 05 - Assignment - GR 9
No ratings yet
DATT - Class 05 - Assignment - GR 9
9 pages
Machine Learning
100% (5)
Machine Learning
56 pages
Notebook 4 - Machine Learning
No ratings yet
Notebook 4 - Machine Learning
17 pages
7 K-Means Clustering
No ratings yet
7 K-Means Clustering
27 pages
Supervised Logistic Tutorial Final PDF
No ratings yet
Supervised Logistic Tutorial Final PDF
9 pages
Lecture 3
No ratings yet
Lecture 3
23 pages
Dosis Respuesta R
No ratings yet
Dosis Respuesta R
11 pages
Bi12-019 Bi12-263 LW3
No ratings yet
Bi12-019 Bi12-263 LW3
35 pages
Week-1 NK
No ratings yet
Week-1 NK
5 pages
3.1. Cross-Validation - Evaluating Estimator Performance - Scikit-Learn 1.3.0 Documentation
No ratings yet
3.1. Cross-Validation - Evaluating Estimator Performance - Scikit-Learn 1.3.0 Documentation
12 pages
Data Analysis and Evaluation Methods Comparison
No ratings yet
Data Analysis and Evaluation Methods Comparison
11 pages
ml lab programs 2
No ratings yet
ml lab programs 2
16 pages
ML Assignment (22BCE8086) 2
No ratings yet
ML Assignment (22BCE8086) 2
19 pages
41 Perusse Alexander Aperusse PDF
No ratings yet
41 Perusse Alexander Aperusse PDF
7 pages
IT0089 TB391 Decision Tree RABE
No ratings yet
IT0089 TB391 Decision Tree RABE
6 pages
ISYE 6501 Georgia Tech Hmwk3.1a
No ratings yet
ISYE 6501 Georgia Tech Hmwk3.1a
4 pages
SET3065_group9_A6
No ratings yet
SET3065_group9_A6
4 pages
Linear Regression With Pytroch
No ratings yet
Linear Regression With Pytroch
13 pages
Carlos_Willis_Problem-Set-4,-Spring-2023
No ratings yet
Carlos_Willis_Problem-Set-4,-Spring-2023
16 pages
Linear Regression with Multiple Covariates
From Everand
Linear Regression with Multiple Covariates
Brett Kottmann
No ratings yet
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Steps in Solving The Mode of The Grouped Data
No ratings yet
Steps in Solving The Mode of The Grouped Data
3 pages
Introduction To Forecasting Techniques
No ratings yet
Introduction To Forecasting Techniques
30 pages
Lecture Notes WI3411TU Financial Time Series - 2021
No ratings yet
Lecture Notes WI3411TU Financial Time Series - 2021
107 pages
Uncertainty Analysis of Constant Amplitude Fatigue Test Data Employing The Six Parameters Random Fatigue Limit Model
No ratings yet
Uncertainty Analysis of Constant Amplitude Fatigue Test Data Employing The Six Parameters Random Fatigue Limit Model
8 pages
Sales & Distribution Managment - Coke India-: Mba (Ib) 7 Trimester - IIFT, Delhi
No ratings yet
Sales & Distribution Managment - Coke India-: Mba (Ib) 7 Trimester - IIFT, Delhi
27 pages
Wilcoxon Signed Rank Test Critical Values Table
No ratings yet
Wilcoxon Signed Rank Test Critical Values Table
1 page
Multi Kol
No ratings yet
Multi Kol
44 pages
II Puc Statistics Practice Papers
No ratings yet
II Puc Statistics Practice Papers
9 pages
Standard Deviation
No ratings yet
Standard Deviation
13 pages
Regression Paper
No ratings yet
Regression Paper
12 pages
Mastering-Inferential-Statistics-Worksheet
No ratings yet
Mastering-Inferential-Statistics-Worksheet
6 pages
Business Analytics-I Project Synopsis
No ratings yet
Business Analytics-I Project Synopsis
8 pages
SQC Syllabus
No ratings yet
SQC Syllabus
2 pages
Time Series Analysis Forecasting and Control 4th Edition George E.P. Box instant download
100% (2)
Time Series Analysis Forecasting and Control 4th Edition George E.P. Box instant download
60 pages
Standard Deviation and Its Link To Strength of Concrete
No ratings yet
Standard Deviation and Its Link To Strength of Concrete
1 page
Variable Charts
No ratings yet
Variable Charts
18 pages
Kolmogorov-Smirnov Test For Normality
No ratings yet
Kolmogorov-Smirnov Test For Normality
16 pages
DOC-20250304-WA0000.
No ratings yet
DOC-20250304-WA0000.
18 pages
IPS (Points and Interval Estimate)
No ratings yet
IPS (Points and Interval Estimate)
23 pages
Group 5: Glendell Atesora Estelle Nica Marie Dunlao Johny Sevilla Jeoffrey Casipong Elny Rose Vio Zweetsel Señeres
No ratings yet
Group 5: Glendell Atesora Estelle Nica Marie Dunlao Johny Sevilla Jeoffrey Casipong Elny Rose Vio Zweetsel Señeres
45 pages
Cpk
No ratings yet
Cpk
1 page
34 Time Series Analysis for Mustard Production, Productivity, And Area Forecasting in Madhya Pradesh, India
No ratings yet
34 Time Series Analysis for Mustard Production, Productivity, And Area Forecasting in Madhya Pradesh, India
7 pages
bsta450-ASSIGNMENT 5
No ratings yet
bsta450-ASSIGNMENT 5
2 pages
Cronbachs Alpha
No ratings yet
Cronbachs Alpha
16 pages
Data Analysis and Graphics Using R-An Example Based Approach
No ratings yet
Data Analysis and Graphics Using R-An Example Based Approach
22 pages
The Chi Square Test
No ratings yet
The Chi Square Test
10 pages
Difference in Difference Models
No ratings yet
Difference in Difference Models
30 pages
Descriptives: Notes
No ratings yet
Descriptives: Notes
39 pages
Grouped and Ungrouped Data
75% (4)
Grouped and Ungrouped Data
13 pages

Lab 2

Uploaded by

Lab 2

Uploaded by

Analyzing the Impact of Caloric Intake on Weight Gain: A

Simple Linear Regression Approach

correlation_coefficient <- cor(data$Calories_Consumed,

## Correlation Coefficient: 0.946991

We get a correlation of 0.946991 which determines a strong correlation between calories

The intercept is -625.75. This is the estimated value of “Weight_gained_grams” when

residuals <- residuals(model)

## R-squared Value: 0.896792

Different ways to check the significance if the estimated value via

You might also like