0% found this document useful (0 votes)

4 views4 pages

Week13 Exercise Solutions

The document presents solutions to exercises involving regression analysis on two datasets: 'plantnutrition' and 'height_weight'. It details the process of calculating regression lines, checking for normality of residuals, and predicting values, including confidence intervals for specific nutrient types and heights. The results indicate significant relationships between nutrient type and plant species diversity, as well as between height and weight, with comparisons of prediction intervals highlighting differences in precision based on sample size and data subsets.

Uploaded by

bodurr571

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views4 pages

Week13 Exercise Solutions

Uploaded by

bodurr571

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Week 13 Exercise Solutions

load("Regression.RData")

Exercise 1
In a field experiment, effect of nutrient type of soil on plant species diversity is studied. Researchers want to
know whether increasing nutrient type changes number of plant species, and if it changes, how can plant
species number be predicted from nutrient type.
Use the “plantnutrition” data.frame. Calculate the regression line. Visualize the data and add the regression
line to the plot. Check if residuals are normally distributed. Check if the slope of the regression line is
significant. Calculate the predicted confidence interval of number of plant species when nutrient type is 2.
head(plantnutrition)

## NutrientNo PlantSp
## 1 0 36
## 2 0 36
## 3 0 32
## 4 1 34
## 5 2 33
## 6 3 30
# regression line calculation
reg_result = lm(PlantSp~ NutrientNo, data = plantnutrition)

# graphical representation
plot(PlantSp~NutrientNo, data = plantnutrition, col = "red", pch =19)
abline(reg_result)

1
35
30
PlantSp

25
20

0 1 2 3 4

NutrientNo
# checking residuals for normality
residual_values = reg_result$residuals
shapiro.test(residual_values)

##
## Shapiro-Wilk normality test
##
## data: residual_values
## W = 0.92819, p-value = 0.4303
We can assume that residuals are normally distributed. Hence assumption of regression is met.
# significance of regression line slope
summary(reg_result)

##
## Call:
## lm(formula = PlantSp ~ NutrientNo, data = plantnutrition)
##
## Residuals:
## Min 1Q Median 3Q Max
## -10.771 -1.856 1.068 2.894 5.907
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 34.110 2.599 13.12 1.08e-06 ***
## NutrientNo -3.339 1.098 -3.04 0.0161 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 5.336 on 8 degrees of freedom
## Multiple R-squared: 0.536, Adjusted R-squared: 0.478
## F-statistic: 9.241 on 1 and 8 DF, p-value: 0.01607
We tested the null hypothesis that slope = 0, and rejected it with p-value = 0.0161. So, there is a statistically

2
significant linear association between nutrient type and plant species diversity. The relationship is negative
and can be shown with the calculated line equation.
# predicted CI for nutrient type 2
predict(reg_result, data.frame(NutrientNo = 2), interval = "prediction")

## fit lwr upr

## 1 27.4322 14.5167 40.34771

Exercise 2
In the height_weight data.frame, there is made-up data of height and weight measurements of people from
2 different job types. We want to predict weight values from height. Y values for each X are normally
distributed, you do not need to check for assumption of normality.
• Calculate and check significance of the regression line for “O1” only and then for all observations. (We
will calculate 2 distinct models)
• For both of the models, predict weight values that correspond to height values 165 and 175. Compare
the prediction intervals. Are the prediction intervals for height 165 and height 175 intersecting or not?
Are they different in 2 models? What can be a reason for a potential difference?
head(height_weight)

## occupation height weight

## 1 O1 184 85
## 2 O1 184 81
## 3 O1 163 59
## 4 O1 172 64
## 5 O1 182 55
## 6 O1 178 55
# first subset the data for "O1"

row_belongs_to_O1 = height_weight[ ,"occupation"] == "O1"

job1 = height_weight[row_belongs_to_O1, ]

# be careful about the order of variables

# we want to predict weight from height
reg_result_job1 = lm(weight ~ height, data = job1)
summary(reg_result_job1)

##
## Call:
## lm(formula = weight ~ height, data = job1)
##
## Residuals:
## Min 1Q Median 3Q Max
## -20.116 -8.281 3.169 8.063 22.741
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -119.9106 63.1410 -1.899 0.07370 .
## height 1.0712 0.3589 2.984 0.00795 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 11.88 on 18 degrees of freedom

3
## Multiple R-squared: 0.331, Adjusted R-squared: 0.2938
## F-statistic: 8.906 on 1 and 18 DF, p-value: 0.007953
P-value is 0.00795, null hypothesis (slope of regression line = 0) is rejected. Height values can be used to
predict weight values.
# same analysis with all individuals in the data set
reg_result_all = lm(weight ~ height, data = height_weight)
summary(reg_result_all)

##
## Call:
## lm(formula = weight ~ height, data = height_weight)
##
## Residuals:
## Min 1Q Median 3Q Max
## -18.2602 -6.3568 -0.5929 8.4346 25.9064
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -37.9695 34.4587 -1.102 0.27744
## height 0.6112 0.2010 3.040 0.00427 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 10.31 on 38 degrees of freedom
## Multiple R-squared: 0.1956, Adjusted R-squared: 0.1745
## F-statistic: 9.243 on 1 and 38 DF, p-value: 0.004266
P-value is 0.00427, we can make the same conclusion as before.
Let us compare predictions of the weight values that correspond to height values 165 and 175.
predict(reg_result_job1, data.frame(height = c(165, 175)), interval = "prediction")

## fit lwr upr

## 1 56.83477 30.00139 83.66815
## 2 67.54661 41.96087 93.13235
predict(reg_result_all, data.frame(height = c(165, 175)), interval = "prediction")

## fit lwr upr

## 1 62.87058 41.58631 84.15485
## 2 68.98210 47.79336 90.17084
By inspecting the prediction intervals, we can see that prediction of weight values from the analysis with all
individuals has less overlap for different height values, in comparison to the prediction of the analysis with
individuals from job1 only. So reg_result_all provides more precise predictions. We can expect this, because
reg_result_all have a higher significance (lower p-value). The fact that it has greater sample size plays a role
in this. But be careful, greater sample size doesn’t necessarily provide higher significance. If the correlation
in people from job2 would be much lower, the analysis that use all individuals could have lower significance.

Clean Disruption Seba
100% (4)
Clean Disruption Seba
52 pages
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
From Everand
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
Joseph George Caldwell
No ratings yet
SBA #5 and #6 Guide
No ratings yet
SBA #5 and #6 Guide
7 pages
Maths Formulae - SSC CGL
0% (1)
Maths Formulae - SSC CGL
7 pages
HW4 Solutions: Problem 6.2
No ratings yet
HW4 Solutions: Problem 6.2
8 pages
Regression in R
No ratings yet
Regression in R
40 pages
Weatherwax Weisberg Solutions
No ratings yet
Weatherwax Weisberg Solutions
162 pages
R Code
No ratings yet
R Code
3 pages
Lab-5-1-Regression and Multiple Regression
100% (2)
Lab-5-1-Regression and Multiple Regression
8 pages
Project of Biostatistics#02-RaeesaAli-MS - BIOTECH
No ratings yet
Project of Biostatistics#02-RaeesaAli-MS - BIOTECH
27 pages
Seu Ds610 Mod03
No ratings yet
Seu Ds610 Mod03
45 pages
Which Test When: 1 Exploratory Tests
No ratings yet
Which Test When: 1 Exploratory Tests
5 pages
Math 2275 Assignment 5
No ratings yet
Math 2275 Assignment 5
15 pages
Linear Regression
100% (2)
Linear Regression
228 pages
ECON20003 S1 2024 Sample Exam
No ratings yet
ECON20003 S1 2024 Sample Exam
27 pages
jl1DPGEQRai25HgJgc3J - Simple Linear Regression
No ratings yet
jl1DPGEQRai25HgJgc3J - Simple Linear Regression
7 pages
R Practice
No ratings yet
R Practice
38 pages
HW1 Solution Fall2024
No ratings yet
HW1 Solution Fall2024
11 pages
Lecture-5 2
No ratings yet
Lecture-5 2
51 pages
Regression Analysis Assignment1111
No ratings yet
Regression Analysis Assignment1111
13 pages
Midterm2021R1 Sol PDF
No ratings yet
Midterm2021R1 Sol PDF
13 pages
Linear Regression
No ratings yet
Linear Regression
13 pages
R Code Default Data PDF
No ratings yet
R Code Default Data PDF
10 pages
Assingment6 512
No ratings yet
Assingment6 512
6 pages
Lab 10 Forest Regression
No ratings yet
Lab 10 Forest Regression
5 pages
Final Predictive Vaibhav 2020
No ratings yet
Final Predictive Vaibhav 2020
101 pages
Multicollinearity and Oaxaca - Tutorial
No ratings yet
Multicollinearity and Oaxaca - Tutorial
35 pages
WEEK
No ratings yet
WEEK
17 pages
HW3 Solutions - Stats 500: Problem 1
No ratings yet
HW3 Solutions - Stats 500: Problem 1
4 pages
BES - R Lab 9
No ratings yet
BES - R Lab 9
7 pages
R - Program
No ratings yet
R - Program
5 pages
R Lab 4
No ratings yet
R Lab 4
7 pages
A1
No ratings yet
A1
8 pages
Lab-9 RMD
No ratings yet
Lab-9 RMD
5 pages
How To Use "Qqplot": X: Independent Variable, Y: Dependent Variable
No ratings yet
How To Use "Qqplot": X: Independent Variable, Y: Dependent Variable
6 pages
Lab 5 LR
No ratings yet
Lab 5 LR
9 pages
W3 - Testing Means - Choose Your Test
No ratings yet
W3 - Testing Means - Choose Your Test
7 pages
Solutions Week 10
No ratings yet
Solutions Week 10
7 pages
BDA Exp7 Removed
No ratings yet
BDA Exp7 Removed
4 pages
36-401 Modern Regression HW #6 Solutions: Problem 1 (32 Points)
No ratings yet
36-401 Modern Regression HW #6 Solutions: Problem 1 (32 Points)
26 pages
D Linear Regression With R
No ratings yet
D Linear Regression With R
9 pages
Maths Lab
No ratings yet
Maths Lab
17 pages
Assignment 01 Nipun Goyal Jinye Lu
No ratings yet
Assignment 01 Nipun Goyal Jinye Lu
12 pages
Assignment 3
No ratings yet
Assignment 3
10 pages
Regression PDF
No ratings yet
Regression PDF
18 pages
Linear Regression Experiment
No ratings yet
Linear Regression Experiment
6 pages
Revision Questions On Regression
No ratings yet
Revision Questions On Regression
9 pages
Regression Modelling 1 Assignment (R)
No ratings yet
Regression Modelling 1 Assignment (R)
7 pages
Regression Modelling Ass
No ratings yet
Regression Modelling Ass
6 pages
SC&RP - Unit 5
No ratings yet
SC&RP - Unit 5
36 pages
Regression Models For Data Science in R
No ratings yet
Regression Models For Data Science in R
137 pages
Problem-Set - 1 Practise Problems From Textbook
No ratings yet
Problem-Set - 1 Practise Problems From Textbook
2 pages
Y F (X, Z) : Regression Statistics
No ratings yet
Y F (X, Z) : Regression Statistics
12 pages
BDA MSC It
No ratings yet
BDA MSC It
35 pages
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
No ratings yet
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
8 pages
Iml Exp. 3
No ratings yet
Iml Exp. 3
4 pages
Exercice V
No ratings yet
Exercice V
5 pages
Notes Book
No ratings yet
Notes Book
39 pages
Using R For Linear Regression
No ratings yet
Using R For Linear Regression
9 pages
3-Applying Multiple Linear Regression
No ratings yet
3-Applying Multiple Linear Regression
5 pages
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
Basic Exercises for Competitive Programming: Python
From Everand
Basic Exercises for Competitive Programming: Python
Jan Pol
No ratings yet
GCSE Maths Revision: Cheeky Revision Shortcuts
From Everand
GCSE Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (2)
Chapter 17
No ratings yet
Chapter 17
56 pages
Chapter 14
No ratings yet
Chapter 14
78 pages
Chapter 16
No ratings yet
Chapter 16
57 pages
Cardiovascular Ceren
No ratings yet
Cardiovascular Ceren
46 pages
Are STEM Syllabi Gendered A Feminist Critical Discourse Analysis
No ratings yet
Are STEM Syllabi Gendered A Feminist Critical Discourse Analysis
17 pages
Example of Measurement Uncertainty Estimation
No ratings yet
Example of Measurement Uncertainty Estimation
7 pages
Daniel Asmus CV
No ratings yet
Daniel Asmus CV
5 pages
The Uveitis - Periodontal Disease Connection in Pregnancy: Controversy Between Myth and Reality
No ratings yet
The Uveitis - Periodontal Disease Connection in Pregnancy: Controversy Between Myth and Reality
5 pages
Listening To Scent An Olfactory Journey With Aromatic Plants and Their Extracts All-in-One Download
100% (13)
Listening To Scent An Olfactory Journey With Aromatic Plants and Their Extracts All-in-One Download
16 pages
DR Pooja Mishra 2
No ratings yet
DR Pooja Mishra 2
27 pages
Compact Hi-Fi Power Amplifier
No ratings yet
Compact Hi-Fi Power Amplifier
2 pages
Deep Learning Concepts
No ratings yet
Deep Learning Concepts
13 pages
Kumpulan Soal Bahasa Inggri Kelas 10
No ratings yet
Kumpulan Soal Bahasa Inggri Kelas 10
8 pages
LM317 3-Terminal Adjustable Regulator: 1 Features 3 Description
No ratings yet
LM317 3-Terminal Adjustable Regulator: 1 Features 3 Description
32 pages
Tagging Fire Extinguisher
No ratings yet
Tagging Fire Extinguisher
5 pages
Review of Bacteriology: BY: Paul Aeron E. Bansil, RMT
No ratings yet
Review of Bacteriology: BY: Paul Aeron E. Bansil, RMT
18 pages
Har Khet Ko Pani
No ratings yet
Har Khet Ko Pani
6 pages
DM Questions Review
No ratings yet
DM Questions Review
5 pages
Regenerative Endodontics: A Way Forward
No ratings yet
Regenerative Endodontics: A Way Forward
9 pages
Owners: Workshop
No ratings yet
Owners: Workshop
192 pages
Assignment Brief UBGMLU-15-2 Presentation 23-24 Final
No ratings yet
Assignment Brief UBGMLU-15-2 Presentation 23-24 Final
4 pages
Usage of The Fly Ash in Hot Asphalt Mixes: Ivica, Androjić, Mag - Ing.aedif., Osijek - Koteks D.D., Croatia
No ratings yet
Usage of The Fly Ash in Hot Asphalt Mixes: Ivica, Androjić, Mag - Ing.aedif., Osijek - Koteks D.D., Croatia
10 pages
Vibrations Practical Workbook Lab 1
No ratings yet
Vibrations Practical Workbook Lab 1
8 pages
Azolla
No ratings yet
Azolla
5 pages
Mbbs Curriculum Muhs
No ratings yet
Mbbs Curriculum Muhs
70 pages
Conductors in Electric Field and Current Densities
No ratings yet
Conductors in Electric Field and Current Densities
3 pages
Adebukola's Research Project 2024 Edit
No ratings yet
Adebukola's Research Project 2024 Edit
48 pages
Bill of Materials Power Amplifier Elciricuit PDF
No ratings yet
Bill of Materials Power Amplifier Elciricuit PDF
2 pages
Femoral Triangle Objectives Questions
No ratings yet
Femoral Triangle Objectives Questions
14 pages
Infant Incubator Service Manual
No ratings yet
Infant Incubator Service Manual
62 pages
Schimmel Deciphering Signs
No ratings yet
Schimmel Deciphering Signs
287 pages
Crio 9068 User Manaul
No ratings yet
Crio 9068 User Manaul
24 pages

Week13 Exercise Solutions

Uploaded by

Week13 Exercise Solutions

Uploaded by

Week 13 Exercise Solutions

## fit lwr upr

## occupation height weight

row_belongs_to_O1 = height_weight[ ,"occupation"] == "O1"

# be careful about the order of variables

## fit lwr upr

## fit lwr upr

You might also like