Unit 531 Describing and Assessing The Linear Relationship Between Two Scale Variables Without Answers

Assignment for unit 531

Describing and assessing the linear relationship between income and BMI
Henk van der Kolk

03/08/2022

Goals of this assignment and preparing the data

In this assignment, you will (again) run a simple linear regression analysis. The
data you will use are identical to the data used in unit 530. We use the smaller version of
the dataset, which includes the variable BMI. The data can be treated as a random
sample from the Dutch population. The dataset we will use is called
Health_LISS_Core_Study_Wave_12_2020_data_plus_background_small.sav. The PDF with the
codebook has a similar name. Download the data file and put it in your working
directory. (Install and) load the packages tidyverse (for handling the data and making
plots), haven (for importing SPSS files), and broom (for inspecting regression output in a
nice way using the ‘tidy’ function).
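The preparation step could look roughly like the sketch below. It assumes the data file sits in the current working directory; the object name health is a hypothetical choice, not something the assignment prescribes.

```r
# Packages named in the assignment; report any that still need installing.
pkgs <- c("tidyverse", "haven", "broom")
installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
if (!all(installed)) {
  message("Run install.packages() for: ", paste(pkgs[!installed], collapse = ", "))
}

# File name as given in the assignment; it must be in the working directory.
data_file <- "Health_LISS_Core_Study_Wave_12_2020_data_plus_background_small.sav"

if (all(installed) && file.exists(data_file)) {
  library(tidyverse)
  library(haven)
  library(broom)
  health <- read_sav(data_file)  # 'health' is a hypothetical object name
  glimpse(health)                # quick overview of variables and their types
} else {
  message("Check getwd() and make sure the packages and the data file are in place.")
}
```

If the file is not found, getwd() shows where R is looking and setwd() changes that location.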
With “simple linear regression models” we describe the relationship between one
dependent scale variable (we will sometimes treat variables with 5 values as ratio/scale
variables) and one independent scale variable. You first briefly focus on two variables:
• ch19l021 (To what extent did your physical health or emotional problems hinder
your social activities over the past month?) and

• ch19l022 (To what extent did your physical health or emotional problems hinder
your work over the past month, for instance in your job, the housekeeping, taking
care of the children, doing volunteer work, or in school?)

1. Is it a good choice to take the first as independent and the second as dependent
variable?
**

Studying the relationship between BMI and income: expectations

After inspecting the data, you become interested in the relationship between BMI and
income (we will use the variable nettoink in the data file). Following conventional wisdom,
we expect income to have a negative effect on BMI (poor people more often have a high
BMI). Normally we would theorize extensively about this. This is the ‘story version’ of a
theory. In the story you also discuss why this relationship is expected. For now, ‘keep it
simple’.

2. Draw the theoretical model (a graph) for this ‘conventional wisdom’ prediction (pen
and paper). Use boxes and arrows. Include a positive or a negative sign to indicate
the sign of the effect.

3. Also give the linear equation for the expectation.

4. Do you think the residuals should be part of the equation? Why (not)?

Studying the relationship between BMI and income: data inspection and cleaning
5. Since we will use the income variable (nettoink), you inspect that variable. Create a
histogram of the variable nettoink.
You will notice there is at least one person with a net income of at least 1.5 million a month.
This is either a very rich person OR a coding error :-). Normally you should NOT simply remove outliers.
If this is the sample, that is what it is. However, since we do not have time to check
this data point extensively, we focus on people with a somewhat more reasonable income.
6. Filter out extreme outliers (take 15,000 a month as the cut-off point).

7. After filtering the data, make a scatter plot of the relationship between income and
BMI (outliers with extreme values have also been filtered out of the data file). Think about which
variable you put on the x-axis and which on the y-axis. Include a regression line
through the cloud of data points by adding geom_smooth(method = "lm", se = FALSE).

8. Based on this graph, do you think the relationship is as hypothesized?
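The inspection and cleaning steps can be tried out first on made-up data. The sketch below uses simulated values, NOT the LISS data; the data frame toy and its columns income and bmi are hypothetical stand-ins, and the axis choice shown is just one possibility.

```r
library(tidyverse)

set.seed(531)
# Simulated stand-in data (NOT the LISS data); all names here are hypothetical.
toy <- tibble(
  income = c(runif(200, 0, 6000), 1500000),  # one extreme value, as described in the text
  bmi    = 27 - 0.0003 * pmin(income, 6000) + rnorm(201, sd = 3)
)

# Histogram: a single extreme value squeezes all other cases into one bar.
p_hist <- ggplot(toy, aes(x = income)) +
  geom_histogram(bins = 30)

# Keep only the cases at or below the cut-off.
toy_small <- toy %>%
  filter(income <= 15000)

# Scatter plot with a straight regression line and no confidence band.
p_scatter <- ggplot(toy_small, aes(x = income, y = bmi)) +
  geom_point(alpha = 0.5) +
  geom_smooth(method = "lm", se = FALSE)
```

Typing p_hist or p_scatter at the console displays the plots. Note that filter() also drops rows where the condition is NA, so cases with a missing income disappear in this step as well.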

Studying the relationship between BMI and income: data analysis

Run a simple linear regression using R: adapt the following syntax by filling in the name of the
dataset and the independent and dependent variables. Store the output under the name
‘model’.
model <- name_of_the_dataset %>%
  lm(dependentvariable ~ independentvariable, data = .)

9. Inspect the output using one of the two following commands:

# this requires the broom package
model %>%
  tidy()

# or use
summary(model)
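To see what the output looks like before running the real analysis, the same pattern can be applied to simulated data. This is only a sketch: the data frame toy and its columns are made up, so the numbers say nothing about the actual relationship between income and BMI.

```r
set.seed(531)
# Simulated stand-in data (NOT the LISS data); column names are hypothetical.
toy <- data.frame(income = runif(200, 0, 6000))
toy$bmi <- 27 - 0.0003 * toy$income + rnorm(200, sd = 3)

# Same pattern as the template: dependent ~ independent.
model <- lm(bmi ~ income, data = toy)

summary(model)  # full printed output
coef(model)     # only the two estimates: intercept and slope

# With broom loaded, tidy(model) returns the coefficient table as a data frame
# with the columns term, estimate, std.error, statistic and p.value.
```

Calling lm() directly, as here, gives the same model as piping the dataset into lm() with data = .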

10. What is the intercept? What does this number tell you?

The intercept is: **
11. What is the slope?
The slope is: **
12. What is the sign of the slope? Is that what you expected (theoretically)?
The slope is: **
13. The effect of the income variable (the slope) seems extremely small (the estimate ends in
“e-04”, meaning you have to move the decimal point four places to the left, so the number is VERY
small). Why is it so small?
**
NOTE: Reading the scientific notation (with the “-04” after a number) may be
difficult. The following commands will often simplify things, but make sure you
are able to interpret scientific notation! Check the internet to find out how
scientific notation works.
model %>%
  tidy() %>%
  mutate_if(is.numeric, round, 5) # round every numeric column to 5 decimals
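A small sketch of how R's scientific notation maps onto ordinary decimals. The value of x is a made-up number chosen for illustration, not one taken from the regression output.

```r
x <- 3.2e-04  # hypothetical value: 3.2 times 10 to the power -4

format(x, scientific = FALSE)  # "0.00032"
format(x, scientific = TRUE)   # "3.2e-04"
round(x, 5)                    # 0.00032

# Ask R to prefer fixed notation when printing for the rest of the session.
options(scipen = 999)
print(x)
```

Setting options(scipen = 0) restores the default printing behaviour.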

Assessing the relationship: inference

Let us check whether we can say something about the population. We can use the
‘confidence interval’ approach or the ‘test’ approach. These are just different ways of doing
basically the same thing.
14. What is the meaning of the “std.error” in the output?
**
We now first focus on the confidence interval.
15. The confidence interval of the slope and the intercept can be calculated “by hand” (with a
calculator), using the standard error from the output presented above. Based on that
output, what is the CI?
**
Check your answer, using the following command lines:
confint(model, 'nettoink', level = 0.95) %>%
  as.data.frame() %>%
  mutate_if(is.numeric, round, 6)
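The “by hand” calculation can be mirrored in R and compared with confint(). The sketch below again uses simulated data (NOT the LISS data) with hypothetical column names, so only the procedure carries over, not the numbers.

```r
set.seed(531)
# Simulated stand-in data; column names are hypothetical.
toy <- data.frame(income = runif(200, 0, 6000))
toy$bmi <- 27 - 0.0003 * toy$income + rnorm(200, sd = 3)
model <- lm(bmi ~ income, data = toy)

# Coefficient table as a matrix: Estimate, Std. Error, t value, Pr(>|t|).
coefs <- coef(summary(model))
est <- coefs["income", "Estimate"]
se  <- coefs["income", "Std. Error"]

# Critical t-value for a 95% interval, based on the residual degrees of freedom.
t_crit <- qt(0.975, df = df.residual(model))

ci_by_hand <- c(lower = est - t_crit * se, upper = est + t_crit * se)
ci_by_hand
confint(model, "income", level = 0.95)  # should agree with ci_by_hand
```

In large samples the critical t-value is close to 1.96, which is why a calculator version with “estimate plus or minus 2 standard errors” usually lands very near the exact interval.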

16. Does the 95 percent confidence interval in this case include zero? What does this
mean?

**
A second, similar way to approach this is the ‘testing’ approach (rather than the confidence
interval approach).
17. The effect (the slope) itself does not reveal much. That number depends on the scale
(are we measuring in euros, dollars, or thousands of euros?). We need to ‘standardize’ that
effect, so we can check whether it is very ‘different’ from zero. How do we do that?
In other words, how are the estimate, the standard error and the t-value (here called
‘the statistic’) related?
**
18. What is the t-value in this case?

19. What is the p-value? And what does it mean?

**
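One way to check your own reasoning on the testing approach is to rebuild the ‘statistic’ and the p-value from the estimate and the standard error, and to see what happens when the income scale changes. The sketch uses simulated data (NOT the LISS data); toy, income, income_k and bmi are hypothetical names.

```r
set.seed(531)
# Simulated stand-in data; column names are hypothetical.
toy <- data.frame(income = runif(200, 0, 6000))
toy$bmi <- 27 - 0.0003 * toy$income + rnorm(200, sd = 3)
model <- lm(bmi ~ income, data = toy)

coefs <- coef(summary(model))
est <- coefs["income", "Estimate"]
se  <- coefs["income", "Std. Error"]

# Rebuild the t-value and the two-sided p-value from estimate and standard error.
t_value <- est / se
p_value <- 2 * pt(abs(t_value), df = df.residual(model), lower.tail = FALSE)

c(by_hand = t_value, from_output = coefs["income", "t value"])
c(by_hand = p_value, from_output = coefs["income", "Pr(>|t|)"])

# Same model with income measured in thousands instead of single units.
toy$income_k <- toy$income / 1000
coefs_k <- coef(summary(lm(bmi ~ income_k, data = toy)))

coefs_k["income_k", "Estimate"] / est  # the slope is 1000 times larger
coefs_k["income_k", "t value"]         # the t-value is unchanged
```

The slope changes with the unit of measurement, while the t-value and p-value stay the same, which is what makes them usable for comparing an effect against zero.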
<< END OF THE ASSIGNMENT>>
