0% found this document useful (0 votes)

12 views

Exercises

The document contains 6 challenges involving creating and manipulating vectors, matrices, arrays, and data frames in R. Challenge I involves creating vectors for height and weight data and answering questions about averages, variances, standard deviations, and extracting elements. Challenge II involves creating a matrix from a table and answering questions using matrix operations. Challenge III represents a table as an array and answers questions using array operations. Challenge IV creates a list containing risk ratio results from previous challenges. Challenge V creates a data frame from vectors and extracts a row. Challenge VI identifies errors in sample R code.

Uploaded by

Reetika Choudhury

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

Exercises

Uploaded by

Reetika Choudhury

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Challenges:

Day - 1 (Session - 1)

Challenge - I

Create vectors named height and weight using the following data:
height : 160.3, 134.2, 159, 149, 145, and 147.1
weight : 83.8, 37.2, 71.7, 72.8, 50.5, and 42.9.

i) Based on the above vectors, answer the following questions:

a) The average height is __________

b) The variance of height is __________

c) The SD of height is __________

d) The average weight is __________

e) The variance of weight is __________

f) The SD of weight is __________

ii) Extract the 4th element in weight and height vector

a) 4th element in weight __________

b) 4th element in height __________

iii) Based on the above vector, calculate BMI

a) Calculate BMI __________

b) Extract the 4th element in BMI vector __________

1
Challenge - II

Create a matrix using the following table, and answer the following questions using matrix
operations

a) The total number of smokers __________.

(Hint rowSums(___))

b) The total number of non smokers __________.

(Hint rowSums(___))

A
c) Incidence of CHD among smokers ( A+B )__________.

C
d) Incidence of CHD among non smokers ( C+D )__________.

A/(A+B)
e) Risk ratio of CHD ( C/(C+D) ) __________.

2
Challenge - III

Represent the following table using array, and answer the following using array operations

a) In rural, incidence of CHD among smokers __________.

b) In rural, incidence of CHD among non smokers __________.

c) In rural, what is the risk ratio of CHD __________.

d) In urban, incidence of CHD among smokers __________.

e) In urban, incidence of CHD among non smokers __________.

f) In urban, risk ratio of CHD __________.

3
Challenge - IV

4) Create a list that contains results of overall risk ratio (Challenge II), rural risk ratio
(Challenge IIIc) and urban risk ratio (challenge IIIf)

Challenge - V

5) Write a R program to create a data frame for the following data:

unique_id’s vector C10001, C10002, and C10003;

treatment vector A, B and C;
age vector 29,30 and 28.

Then extract 3rd entire row.

Challenge - VI

Find the error in the following R codes

a) temp <- c(99.4, 102.3; 100.3)

b) Consider mat is a 2X2 matrix. Now, to extract 2nd row 1st column, will this command
mat(2;1) works?

c) hba1c% <- c(16.4, 11.0, 10.3, 12.4)

d) vector <- c(13, 7A, 11, 30)

e) R command to view the last 6 rows of dataframe df is str(df).

4
Day - 1 (Session - 2)

In this hypothetical study, data from 25 individuals have been collected to explore the
relationship between demographic factors, systolic blood pressure, hypertension, and the
effectiveness of two types of drugs, A and B.

Lets work through these questions to undergo the data cleaning process.

1) Import the exercise data from the directory (File name is Exercise_data-Day1.csv)

i) How many variables are there in the datasheet? __________ (Hint ____ %>%
dim())

ii) The datasheet has how many observations? __________

2) Give the variables new names as the following (Hint ___ %>% rename())

i) “Height.in.cms” as height

ii) “Weight.in.kgs” as weight

iii) “Type.of.drug.given” as drugType

3) Give the variables labels as the following (Hint ___ %>% set_variable_labels())

i) height as Height (in Cms)

ii) weight as Weight (in Kgs)

iii) drugType as Type of Drug

5
4) Recode the values of the following variables (Hint ___ %>% recode())

i) Hypertension, Yes=1 and No=0

ii) Gender, Male=1 and Female=2

5) Assign value labels to the following (Hint ___ %>% set_value_labels())

i) In Hypertension 0 as “NO” and 1 as “YES”

ii) In Gender 1 as “Male” and 2 as “Female”

6) How many people participated in the study from urban? __________ (Hint ___
%>% filter( ))

7) How many individuals took drug A? __________ (Hint ___ %>% filter( ))

8) How many individuals took drug B? __________ (Hint ___ %>% filter( )

6
9) Find the duplicates. How many pairs that are the same did you find?__________

(Hint ___ %>% filter(duplicated(-----))

10) Find the missing data for the variable Systolic Blood pressure (mmHg). (Hint
filter(is.na(-----)))

How many missing values were discovered?__________

11) Identify the outliers in Systolic Blood Pressure (mmHg). (Hint use the range
80-160)

How many outliers were found? __________

12) Prepare summary table by drug type for diastolic blood pressure with count, mean
and median, and SD (Hint ___ %>% group_by(___) %>% summarise(___))

i) What is the mean diastolic blood pressure for A __________

ii) What is the median diastolic blood pressure for B __________

7
Day - 2 (Session - 1)

Let us create some data visualizations to understand how drug is effective in treatment
of blood pressure, and see if there are any baseline differences, and differences in outcomes
- hypertension, systolic and diastolic BP.

Import exercise data (Exercise_data-Day2.csv) from the directory.

1. Use the ggplot2 package to plot the bar graph for hypertension response (Univariate
bar graph). Which response has the most frequency? __________

2. Could you add drug type in the bar chart for hypertension? (Bivariate grouped bar
chart). How many people who indicated they had hypertension also took drug A?
__________

(Hint ggplot(aes(x=,y=), fill=))

3. Could you now add the dwelling type to the previous bar graph. In bar graph, to
include the location use facet_wrap() function. What type of distribution does the
graph looks like in large city? __________

(Hint facet_wrap(~____))

4. Draw a density chart for systolic blood pressure (Univariate chart). What type of
distribution does the graph looks like? __________

a) Right skewed-distribution

b) Left skewed-distribution

c) Normal distribution

d) Uniform distribution

8
5. Create a box plot to represent systolic blood pressure by drug type (Bivariate box
plot). What is the median blood pressure for both drug type? __________

6. Using facet_wrap(), add the type of dwelling to the previous graph. Which sort of
dwelling has the highest blood pressure when using drug B? __________

7. Use a scatter chart to plot the graph for systolic and diastolic pressure (Bivariate
graph).

What is the relationship between systolic and diastolic blood pressure? __________

a) No association

b) Positive association

c) Negative association

9
Day - 2 (Session - 2)

Create summary tables for the following conditions. Then, fill in the blanks.

Import exercise data (Exercise_data-Day2.csv) from the directory.

1) Prepare summary statistics for the variables, sex, dwelling, drugType.

Variable n(%)
Gender

- Male __________

- Female __________

Location

- Small city __________

- Large city __________

- Town __________

Drug type

- Type A __________

- Type B __________

10
2) Prepare summary statistics for the following variables by type of drug, sex, dwelling,
hyper. Include statistical tests.

Variable Type A Type B p-value

Gender __________

- Male

- Female

Location __________

- Small city

- Large city

- Town

Hypertension __________

- Yes

- No __________ __________

3) Prepare the summary statistics for the numerical vectors systolic and diastolic blood
pressure by drug type. Include statistical tests.

Variable Type A Type B p-value

Systolic BP __

Diastolic BP __

ACLS Review Test
60% (5)
ACLS Review Test
5 pages
Textbook Practice Problems 1
No ratings yet
Textbook Practice Problems 1
39 pages
Vital Signs
90% (20)
Vital Signs
21 pages
Nadar, Sunil - Lip, Gregory - Hypertension (Oxford Cardiology Library) 3E-Oxford University Press, Incorporated (2022)
No ratings yet
Nadar, Sunil - Lip, Gregory - Hypertension (Oxford Cardiology Library) 3E-Oxford University Press, Incorporated (2022)
285 pages
Cookeville Regional Medical Center Severe Sepsis/Septic Shock Clinical Pathway
No ratings yet
Cookeville Regional Medical Center Severe Sepsis/Septic Shock Clinical Pathway
4 pages
Anesthesia For Open Abdominal Aortic Surgery
No ratings yet
Anesthesia For Open Abdominal Aortic Surgery
18 pages
Computer Lab 1 MM
No ratings yet
Computer Lab 1 MM
26 pages
q3 Stat2100 Bautista-Lhuriely
No ratings yet
q3 Stat2100 Bautista-Lhuriely
11 pages
Computer Lab 3 MM
No ratings yet
Computer Lab 3 MM
38 pages
Q3 - Stat2100 Dupol Melkiancaesar
No ratings yet
Q3 - Stat2100 Dupol Melkiancaesar
12 pages
Assignment# 06
No ratings yet
Assignment# 06
16 pages
STAT501 Online - HW2R - Spring2024
No ratings yet
STAT501 Online - HW2R - Spring2024
7 pages
IntroR 2
No ratings yet
IntroR 2
18 pages
ProbList5-24-Sln
No ratings yet
ProbList5-24-Sln
9 pages
Assignment Day1 - Alpesh Panchal
No ratings yet
Assignment Day1 - Alpesh Panchal
5 pages
HWK3_324
No ratings yet
HWK3_324
9 pages
CODE.project
No ratings yet
CODE.project
42 pages
Assignment - 1: Data Analytics and R
No ratings yet
Assignment - 1: Data Analytics and R
4 pages
Stroke Prediction Dataset
No ratings yet
Stroke Prediction Dataset
48 pages
R
No ratings yet
R
4 pages
2300 FT Assignment 3 Answer Key
No ratings yet
2300 FT Assignment 3 Answer Key
14 pages
Assignment 3 (Recoded)
No ratings yet
Assignment 3 (Recoded)
1 page
Prog Assignment 3
No ratings yet
Prog Assignment 3
10 pages
Sega
No ratings yet
Sega
5 pages
Medidas de Tendencia Central 2020 PDF
No ratings yet
Medidas de Tendencia Central 2020 PDF
26 pages
ACMT 311 Assignment
No ratings yet
ACMT 311 Assignment
6 pages
10 A 5
No ratings yet
10 A 5
1 page
R Practice
No ratings yet
R Practice
38 pages
Unit 2 Assignment SKELETON R spr18
No ratings yet
Unit 2 Assignment SKELETON R spr18
12 pages
ALY6015 Final Project Report
No ratings yet
ALY6015 Final Project Report
19 pages
Model Building Using Healthcare Dataset
No ratings yet
Model Building Using Healthcare Dataset
19 pages
Assignment 1-Data Science
No ratings yet
Assignment 1-Data Science
8 pages
Group Assignment
No ratings yet
Group Assignment
3 pages
Lab 1 Manual - Introduction to R
No ratings yet
Lab 1 Manual - Introduction to R
7 pages
R Programming
No ratings yet
R Programming
47 pages
HW4_solution_Fall_2024
No ratings yet
HW4_solution_Fall_2024
21 pages
Project of Biostatistics#02-RaeesaAli-MS - BIOTECH
No ratings yet
Project of Biostatistics#02-RaeesaAli-MS - BIOTECH
27 pages
Full Download Solutions Manual to Advanced Regression Models with SAS and R 1st Edition Olga Korosteleva PDF DOCX
100% (4)
Full Download Solutions Manual to Advanced Regression Models with SAS and R 1st Edition Olga Korosteleva PDF DOCX
55 pages
STAT-2450 Assignment 1: Name:, Student ID: B00
No ratings yet
STAT-2450 Assignment 1: Name:, Student ID: B00
9 pages
Lab 1 - Introduction To Data
No ratings yet
Lab 1 - Introduction To Data
7 pages
BM-1, Applied Statistics, Lesson 2: Comparing Two Groups (And One Group)
No ratings yet
BM-1, Applied Statistics, Lesson 2: Comparing Two Groups (And One Group)
39 pages
Slide PS
No ratings yet
Slide PS
74 pages
Programming Assignment 3
No ratings yet
Programming Assignment 3
5 pages
Solutions Manual to Advanced Regression Models with SAS and R 1st Edition Olga Korosteleva download
100% (4)
Solutions Manual to Advanced Regression Models with SAS and R 1st Edition Olga Korosteleva download
65 pages
Stata All Command (Jahidul)
No ratings yet
Stata All Command (Jahidul)
13 pages
Sta 226
No ratings yet
Sta 226
5 pages
A1
No ratings yet
A1
8 pages
Workshop Activity: X Seq y Length
No ratings yet
Workshop Activity: X Seq y Length
3 pages
CS1B044
No ratings yet
CS1B044
6 pages
Lab4 2021
No ratings yet
Lab4 2021
2 pages
Biostatistics II. Final Assignment
No ratings yet
Biostatistics II. Final Assignment
3 pages
CodeToPrepareData
No ratings yet
CodeToPrepareData
2 pages
Regression Analysis Assignment1111
No ratings yet
Regression Analysis Assignment1111
13 pages
Macroeconomics (Econ 3033) Department of Statistics: Group Assignment
No ratings yet
Macroeconomics (Econ 3033) Department of Statistics: Group Assignment
27 pages
R Studio Question and Answers
No ratings yet
R Studio Question and Answers
6 pages
Pima Indians Diabetes Database Analysis - Kaggle
No ratings yet
Pima Indians Diabetes Database Analysis - Kaggle
37 pages
Appendix: Answers To Selected Exercises: /user
No ratings yet
Appendix: Answers To Selected Exercises: /user
8 pages
R Programming Exercice
No ratings yet
R Programming Exercice
5 pages
Ex Day1
No ratings yet
Ex Day1
9 pages
Assignment_STAT5002
No ratings yet
Assignment_STAT5002
5 pages
STA215 Test2 F 2014 A Solutions
No ratings yet
STA215 Test2 F 2014 A Solutions
8 pages
Heart Disease Risk Factor Data Analysis Midterm Data 2 - Jupyter Notebook
No ratings yet
Heart Disease Risk Factor Data Analysis Midterm Data 2 - Jupyter Notebook
20 pages
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Co-Clustering: Models, Algorithms and Applications
From Everand
Co-Clustering: Models, Algorithms and Applications
Gérard Govaert
No ratings yet
Fifteen Years Experience With Finger Arterial Pressure Monitoring
No ratings yet
Fifteen Years Experience With Finger Arterial Pressure Monitoring
12 pages
Icu One Pager Iabp
No ratings yet
Icu One Pager Iabp
1 page
BCIS
No ratings yet
BCIS
6 pages
Chapter 6 Lecture 3 of 3
No ratings yet
Chapter 6 Lecture 3 of 3
30 pages
0104 1169 Rlae 28 E3369
No ratings yet
0104 1169 Rlae 28 E3369
10 pages
Question Text: Not Yet Answered
No ratings yet
Question Text: Not Yet Answered
11 pages
O Gorman Et Al 2016 The Use of Ultrasound and Other Markers For Early Detection of Preeclampsia
No ratings yet
O Gorman Et Al 2016 The Use of Ultrasound and Other Markers For Early Detection of Preeclampsia
9 pages
Pharmacology Presentation
No ratings yet
Pharmacology Presentation
21 pages
Strategies in Answering Reading Comprehension: Vocabulary Questions. The First Type Asks You To Answer Questions
No ratings yet
Strategies in Answering Reading Comprehension: Vocabulary Questions. The First Type Asks You To Answer Questions
9 pages
HYPERTENSION-WPS Office
No ratings yet
HYPERTENSION-WPS Office
4 pages
Manual Omron Healthcare HEM-773AC
No ratings yet
Manual Omron Healthcare HEM-773AC
20 pages
Air Medical Service (15 July 1921)
No ratings yet
Air Medical Service (15 July 1921)
140 pages
NCP
No ratings yet
NCP
8 pages
Penerapan Anjuran Diet Dash Dibandingkan Diet Rendah Garam
No ratings yet
Penerapan Anjuran Diet Dash Dibandingkan Diet Rendah Garam
12 pages
Chapter 2: Role and Professional Ethics of Care Workers
No ratings yet
Chapter 2: Role and Professional Ethics of Care Workers
55 pages
AEMT - Trauma Exam Practice
No ratings yet
AEMT - Trauma Exam Practice
26 pages
Bates Physical Examination RED NOTES
100% (1)
Bates Physical Examination RED NOTES
2 pages
1st Level Assessment
No ratings yet
1st Level Assessment
15 pages
Grsmu - Byfilesfileuniversitycafedrynevrologiifileslfklin Protokol PDF
No ratings yet
Grsmu - Byfilesfileuniversitycafedrynevrologiifileslfklin Protokol PDF
197 pages
Laboratory Exercise #2
No ratings yet
Laboratory Exercise #2
6 pages
Life Processes X
No ratings yet
Life Processes X
124 pages
Corasore 2
No ratings yet
Corasore 2
42 pages
Hemodynamic Monitoring
No ratings yet
Hemodynamic Monitoring
65 pages
Pe103-Course-Guide-And-Module-Sem. 2022-2023
No ratings yet
Pe103-Course-Guide-And-Module-Sem. 2022-2023
55 pages
Recognise Critical Illness
No ratings yet
Recognise Critical Illness
4 pages

Exercises

Uploaded by

Exercises

Uploaded by

Challenges:

i) Based on the above vectors, answer the following questions:

a) The average height is __________

b) The variance of height is __________

c) The SD of height is __________

d) The average weight is __________

e) The variance of weight is __________

f) The SD of weight is __________

ii) Extract the 4th element in weight and height vector

a) 4th element in weight __________

b) 4th element in height __________

iii) Based on the above vector, calculate BMI

a) Calculate BMI __________

b) Extract the 4th element in BMI vector __________

a) The total number of smokers __________.

b) The total number of non smokers __________.

a) In rural, incidence of CHD among smokers __________.

b) In rural, incidence of CHD among non smokers __________.

c) In rural, what is the risk ratio of CHD __________.

d) In urban, incidence of CHD among smokers __________.

e) In urban, incidence of CHD among non smokers __________.

f) In urban, risk ratio of CHD __________.

5) Write a R program to create a data frame for the following data:

unique_id’s vector C10001, C10002, and C10003;

Then extract 3rd entire row.

Find the error in the following R codes

a) temp <- c(99.4, 102.3; 100.3)

c) hba1c% <- c(16.4, 11.0, 10.3, 12.4)

d) vector <- c(13, 7A, 11, 30)

e) R command to view the last 6 rows of dataframe df is str(df).

ii) The datasheet has how many observations? __________

ii) “Weight.in.kgs” as weight

iii) “Type.of.drug.given” as drugType

i) height as Height (in Cms)

ii) weight as Weight (in Kgs)

iii) drugType as Type of Drug

i) Hypertension, Yes=1 and No=0

ii) Gender, Male=1 and Female=2

5) Assign value labels to the following (Hint ___ %>% set_value_labels())

i) In Hypertension 0 as “NO” and 1 as “YES”

ii) In Gender 1 as “Male” and 2 as “Female”

(Hint ___ %>% filter(duplicated(-----))

How many missing values were discovered?__________

How many outliers were found? __________

i) What is the mean diastolic blood pressure for A __________

ii) What is the median diastolic blood pressure for B __________

Import exercise data (Exercise_data-Day2.csv) from the directory.

(Hint ggplot(aes(x=____,y=____), fill=____))

Import exercise data (Exercise_data-Day2.csv) from the directory.

1) Prepare summary statistics for the variables, sex, dwelling, drugType.

- Small city __________

- Large city __________

Variable Type A Type B p-value

- Male __________ __________

- Female __________ __________

- Small city __________ __________

- Large city __________ __________

- Town __________ __________

- Yes __________ __________

Variable Type A Type B p-value

Systolic BP __________ __________ __________

Diastolic BP __________ __________ __________

You might also like

(Hint ggplot(aes(x=,y=), fill=))

- Male

- Female

- Small city

- Large city

- Town

- Yes

Systolic BP __

Diastolic BP __