4 Assignment 4 - Unit 1 - Internet Privacy Poll
IMPORTANT NOTE: This problem is optional, and will not count towards your grade. We have
created this problem to give you extra practice with the topics covered in this unit.
Kentucky
Missouri
Ohio
Pennsylvania
Which was the state in the South census region with the largest number of interviewees?
Answer: Texas
Explanation
From table(poll$State, poll$Region), we can identify the census region of a particular state by looking at the
region associated with all of its interviewees. Colorado is in the West region, Kentucky is in the South region,
and Pennsylvania is in the Northeast region; the other three states are all in the Midwest region.
The same table shows that Texas is the state in the South region with the largest number of
interviewees, 72.
Another way to approach these problems would have been to subset the data frame and then use table on the
limited data frame. For instance, to find which states are in the Midwest region we could have used:
MidwestInterviewees = subset(poll, Region=="Midwest")
table(MidwestInterviewees$State)
and to find the number of interviewees from each South region state we could have used:
SouthInterviewees = subset(poll, Region=="South")
table(SouthInterviewees$State)
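Both approaches can be tried on a small toy data frame. The sketch below is illustrative only: the rows are invented and are not the actual poll data, and the names toy, south, south_counts and top_state are hypothetical. Only the column names State and Region come from the assignment.

```r
# Hypothetical toy data; NOT the real poll data.
toy <- data.frame(
  State  = c("Texas", "Texas", "Texas", "Kentucky", "Ohio", "Ohio", "Missouri"),
  Region = c("South", "South", "South", "South", "Midwest", "Midwest", "Midwest"),
  stringsAsFactors = TRUE
)

# Cross-tabulate State against Region, as in table(poll$State, poll$Region).
table(toy$State, toy$Region)

# Restrict to one region, then count interviewees per state.
# droplevels() removes states left with a zero count after subsetting
# (this only matters when State is stored as a factor).
south <- subset(toy, Region == "South")
south_counts <- table(droplevels(south$State))

# Sort so the state with the most interviewees comes first.
sort(south_counts, decreasing = TRUE)

# Name of the state with the largest count.
top_state <- names(which.max(south_counts))
top_state
```

Sorting the counts avoids scanning a long table by eye, which is convenient when a region contains many states.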
Important: For all remaining questions in this assignment please use the limited data frame you created in
Problem 2.3.
Sex
Age
State
Region
Conservativeness
Info.On.Internet
Worry.About.Info
Privacy.Importance
Anonymity.Possible
Tried.Masking.Identity
Privacy.Laws.Effective
Explanation
You can read the number of missing values for each variable from summary(limited).
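As a possible cross-check on the summary output, missing values can also be counted directly. This is a minimal sketch on invented data: the toy data frame and the name na_counts are hypothetical, and the values are not those of the real limited data frame.

```r
# Hypothetical toy data with some missing values; NOT the real 'limited' data frame.
toy <- data.frame(
  Age = c(25, NA, 61, 40),
  Worry.About.Info = c(1, 0, NA, NA)
)

# summary() lists the number of missing values per column under "NA's".
summary(toy)

# A more direct count: is.na() marks missing cells, colSums() totals them per column.
na_counts <- colSums(is.na(toy))
na_counts
```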
Explanation
From hist(limited$Age), we see the histogram peaks at around 60 years old.
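If reading the peak off the plot feels imprecise, the bin counts can be inspected directly. The sketch below uses invented ages rather than the poll data; the names ages, h and peak_bin, and the choice of 10-year bins, are assumptions made for illustration.

```r
# Hypothetical ages; NOT the real poll data.
ages <- c(22, 35, 41, 58, 59, 61, 62, 63, 64, 77)

# Compute the histogram without drawing it, using 10-year bins.
h <- hist(ages, breaks = seq(20, 80, by = 10), plot = FALSE)

# Index of the bin with the highest count, and that bin's boundaries.
peak_bin <- which.max(h$counts)
c(lower = h$breaks[peak_bin], upper = h$breaks[peak_bin + 1])
```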
Explanation
By running the command jitter(c(1, 2, 3)) multiple times, we can see that the jitter function randomly adds or
subtracts a small value from each number, and two runs will yield different results.
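The behaviour described here can be checked with a short experiment. This is a sketch only; the variable names are hypothetical, and set.seed is used purely to show that the noise comes from R's random number generator.

```r
x <- c(1, 2, 3)

# Two separate calls perturb the values by different random amounts.
first  <- jitter(x)
second <- jitter(x)
first
second

# Fixing the seed before each call makes the noise reproducible.
set.seed(1); a <- jitter(x)
set.seed(1); b <- jitter(x)
identical(a, b)
```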
Older age seems strongly associated with a smaller value for Info.On.Internet
Explanation
For younger people aged 18-30, the average value of Info.On.Internet appears to be roughly 5, while most
people aged 60 and older have a value less than 5. Therefore, older age appears to be associated with a smaller
value of Info.On.Internet, but from the spread of dots on the image, it's clear the association is not particularly
strong.
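One way to back up this visual reading is to compare group averages alongside the jittered scatter plot. The sketch below runs on invented values, not the real limited data frame; the names toy, young_mean, older_mean and r are hypothetical, and the correlation is only a rough summary of the association.

```r
# Hypothetical toy data; NOT the real 'limited' data frame.
toy <- data.frame(
  Age = c(19, 24, 28, 45, 52, 63, 70, 81),
  Info.On.Internet = c(6, 5, 4, 4, 3, 2, 3, 1)
)

# Jitter both axes so that overlapping integer-valued points become visible.
plot(jitter(toy$Age), jitter(toy$Info.On.Internet),
     xlab = "Age", ylab = "Info.On.Internet")

# Average value in the two age bands discussed in the explanation.
young_mean <- mean(toy$Info.On.Internet[toy$Age >= 18 & toy$Age <= 30])
older_mean <- mean(toy$Info.On.Internet[toy$Age >= 60])

# A negative correlation indicates that values tend to fall as age rises.
r <- cor(toy$Age, toy$Info.On.Internet)
```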
© 2023 edX LLC. All rights reserved.