Assignment01 Question

The document provides guidance and questions for Assignment 1 in STAT100-D900. It outlines general guidance for preparing and submitting solutions. It then provides 6 multiple part questions on topics related to data analysis, surveys, and statistical concepts. Students are asked to identify variables, parameters, statistics, and other statistical elements. They are also asked to interpret data, calculate proportions and confidence intervals, and identify sources of error in surveys.

Uploaded by

Yash Patel

STAT100-D900

Assignment 1
Due: Tuesday, May 28, 2024, 11:00 PM in Crowdmark

General Guidance on How to Prepare Your Solution


• Prepare your solution for each question on paper or digitally. Make sure to convert your solution
into a PDF or PNG file before submitting it on Crowdmark. Additionally, ensure that your
uploaded solution is in the correct orientation for easy marking.
• Ensure that you have a separate solution file for each question and submit them individually.
• Clearly label your answers for each part of the question.
• Ensure that your submission is readable, uses an appropriate font size, and is clear enough for
TA marking.
• Attention: Please do NOT submit solutions that are similar to others, as this violates our
academic integrity policy. While you are encouraged to discuss assignment questions with TAs
or peers in general terms, it is important that you prepare your solutions independently. Any
solutions found to be identical or displaying a high level of similarity will result in a zero score for
the entire assignment, once identified by the TAs.

Q1. [9 pts] Identify individuals and type of variables.


Here is a small part of a survey conducted at a local gym regarding customer satisfaction and usage
patterns.
Member ID   Age   Gender   Frequency of Visits Per Week   Satisfaction Rating
M001        24    Male     4                              5 (Very Satisfied)
M002        30    Female   3                              4 (Satisfied)
M003        22    Female   1                              3 (Neutral)
M004        35    Male     2                              2 (Dissatisfied)
M005        28    Female   5                              5 (Very Satisfied)
M006        42    Male     2                              3 (Neutral)
M007        37    Female   3                              4 (Satisfied)
M008        45    Male     1                              1 (Very Dissatisfied)
M009        25    Female   4                              5 (Very Satisfied)
M010        39    Male     3                              2 (Dissatisfied)

(a) [1 pt] What are the individuals in this data set?


(b) [4 pts] How many variables are measured for each individual? List all variables.
(c) [1 pt] Identify the variable in this dataset that serves as an identifier for each individual.
(d) [3 pts] For all variables measured except the identifier variable, list all numerical variables and all
categorical variables separately.

Marking Rubric
(a) 1 point for the correct identification of individuals.
(b) Deduct 0.5 points for each missing variable.
(c) 1 point for the correct identification of the identifier in this data table.
(d) Deduct 0.5 points for each misclassification of a variable.


Q2. [4 pts] Text. #1.12 The death penalty


A press release by the Gallup News Service says that, based on a poll conducted on October 5–11, 2017,
it found that 54.96% of Americans respond Yes when asked this question: “Are you in favor of the death
penalty for a person convicted of murder?” Toward the end of the article, you read: “Results for this
Gallup poll are based on telephone interviews conducted October 5–11, 2017, with a random sample of
1,028 adults, aged 18 and older, living in all 50 U.S. states and the District of Columbia.”
(a) [1 pt] What variable did this poll measure?
(b) [1 pt] What population is Gallup interested in obtaining information about?
(c) [1 pt] What was the sample in this poll?
(d) [1 pt] Based on the informa=on provided, how many respondents answered “Yes”?

Marking Rubric
(a) 1 point for stating the variable measured.
(b) 1 point for specifying the population.
(c) 1 point for indicating the sample correctly.
(d) 1 point for the correct answer.

Q3. [6 pts] Exercise and Mental Health in Men.


An article from the New York Times reported on a study about exercise habits and mental health
outcomes in men. Since 2000, a team of Canadian researchers has been collecting extensive data from
800,000 men aged 40 to 55. The researchers documented the men's exercise routines when they joined
the study and again five years later. They then explored any correlations between the amount of exercise
and the 45,000 cases of mental health issues (such as depression and anxiety) that the men developed
over the study period. The study observed that even men who engaged in moderate exercise, such as
walking for 30 minutes a day, showed a significantly lower risk of developing mental health issues.

(a) [2 pts] Is this an experiment? Explain your answer.


(b) [1 pt] We would prefer a sample survey to using men who volunteer for a study. What
population does it appear that the researchers were interested in?
(c) [3 pts] What variables did they measure? List at least 3 variables.

Marking Rubric
(a) [2 pts] 1 point for correctly identifying the study type (observational or experimental study) and
1 point for providing a reasonable justification.
(b) [1 pt] 1 point for correctly identifying the population of interest.
(c) [3 pts] 1 point for detailing each variable measured in this study.

Q4. [5 pts] Text, #2.16. Choose an SRS.


A firm wants to understand the attitudes of its managers toward its system for assessing employee
performance. Following is a list of all 32 of the firm’s managers, in alphabetical order (down the columns).


We aim to select five managers from a list by using the random digits in Table A. To prepare for this
simple random sample, label each of the 32 managers alphabetically from 01 to 32. Begin at line 102 (as
indicated below) to identify the five selected managers. If you do not reach the desired sample size by
the end of the line, continue selecting by moving to the next line until all five managers are chosen.

Hint: your solution should be a list of 5 managers in this format: labeling #, name.

Marking Rubric
• One point for each chosen manager.
• Some students might not read the question carefully and label all managers from 01 to 32, in
this case, the list of the sample units would be different. For this type of solution, take 2 points
off.
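
The selection procedure described in Q4 can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `select_srs` is hypothetical, and the digit string is made up for the example. It is NOT line 102 of Table A, so the output is not the answer to this question. The sketch reads the digits in two-digit groups, keeps groups that match a label from 01 to 32, and skips out-of-range groups and repeats.

```python
def select_srs(digits: str, population_size: int, sample_size: int) -> list[int]:
    """Pick labels by reading fixed-width groups from a string of random digits."""
    # Drop the spaces that random digit tables use for readability.
    digits = "".join(ch for ch in digits if ch.isdigit())
    width = len(str(population_size))  # 2 digits for labels 01..32
    chosen: list[int] = []
    for i in range(0, len(digits) - width + 1, width):
        label = int(digits[i:i + width])
        # Keep only labels that exist and have not already been selected.
        if 1 <= label <= population_size and label not in chosen:
            chosen.append(label)
            if len(chosen) == sample_size:
                break
    return chosen


# Made-up digits for illustration only (not taken from Table A).
made_up_digits = "45017 33128 90122 07450 19963"
print(select_srs(made_up_digits, population_size=32, sample_size=5))
```

If the digits run out before the sample is complete, the function returns the labels found so far; with a real table you would append the next line of digits and continue, as the question instructs.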

Q5. [10 pts] Parameter, Statistics and Confidence interval.


A recent survey revealed that pesticide residues could pose health risks to consumers. An agricultural
inspector takes a simple random sample (SRS) of 300 apples from all apples harvested on a particular
day. Laboratory tests find that 15 of these apples contain detectable levels of pesticide residues.
Unbeknown to the orchard owner, 2% of all apples harvested contain pesticide residues.
(a) [2 pts] In this situation, specify whether the number 15 is a parameter or a statistic, and briefly explain
your answer.
(b) [2 pts] In this situation, specify whether the number 2% is a parameter or a statistic, and briefly explain
your answer.
(c) [1 pt] Based on the sample data from the orchard, what is the estimated proportion of
contaminated apples in this population? Show the details of your calculation, round your answer
to three decimal places and express it as a percentage. E.g., 2/3 = 0.667 = 66.7%.


(d) [1 pt] Using the quick approximation method, find the margin of error for a 95% confidence
interval for the population proportion of contaminated apples. Show details of your calculation,
round your answer to three decimal places and express it as a percentage.
(e) [2 pts] Find the 95% confidence interval (CI) for the population proportion of contaminated
apples. Show details of your calculation and ensure your answer is in this form: (lower CI limit,
upper CI limit). Does the interval contain the true population proportion, p = 2%?
(f) [2 pts] Based on another sample, another researcher obtained a 95% confidence interval of
3.5% to 7.1%. However, the true population proportion of contaminated apples is 2%,
which doesn’t fall within the CI: (3.5%, 7.1%). Is this possible? Explain your answer.

Marking Rubric:
(a) [2 pts] 1 point for the correct identification of number 15. 1 point for a reasonable explanation.
(b) [2 pts] 1 point for the correct identification of 2%. 1 point for a reasonable justification.
(c) [1 pt] 0.5 points for the calculation details; 0.5 points for the final answer.
(d) [1 pt] 0.5 points for showing calculation details, 0.5 points for the final answer.
(e) [2 pts] 1 point for applying the right formula to get the CI. 0.5 points for the lower CI limit, and
another 0.5 points for the upper CI limit. Further conclude whether 2% falls within this interval
or not.
(f) [2 pts] 1 point for stating whether this is possible or not. 1 point for any reasonable explanation.
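
The calculations in Q5 (c) to (e) follow a common pattern, sketched below in Python. Two assumptions are made here: that the "quick approximation method" means the rule that the 95% margin of error is roughly 1/sqrt(n) for a simple random sample of size n, and that the interval is the sample proportion plus or minus that margin. The function name `quick_ci` is hypothetical, and the counts in the example (120 out of 400) are invented so that the sketch does not reproduce the assignment's own numbers.

```python
import math


def quick_ci(successes: int, n: int) -> tuple[float, float, float, float]:
    """Return (p_hat, margin, lower, upper) using the 1/sqrt(n) quick method."""
    p_hat = successes / n       # sample proportion (a statistic)
    margin = 1 / math.sqrt(n)   # quick 95% margin of error (assumed rule)
    return p_hat, margin, p_hat - margin, p_hat + margin


# Invented example: 120 "successes" in a sample of 400.
p_hat, margin, lower, upper = quick_ci(120, 400)
print(f"p_hat = {p_hat:.3f} = {p_hat:.1%}")
print(f"margin of error = {margin:.3f} = {margin:.1%}")
print(f"95% CI = ({lower:.1%}, {upper:.1%})")
```

Because the margin depends only on n under this rule, a larger sample always gives a narrower interval, regardless of the observed proportion.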

Q6. [5 pts] Identify error in survey sampling.


Each of the following is a source of error in survey sampling. Label each as a sampling error or a
nonsampling error and explain your answers.
(a) [1 pt] The participant underreports the amount of weekly exercise due to embarrassment.
(b) [1 pt] A data entry error occurs, recording ‘150 minutes’ of exercise as ‘15 minutes’.
(c) [1 pt] Participants self-select by responding to an ad on a fitness app.
(d) [1 pt] Researchers randomly select gym members to survey at a local fitness center.
(e) [1 pt] Some selected participants did not respond to multiple email invitations to participate in
the survey.

Marking Rubric:
(a) 0.5 points for the error type; 0.5 points for explanation.
(b) 0.5 points for the error type; 0.5 points for explanation.
(c) 0.5 points for the error type; 0.5 points for explanation.
(d) 0.5 points for the error type; 0.5 points for explanation.
(e) 0.5 points for the error type; 0.5 points for explanation.
