0% found this document useful (0 votes)
82 views9 pages

Stat 201 - Project 1 - Spring 2019: Due Friday, February 8, 2019

This project involves analyzing a random sample of 900 plus the last two digits of the student's ID number from a larger dataset of survey responses. Students will explore relationships between variables, compare groups using boxplots, and examine distributions. Output from JMP analyses must be embedded within the written report. Guidelines are provided for writing the report and completing each question correctly.

Uploaded by

Bao Gan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
82 views9 pages

Stat 201 - Project 1 - Spring 2019: Due Friday, February 8, 2019

This project involves analyzing a random sample of 900 plus the last two digits of the student's ID number from a larger dataset of survey responses. Students will explore relationships between variables, compare groups using boxplots, and examine distributions. Output from JMP analyses must be embedded within the written report. Guidelines are provided for writing the report and completing each question correctly.

Uploaded by

Bao Gan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

Stat 201 – Project 1 – Spring 2019

Due Friday, February 8th, 2019


*(11:59 PM in the evening, submitted to Canvas)*
Assignments submitted by *11:59pm on Wednesday February 6th will receive +7 bonus points
Late projects should be counted off 15 points per day late.

In this project, you will explore a subset (i.e., a sample) of the data collected from the survey that most Stat 201 students
completed this semester. See pages 6-8 for a complete list of the survey questions. You will be including a substantial amount of
output within your write-up. INCLUDE ONLY THE OUTPUT NECESSARY TO ANSWER THE PROJECT
QUESTIONS. ANY JMP WORK YOU DO TO ANSWER A QUESTION MUST BE INCLUDED AS A SCREENSHOT
IN YOUR REPORT.

Getting Started: The data are found in the file “Stat 201 Spring 2019 Survey Results.jmp”, which is located on the Stat 201
Webpage under the “Projects” tab. This file contains 1308 responses. In real life situations, researchers would use all the data
they have available after conducting a survey. For this project, however, you will get JMP to help you take a random sample from
the entire data set so that each student will have different results, and therefore will be turning in a UNIQUE project. The size of
the random sample must be 900 plus the last two digits of your UT student ID number. For example, if your UT student ID
number is 000314791, you will take a random sample of size 900 + 91 = 991. When you create your random sample from the
original JMP file, JMP creates a new file that will be named “Subset of Stat 201 Spring 2019 Survey Results”. You should
immediately save a copy of this file by clicking the “File” menu and choosing “Save As…”. JMP will prompt you to keep the
same name, which is acceptable, or you can rename it to something like “Stat Project1 – My Data”

Taking Screenshots: Although there are many ways to get JMP graphics into a written presentation, we want you to use the
“screen shot” method in all cases. Please see the video at http://tinyurl.com/utk-screenshots for instructions on how to take
selective screen shots on a PC or a Mac. Clearly label what question and part you are answering so your project is graded
correctly! See page 5 for an example project format.

Tutorials and Write-up: See the JMP tutorials at http://web.utk.edu/~cwiek/201Tutorials/ and the Stat 201 YouTube Channel
Playlist for instructions on how to get JMP to perform most tasks. Use page 4 of this project for guidance in which tutorial to
look at for each question in the project, as well as important “notes” to further help you with certain JMP tasks. In every question
that asks you to produce output from JMP, we expect the output you produce to answer the question to be within the write-up.
You should put this output immediately after your comments regarding that specific part of the assignment (i.e., not just a series
of printouts from JMP at the back of your write-up). You can get help in the Stat 201 Lab with specific questions about the
project. You can NOT ask a Stat 201 Lab worker to read your entire project for suggestions on what to change. Your finished
work must be submitted within Canvas (see “Assignments”), and must be a Microsoft Word document (.doc or .docx).

Important note for Windows (PC) users: Open JMP, click on the File menu and select Preferences. Then, on the left, select
Windows Specific. About three-fourths of the way down this window, where it says Auto Hide Menu and Toolbars, click the
drop down arrow and select Never. Click OK, then close this window. You should only have to do this once, and Mac users do
not need to do this.

JMP and Hodges Library computers: Using JMP installed on your own computer is much simpler than using JMP on a library
computer! If you choose to use a computer in the library to do your project, be sure to first read the document “Using JMP in the
Library”, found on the Stat 201 webpage under the “Projects” tab. Also, you will need to save your project and your random
sample subset file to a location you can access later, such as a memory stick. Or, you could e-mail these files to yourself for later
use.

Writing a Good STAT 201 Project Report: Please take note that on page 9 of this document there is a page titled “Writing a
Good Stat 201 Project Report”. This page contains a series of guidelines for the written part of your report. A portion of your
grade (9%) is related to following these guidelines.

1
Project Questions
Question One

1. Using the full JMP data file, get JMP to select a random sample of size 900 plus the last 2
digits of your student ID number (as described earlier in this document). Report the size
of your sample. Although we wouldn’t normally do this, make a histogram of the
variable “Q0 - Original Row Number (Identifier)”. Make sure your histogram is in
horizontal layout and has a count axis (See the example on page 5 of this project.).
Include the summary statistics. Save your JMP file, and use it for all remaining
questions. (10 points)

Question Two

2.a) Choose two categorical variables for this problem. Do not use any variable with more
than five levels. For example, the variable “Q15 – Major” has over 200 levels, so this
variable would not be an option. Name the two variables you will be using and describe
the relationship you expect them to have. NOTE: Do not use the variables in the
example project on page 5. (3 points)

b) Create a mosaic plot and contingency table for your two variables mentioned in 2a. The
contingency table in your output should display “Count” and “Row %” values only.
Include this screenshot in your report. (9 point)

c) Using the mosaic plot, explain the relationship between these two variables. In your
explanation, you must mention some of the “Row %” values. (Notice that the “Row %”
values are what the heights of the bars in the Mosaic plot are based on.) (9 points)

d) Did the Mosaic plot verify or refute your prediction in part (a)? Do you believe the
association you see between these two variables is “strong”? Briefly explain your
answer. NOTE: the word “strong” is open to interpretation, so there is no right answer
for this, it’s just your opinion. (6 points)

e) Go through additional variables in the survey to find two variables that do not have a
relationship. Try to find the weakest relationship possible. It is possible to view multiple
mosaic plots at one time by including more than one variable in X variable field. Include
a screenshot of the weakest relationship you could find along with the contingency table.
NOTE – You do not have to find the weakest relationship possible. There are many
possible graphics for this question. (12 points)

2
Question Three

3. In this question, you will explore the variable “Q61 – Facebook Friends”. Read the full
survey question description for this variable (page 7).

a) Generate a horizontal boxplot of this variable using graph builder. Within your document,
use the “tabulate” option under “analyze” to include the min, max, median, Q1 and Q3.
Interpret in context the median. Also, calculate and interpret the IQR from your results.
(12 points, make sure to interpret what you report here)

b) Use tabulate to create a table for the mean and standard deviation. Include this table in
your report. Find the z-score for how many Facebook Friends you have. Show the math
for how you found your z-score and explain what the z-score means. (3 point)

c) Next you will explore possible differences for answers to this question depending on if
they are in Greek life “Q07 - Greek Life”. Prior to producing any output, which group do
you think will have more Facebook friends: those involved in Greek life or those not
involved? Provide a guess of the difference between medians (in Facebook friends) you
expect to find between the two groups. There is no “right” answer to this question, but
you must provide a logical explanation for your conclusion. (3 point)

d) Generate side-by-side boxplots for these two groups and include the quantiles below.
Report and interpret in context the median and IQR for each group. Comment on any
differences you see between the summary statistics. (12 points, make sure to interpret
what you report here)

e) Include another table that shows the mean and median for “Q61 – Facebook Friends”.
Explain briefly why the differences between the mean and median is so large for this
variable. (6 Points)

f) The data is heavily skewed so the normal model does not apply. Use your distribution to
show this by calculating according to the normal curve the lowest 2.5% of Facebook
friends we would expect people to have. Show your work on how you came to this
answer. NOTE – Use the empirical rule to find your answer. (6 Points)

Remaining points distribution:


Project organization and flow (3 points)

Projects should look neat and organized. Use the crop tool in Word if you need to improve screenshots.
Your project should read like a report without the prompt of each question.

Use of the guidelines on page 9 – (6 points)

The opening paragraph on the project should give a short summary (3-5 sentences) of the analysis
they’re about to read. The closing paragraph should summarize interesting finds and discuss any ideas
regarding further data collection and analysis. Make sure to use an opening and closing that is relevant
to this project and your data.
3
JMP Tutorials Needed for Each Question
Question Heading Tutorial Notes
1 Miscellaneous Topics Taking a Simple
Random Sample
1 Graphical Display of Histogram and Box Plot After you place “Q0 - Original Row
Quantitative Data Number (Identifier)” in the Y,Columns
box. The red arrow at the top allows
you to stack the histogram and put it in
horizontal format. The second red
arrow has histogram options for a count
axis.
2b and 2e Graphical Display of Mosaic Plot To get the contingency table to display
Categorical Data “Count” and “Row%” only, uncheck
everything except “Count” and “Row%”
in the drop-down menu next to the
“Contingency Table” output. Also, since
your data are “unaggregated”, you
should place nothing in the “Freq” box.
3a Graphical Display of Box Plot Under the graph tab click on “graph
Quantitative Data builder”. Here you can add “Q61 –
Facebook Friends” to the x-axis so it is
in horizontal layout. Click “Done” and
then take a screenshot.
3d Graphical Display of Side-by-Side Box Plots After you place “Q61 – Facebook
Quantitative Data Friends” in the Y,Columns box, click on
“Q07 - Greek Life” and then click the
“X” button to produce separate output
for the two groups.
3a, 3b, 3e Graphical Display of Tabulate Use the “Tabulate” option under
Quantitative Data “Analyze”. From here hold control
(PC) or command (Mac) and select the
summary statistics you want then drag
these summary statistics to the column
field. Next choose the variable you
want the summary statistics for and put
this variable in to the rows field.
Decimals can be changed with the
“Change format” option and changing
the default of 12 to 6.

4
EXAMPLE FORMAT – Showing Question 1, 2a and 2b
STAT 201 Project #1 – Spring 2019 – Learning about Spring 2019 Stat 201 Students
Submitted by Jane Q. Student

Note: Your first three to five sentences should address item #1 on page 9 of this document. Make sure
that your opening is unique to you and is not copy/pasted between projects. Your introduction should
feel like the introduction to a book which informs the reader.

1. Since the last two digits of my student ID number are 22, I took a random sample of size 922.

2a. I have picked the variables “Q7 – Greek Life” and “Q3 – Born in TN”. I expect the relationship
between them to… (Include reasoning)

2b.

5
STAT 201 SURVEY – Spring 2019
FOR REFERENCE ONLY - FULL TEXT OF QUESTIONS ASKED

Q1 Which section of Stat 201 are you in?


Q2 What is your gender?
Q3 How old are you (In years)?
Q4 Were you born in Tennessee?
Q5 What is your relationship status?
Q6 How far do you live from campus?
Q7 What was your high school GPA?
Q8 Are you a member of a fraternity or sorority?
Q9 Are you an only child, oldest child, middle child or youngest child? Pick one answer that best
describes your birth order.
Q10 Have you ever broken a bone?
Q11 Do you have a Roth IRA?
Q12 How many pets do you own?
Q13 Estimate how much you spend on your pets per year; include veterinary expenses, food, toys,
treats, grooming, etc. Enter zero if you have no pets.
Q14 How many hours per week do you spend reading assignments from textbooks that your instructors
assign Include all classes, not just Stat 201.
Q15 What is your major?
Q16 How would you identify the economic level of your immediate family?
Q17 Are you in the honors program at UT?
Q18 What do you expect your starting annual salary (in US dollars) to be when you obtain a college
degree?
Q19 Do you (or your parents) plan on, or have you (or your parents) already, taken out student loans to
pay for your college expenses?
Q20 How many hours a week do you currently work at a job? If you are not employed, please put 0.
Q21 How many languages can you speak fluently? This includes your native language. This is the
language you first learned.
Q22 How would you classify your views on economic political issues?
Q23 How would you classify your views on social political issues?
Q24 What should happen to Confederate statues?
Q25 Should the United States stop making pennies? (Eliminate the penny)
Q26 Will humans step foot on Mars?
Q27 Select the option below that completes the following sentence in a way that best describes your
opinion. "Climate change on Earth:
Q28 Which of these do you believe to be closest to the truth regarding life on Earth?
Q29 Have you ever smoked marijuana?
Q30 Do you think marijuana should be legalized at the federal level (For the whole US)?
Q31 Should states have the ability to regulate what couples can marry? (i.e. defining marriage as only
one man and one woman)
Q32 Age when you had your first alcoholic beverage. IMPORTANT- Don't count sips or
communion. This should be an actual drink of alcohol.

6
STAT 201 SURVEY – Spring 2019
FOR REFERENCE ONLY - FULL TEXT OF QUESTIONS ASKED

Q33 When you eat out at a restaurant that involves a waitress or waiter, what percent do you usually
tip? Enter response as a whole number with no decimals for a percentage from 0 to 100.
Q34 Which statement best describes you behavior when you drink water on campus?
Q35 What is the most you've paid for a single coffee based drink? This includes any size, additions and
tips given for the drink.
Q36 Do you use any tobacco products?
Q37 Is vaping a safer alternative to smoking?
Q38 How many times have you cheated in college? This includes looking at another test during an exam,
taking another student's work and presenting it as your own and other forms of academic dishonesty.
Q39 How many times did you cheat in high school? This includes looking at another test during an
exam, taking another student's work and presenting it as your own and other forms of academic
dishonesty.
Q40 On an average night, how many hours of sleep during the school year do you usually get?
Q41 What is the longest number of consecutive hours you've stayed awake?
Q42 Have you ever been arrested?
Q43 Approximately how many text messages do you send a week?
Q44 Approximately how many text messages do you send on the weekend?
Q45 On a typical school day last semester, approximately how many text messages would you send
during class (while you were attending class)?
Q46 What is your favorite app on your phone?
Q47 What percentage of your income do you believe you should save in your 20s? Enter response as a
whole number with no decimals for a percentage from 0 to 100.
Q48 Have you ever purchased perishable food items online?
Q49 In the past 6 months, have you purchased a product based on a TV commercial?
Q50 How many on-line purchases (not counting music downloads) have you made in the last week?
Q51 How often do you use coupons when you shop (not including on-line shopping)?
Q52 Roughly, how many selfies have you posted on social media in the past month?
Q53 How much are you will to pay to see your favorite musician in concert?
Q54 What do you think of Kanye West as an individual?
Q55 What do you think of Steven Colbert as an individual?
Q56 Have you ever watched most or all of a live sporting event on a smart phone or tablet?
Q57 Have you ever gone to a physical store to check out the features of a product, with the intention of
purchasing the item online later?
Q58 How do you usually to listen to music?
Q59 Have you ever reviewed a business or product on social media (i.e. Twitter, Facebook, ect.)?
Q60 Are you "friends" on Facebook with one or more of your parents (include step-parents in your
answer)?

7
STAT 201 SURVEY – Spring 2019
FOR REFERENCE ONLY - FULL TEXT OF QUESTIONS ASKED

Q61 Approximately how many friends do you have on Facebook? If you don't have a Facebook account,
answer 0 for this question.
Q62 Approximately how many people have you defriended on Facebook. If you don't have a Facebook
account, answer 0 for this question.
Q63 In a typical week last semester, how much time (in hours) did you spend on "social media"? Include
both time reading social media and time communicating with social media.
Q64 The Tennessee Vols are scheduled to play 12 football games in the 2019 season (I.e., not including
the SEC championship game, and not including a bowl game). Number of games won reported
Q65 Did you lie at any point on this survey?

8
Writing a Good STAT 201 Project Report
Writing a report to your boss about a statistical analysis he has asked you to do is
very different than writing a novel, or writing to your Statistics instructor. What
does it take to write a good project report? Of course, it’s important to know
your audience when you write anything.

Let’s assume you are writing your project report for some busy executives in the
company, and they have asked you to answer the questions in the project. They
are very intelligent people, but they are not “Statisticians”. Assume that these
executives have had some basic statistical education, but perhaps a long time ago.
Keep this in mind as you complete your project.

Below are some guidelines for writing an effective project report:

1. The first sentence or two of your report should “orient” the reader. What is
this document about? Who is it from? On what date did you complete the
analysis?

2. Answer each question on the project instructions using correct sentence


structure, spelling and grammar. Sentences should be succinct and clear. You
can assume the executives have a copy of the questions they asked.

3. Avoid using "statistical jargon". Explain the results of the analysis in a way that
the executives can understand it.

4. As explained in the project instructions, graphics from JMP and/or Excel that
address the project question must be imbedded within the document, at the
point where the executives need to see them. Don’t make them hunt for the
output at the back of your report.

5. Avoid including discussion and/or graphics within the report that have no
relevance to the question being addressed.

6. Wrap-up and sign off: some sort of closing sentence regarding how happy you
were to answer the executives’ questions, and demonstrate your willingness
to do similar work in the near future. The word “Sincerely” or “Regards”,
followed by your name is a nice touch at the very end!
rev. 2015-01-22
9

You might also like