Data8 Fa21 Midterm

This document is an exam for a Data 8 course. It provides instructions for taking the exam online or by email. The exam is personalized for each student's email address. It contains questions in multiple choice and checkbox formats. The questions cover topics like working with tables, arrays, probabilities, comparing sample sizes, and conducting an A/B test. Students are asked to write code to analyze datasets and calculate statistical values. The exam is due by a specified deadline.

Uploaded by

Baoxin Zhang


DATA 8 Sample Exam.

Fall 2021 Final Exam

INSTRUCTIONS
This is your exam. Complete it either at exam.cs61a.org or, if that doesn’t work, by emailing course staff with your
solutions before the exam deadline.
This exam is intended for the student with email address <EMAILADDRESS>. If this is not your email address, notify
course staff immediately, as each exam is different. Do not distribute this exam PDF even after the exam ends, as
some students may be taking the exam in a different time zone.
For questions with circular bubbles, you should select exactly one choice.
# You must choose either this option
# Or this one, but not both!
For questions with square checkboxes, you may select multiple choices.
2 You could select this choice.
2 You could select this one too!
You may start your exam now. Your exam is due at <DEADLINE> Pacific Time. Go to the next page
to begin.
Exam generated for <EMAILADDRESS> 2

Preliminaries
You can complete and submit these questions before the exam starts. Note that ‘...’ can stand for any code after the
given variable.
(a) What is your full name?

(b) What is your student ID number?

(c) Who is your Lab GSI?


1. (18 points) Working with Tables

After the Data 8 midterm, Will, Eddie, and Melissa decide to get dinner at a restaurant in Berkeley, but they’re
having trouble deciding on a single place. They create a table of all Berkeley restaurants, RESTS_TBL, with four
columns:
• “REST_NAME”: The name of the restaurant
• “CUISTYP”: The cuisine (type of food) served at this restaurant
• “Rating”: The numerical rating given to the restaurant by the Daily Cal (a float)
• “Distance From Sproul”: The distance, in miles, the restaurant is from Sproul Hall (a float)

REST_NAME CUISTYP Rating Distance From Sproul

Imm Thai Thai 9.9 0.2
Berkeley Social Club Korean 8.7 0.8
Italian Homemade Italian 7.9 1.1

(. . . 76 more rows)
(a) (3 pt) Help Will count how many restaurants there are for each cuisine. Write a line of code that outputs
a table with two columns: one column with the type of cuisine, and one column containing a count of how
many restaurants there are with that cuisine.
Reminder: the columns of RESTS_TBL are “REST_NAME”, “CUISTYP”, “Rating”, and “Distance From
Sproul”.

(b) (3 pt) Will wants to eat at the highest-rated restaurant. Write a line of code that evaluates to the name
of the restaurant with the highest rating. (You can assume there is only one restaurant with the highest
rating; there are no ties.)
Reminder: the columns of RESTS_TBL are “REST_NAME”, “CUISTYP”, “Rating”, and “Distance From
Sproul”.

(c) (3 pt) Melissa only wants to eat at a Thai restaurant. Write a line of code that evaluates to a table
containing all four columns but only the rows for restaurants whose cuisine is “Thai”.
Reminder: the columns of RESTS_TBL are “REST_NAME”, “CUISTYP”, “Rating”, and “Distance From
Sproul”.

(d) (3 pt) Eddie didn’t want to walk to any restaurants that were further than one mile away from Sproul.
Fill in the code below to assign the variable EDDIE_CHOICE to a table containing only restaurants that are
less than one mile from Sproul.
EDDIE_CHOICE = ...
Reminder: the columns of RESTS_TBL are “REST_NAME”, “CUISTYP”, “Rating”, and “Distance From
Sproul”.

(e) (3 pt) Will decides to randomly pick a restaurant from the restaurants that are less than one mile from
Sproul. Write code that randomly picks a restaurant from the EDDIE_CHOICE table and assigns the variable
WILL_CHOICE to the name of that restaurant.
WILL_CHOICE = ...
Reminder: the columns of RESTS_TBL are “REST_NAME”, “CUISTYP”, “Rating”, and “Distance From
Sproul”.

(f) (3 pt) Write a line of code that evaluates to the number of different cuisines that appear in the “CUISTYP”
column of the RESTS_TBL table.
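The operations asked for in parts (a) through (f) can be sketched in plain Python. This is only an illustration under stated assumptions: it uses the standard library rather than the course's table library, the list `rests` holds just the three rows displayed above (not the full table), and the lowercase variable names are hypothetical stand-ins.

```python
import random
from collections import Counter

# Only the three rows shown in the exam; the real RESTS_TBL has 76 more.
rests = [
    {"REST_NAME": "Imm Thai", "CUISTYP": "Thai",
     "Rating": 9.9, "Distance From Sproul": 0.2},
    {"REST_NAME": "Berkeley Social Club", "CUISTYP": "Korean",
     "Rating": 8.7, "Distance From Sproul": 0.8},
    {"REST_NAME": "Italian Homemade", "CUISTYP": "Italian",
     "Rating": 7.9, "Distance From Sproul": 1.1},
]

# (a) Count of restaurants per cuisine.
cuisine_counts = Counter(row["CUISTYP"] for row in rests)

# (b) Name of the highest-rated restaurant (assumes no ties).
best_name = max(rests, key=lambda row: row["Rating"])["REST_NAME"]

# (c) Rows whose cuisine is Thai.
thai_rows = [row for row in rests if row["CUISTYP"] == "Thai"]

# (d) Rows strictly less than one mile from Sproul.
eddie_choice = [row for row in rests if row["Distance From Sproul"] < 1]

# (e) Name of one restaurant picked uniformly at random from (d).
will_choice = random.choice(eddie_choice)["REST_NAME"]

# (f) Number of distinct cuisines.
num_cuisines = len(set(row["CUISTYP"] for row in rests))
```

Each step mirrors one table idea: grouping, sorting or taking a maximum, filtering rows by a condition, random selection, and counting distinct values.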

2. (10 points) Arrays and Tables

Several Data 8 staff are reserving rooms for study groups. The rooms table has one row per room that can
potentially be reserved:

Room Capacity Region

110MC Kresge 10 Northside
B4 Gardner 5 Central
Warbler, 435 Moffitt 4 Central

(. . . 223 more rows)

All room names are different and every room appears only once in the rooms table.
The RESEVS table has one row per reservation they have made:

STNAME Room DAYCOL Time

Meghan Quail, 431 Moffitt Tuesday 10
Rita C6 Gardner Monday 3
Margaret 110MC Kresge Friday 12

(. . . 47 more rows)
(a) (3 pt) Write a line of code that evaluates to the total capacity if we reserved every room in the rooms
table.
Reminder: rooms’s columns are “Room”, “Capacity”, and “Region”. RESEVS’s columns are “STNAME”,
“Room”, “DAYCOL”, and “Time”.

(b) (3 pt) Write a line of code that evaluates to the number of reservations that TARGETPERSON has made.
Reminder: rooms’s columns are “Room”, “Capacity”, and “Region”. RESEVS’s columns are “STNAME”,
“Room”, “DAYCOL”, and “Time”.

(c) (4 pt) Write code that assigns the variable TOP_REGION to the region of campus that has the largest number
of reservations. Note that the “Region” column of the rooms table shows the campus region for each room.
TOP_REGION = ...
Reminder: rooms’s columns are “Room”, “Capacity”, and “Region”. RESEVS’s columns are “STNAME”,
“Room”, “DAYCOL”, and “Time”.
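Part (c) needs information from both tables, which amounts to a join on the room name followed by a count per region. A minimal plain-Python sketch of all three parts follows. The reservation rows and the name `target_person` are made up for illustration and are not the exam's data; only the three displayed rooms are included.

```python
from collections import Counter

# Rooms shown in the exam (the real table has 223 more rows).
rooms = [
    {"Room": "110MC Kresge", "Capacity": 10, "Region": "Northside"},
    {"Room": "B4 Gardner", "Capacity": 5, "Region": "Central"},
    {"Room": "Warbler, 435 Moffitt", "Capacity": 4, "Region": "Central"},
]

# Made-up reservations for illustration; they are NOT the exam's data.
resevs = [
    {"STNAME": "Margaret", "Room": "110MC Kresge", "DAYCOL": "Friday", "Time": 12},
    {"STNAME": "Margaret", "Room": "B4 Gardner", "DAYCOL": "Monday", "Time": 3},
    {"STNAME": "Rita", "Room": "Warbler, 435 Moffitt", "DAYCOL": "Tuesday", "Time": 10},
]

# (a) Total capacity across all rooms.
total_capacity = sum(room["Capacity"] for room in rooms)

# (b) Number of reservations made by one person (hypothetical target).
target_person = "Margaret"
num_reservations = sum(1 for r in resevs if r["STNAME"] == target_person)

# (c) Look up each reservation's region by room name, then take the most
# common region. Room names are unique, so a dict lookup acts as the join.
region_of = {room["Room"]: room["Region"] for room in rooms}
region_counts = Counter(region_of[r["Room"]] for r in resevs)
top_region = region_counts.most_common(1)[0][0]
```

The dictionary lookup works only because every room appears exactly once in the rooms table, as the question states.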

3. (11 points) Chances

Each morning, Noor grabs a mug from her cabinet for coffee during the day. She has 9 mugs in total: 3 each of
the colors green, black, and white.
Each morning, Noor picks one mug at random from all 9 mugs regardless of the mugs she picks on other days.
In each question below, pick the correct answer.
(a) (3 pt) The weekend (Saturday and Sunday) is coming up. What is the chance that Noor picks a green
mug on both those days?
# 3/9
# 3/9 + 3/9
# 3/9 × 3/9

(b) (4 pt) Noor brings her mug to each Data 8 lecture. Next week, Data 8 lectures will be on Monday,
Wednesday, and Friday. What is the chance that Noor brings a black mug to at least one of the three
lectures?

# 3/9 + 3/9 + 3/9
# 3/9 × 3/9 × 3/9
# 1 − 6/9 × 6/9 × 6/9
# 1 − 3/9 × 3/9 × 3/9

(c) (4 pt) One of Noor’s classes has online office hours in the morning. She will attend the office hours on
Tuesday and Thursday next week, bringing her mug with her. What is the chance that the mugs she has
on those two days are the same color?
# 3/9
# 3/9 × 3/9
# 1 − 3/9 × 6/9
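The three questions above rest on two rules: multiply chances of independent events that must all happen, and use the complement for "at least one". A short sketch of that arithmetic with exact fractions, assuming (as the question states) that each morning's pick is uniform over the 9 mugs and independent of other days. The variable names are illustrative.

```python
from fractions import Fraction

# 9 mugs, 3 of each color; each morning's pick is independent and uniform.
p_color = Fraction(3, 9)       # chance of one particular color on one day
p_not_color = 1 - p_color      # chance of any other color on one day

# (a) Green on both Saturday and Sunday: multiply the independent chances.
p_green_both = p_color * p_color

# (b) Black on at least one of three days: complement of "no black on all three".
p_black_at_least_once = 1 - p_not_color ** 3

# (c) Same color on two days: add up "both green", "both black", "both white".
p_same_color = 3 * (p_color * p_color)

print(p_green_both, p_black_at_least_once, p_same_color)
```

Working with `Fraction` keeps the results exact, which makes it easy to compare them with the fraction expressions in the answer choices.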

4. (9 points) Comparing Chances

In the United States, 28% of adults use LinkedIn. Suppose you sample US adults randomly so that each sampled
adult has chance 0.28 of being a LinkedIn user independently of all the others.
(a) (2 pt) For which sample size below is there a higher chance that the percent of LinkedIn users in the
sample will be at least 25%?
# 200
# 400

(b) (2 pt) For which sample size below is there a higher chance that the percent of LinkedIn users in the
sample will be at least 50%?
# 200
# 400

(c) (2 pt) For which sample size below is there a higher chance that the percent of LinkedIn users in the
sample will be at least 25% but less than 50%?
# 200
# 400

(d) (3 pt) Briefly explain your choices in Parts (a)-(c).
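Because each sampled adult is a LinkedIn user with chance 0.28 independently of the others, the number of users in the sample follows a binomial distribution, so the chances in Parts (a)-(c) can be computed exactly. The sketch below is one way to check intuition about sample size. The function name `chance_percent_in_range` is hypothetical.

```python
from math import ceil, comb

def chance_percent_in_range(n, p, low_pct, high_pct=None):
    """Chance that the sample percent is >= low_pct and, if given, < high_pct.

    n is the sample size and p the chance that one sampled adult is a user.
    """
    k_low = ceil(n * low_pct / 100)
    # With no upper bound, include every count up to n.
    k_high = n + 1 if high_pct is None else ceil(n * high_pct / 100)
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(k_low, k_high))

for n in (200, 400):
    print(n,
          chance_percent_in_range(n, 0.28, 25),      # at least 25%
          chance_percent_in_range(n, 0.28, 50),      # at least 50%
          chance_percent_in_range(n, 0.28, 25, 50))  # at least 25%, under 50%
```

The pattern in the output reflects that larger samples produce percents that cluster more tightly around 28%.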


5. (10 points) A/B Test on Turtles

When hatching a baby turtle from an egg, we incubate the egg at some temperature. Ellen read that the
temperature an egg is incubated at influences whether or not the turtle that hatches will be male or female.
Ellen loves turtles and is wondering whether this is really right, or whether differences might just be due to
chance. She collects data on 100 randomly drawn turtles. She records the incubation temperature (in Celsius)
and the sex of the turtle that hatches in the table turtles:

Temperature Sex
30.8 M
31.5 F
32.4 F

(. . . 97 more rows)
(a) (6 pt) Ellen decides to visualize her data before doing any inference. She creates the following histograms,
using the same bins for female and male turtles. All bars of the histograms are clearly visible.

Histogram of incubation temperatures

Which of the following are conclusions that can be drawn from the histogram? Select all that apply.
2 In this sample, the number of male turtles with incubation temperatures between 29.5 and 30 degrees
is the same as the number of female turtles incubated between 30.5 and 31 degrees.
2 In this sample, the proportion of male turtles with incubation temperatures between 29.5 and 30
degrees is the same as the proportion of female turtles incubated between 30.5 and 31 degrees.
2 There was not a single male turtle in this sample incubated at a temperature above 31 degrees.
2 For at least half the male turtles in the sample, the incubation temperature was below 29.5 degrees.
2 In this sample, male and female turtles have different distributions of incubation temperatures.
2 None of the above

(b) (4 pt) Ellen performs an A/B test to see whether females in the population in general have higher
incubation temperatures than the males, or if the observed difference in distributions is due to chance.
Ellen’s test statistic is the difference between average incubation temperatures, defined as “female average
minus male average”. She simulates the statistic 1000 times under the null hypothesis. The histogram
below shows the 1000 simulated differences. The red dot shows the observed difference.

Results of simulating the test statistic

Which of the following statements is justified based on this visualization?
# Based on the test, a reasonable conclusion is that the difference observed in the sample is due to chance.
# Based on the test, a reasonable conclusion is that the average incubation temperature of females in the
population is higher than the average for males in the population.
# Based on the test, Ellen cannot reasonably decide between her two hypotheses.
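The simulation described in part (b) can be sketched as a permutation test: shuffle the sex labels, recompute "female average minus male average", and repeat. The version below is a minimal standard-library illustration, not Ellen's actual code. The eight data rows are made up (only the first three match the displayed rows), and the function names are hypothetical.

```python
import random

def mean(values):
    return sum(values) / len(values)

def female_minus_male(temps, sexes):
    """Test statistic: female average temperature minus male average."""
    female = [t for t, s in zip(temps, sexes) if s == "F"]
    male = [t for t, s in zip(temps, sexes) if s == "M"]
    return mean(female) - mean(male)

def simulate_under_null(temps, sexes, repetitions, rng):
    """Shuffle the sex labels to simulate the statistic under the null."""
    shuffled = list(sexes)
    stats = []
    for _ in range(repetitions):
        rng.shuffle(shuffled)
        stats.append(female_minus_male(temps, shuffled))
    return stats

# Made-up data for illustration only (the exam's 100 rows are not shown).
temps = [30.8, 31.5, 32.4, 29.1, 29.6, 31.9, 28.8, 30.2]
sexes = ["M", "F", "F", "M", "M", "F", "M", "F"]

rng = random.Random(0)
observed = female_minus_male(temps, sexes)
simulated = simulate_under_null(temps, sexes, 1000, rng)

# Large positive values favor the alternative that females have higher
# incubation temperatures, so the p-value uses the right tail.
p_value = sum(s >= observed for s in simulated) / len(simulated)
```

Where the observed difference falls relative to the bulk of the simulated values is what the histogram with the red dot displays.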

6. (12 points) Testing Hypotheses

In the United States, 31% of adults report being online almost constantly. A team of data scientists took a
random sample of 100 adults in San Francisco and found that 37 reported being online almost constantly.
One member of the team says, “The percent of San Francisco adults who are online almost constantly is more
than in the nation.”
Another member of the team says, “No, it’s just chance.”
In order to decide between these two positions, the data scientists will conduct a test of hypotheses.
(a) (4 pt) State a clear and complete null hypothesis.

(b) (3 pt) In order to decide between their two hypotheses, the data scientists have picked an appropriate
test statistic and simulated it 10,000 times under appropriate conditions. One of the graphs below is the
histogram of their simulated values. Which one is it, and why? [Note that in each graph, some relevant
values are labeled on the horizontal axis.]
# Testing Option A
# Testing Option B
# Testing Option C

(d) (3 pt) The 10,000 simulated values of the data scientists’ test statistic are in an array called SIM_STAT_ARR.
Write an expression that evaluates to the p-value of the test.
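A p-value computed from simulated statistics is the proportion of simulated values that are at least as extreme as the observed one, in the direction of the alternative. The sketch below assumes the statistic is the count of "online almost constantly" adults in a sample of 100; the exam does not name the statistic, so this choice is an assumption. The list `sim_stats` is a plain-Python stand-in for SIM_STAT_ARR, and `simulate_count` is a hypothetical helper.

```python
import random

rng = random.Random(8)

def simulate_count(sample_size, null_chance, rng):
    """Number of 'online almost constantly' adults in one sample under the null."""
    return sum(rng.random() < null_chance for _ in range(sample_size))

# Assumed statistic: the count out of 100 (the exam does not name the statistic).
sim_stats = [simulate_count(100, 0.31, rng) for _ in range(10_000)]
observed_count = 37

# p-value: proportion of simulated counts at least as large as the observed
# count, since large values point toward the alternative.
p_value = sum(s >= observed_count for s in sim_stats) / len(sim_stats)
```

If the statistic were the sample percent instead of the count, the comparison would be against 37 percent rather than a count of 37, with the same logic.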

7. (8 points) A/B Testing on News

Each person in a random sample of 1000 U.S. adults was asked if they agreed with the statement, “News
organizations are growing in influence.” Among the sampled men, 39% agreed. Among the sampled women,
43% agreed.
Data scientists have used an A/B test to see whether or not the observed difference is due to chance.
(a) (3 pt) The null hypothesis is one of the statements below. Pick the right one.
# In the sample, the percent of women who agree is the same as the percent of men who agree. The
observed difference is due to chance.
# In the U.S., 39% of the men agree and 43% of the women agree, due to chance.
# In the U.S., the percent of men who agree is the same as the percent of women who agree. The
difference in the sample is due to chance.
# In the U.S., the percent of women who agree is different from the percent of men who agree, due to
chance.

(b) (5 pt) The data scientists are using a 1% cutoff for the p-value of the test. They run the test and the
p-value comes out to be 0.5%, that is, 1 in 200.
Select all of the true statements below. One or more may be true; make sure you select all that are
true.
2 The data scientists will conclude that the data are consistent with the null hypothesis.
2 There is only a 1 in 200 chance that the null hypothesis is true.
2 There is a 199 in 200 chance that the alternative hypothesis is true.
2 The data scientists will reject the null hypothesis.
2 The assumptions made in the null hypothesis are used in the calculation of the p-value.
2 None of the above statements is true.

8. (14 points) Simulation

The table WELCOME_TBL contains the results of this semester’s Data 8 welcome survey. The first two rows are
shown below. Each row corresponds to a student. In the column Extraversion, each student scored themselves
on a scale of 1 (not extraverted) to 10 (extremely extraverted).

Year Extraversion Number of Textees Hours of Sleep Handedness First Pant Leg Sleep Position
Second 8 5 6 Right-handed Right Left
Second 7 8 7.5 Right-handed Right Left

(. . . 1000 rows omitted)

(a) (4 pt) Complete the code below to define a function FUN_NAME that takes a sample size as its argument.
The function should sample that many times at random without replacement from all the students and
return the maximum extraversion score of the sampled students.
def FUN_NAME(...):
    ...
    ...

(b) (5 pt) Complete the code below so that the last line evaluates to an array of 10,000 simulated values of
the maximum extraversion score in a random sample of size 25 drawn without replacement from all the
students. Your code should use the function FUN_NAME that you defined above.
repetitions = ...
SIM_VALS = ...

for ... in ...:
    ...

SIM_VALS
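The pattern in parts (a) and (b) is a function that produces one simulated value, followed by a loop that collects many of them. A minimal standard-library sketch of that pattern follows. The list `extraversion` is a made-up stand-in for the Extraversion column of WELCOME_TBL, and `max_of_sample` and `sim_vals` are hypothetical names standing in for FUN_NAME and SIM_VALS.

```python
import random

rng = random.Random(2021)

# Made-up stand-in for the Extraversion column: 1002 scores from 1 to 10.
extraversion = [rng.randint(1, 10) for _ in range(1002)]

def max_of_sample(sample_size):
    """Maximum score in a random sample drawn without replacement."""
    # random.Random.sample draws distinct positions, i.e. without replacement.
    return max(rng.sample(extraversion, sample_size))

repetitions = 10_000
sim_vals = []
for _ in range(repetitions):
    sim_vals.append(max_of_sample(25))
```

Changing the argument of `max_of_sample` is all it takes to rerun the simulation with a different sample size.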

(c) (3 pt) A student mistypes the sample size in the previous question to be 55 instead of 25. One of the
histograms below shows the distribution of the maximum values simulated by this student. The other
shows the distribution of the maximum values that you simulated using a sample size of 25. Which is
which?

Histograms A and B of simulated maximum values
# A is sample of 25, B is sample of 55
# A is sample of 55, B is sample of 25

(d) (2 pt) Explain your answer above.


9. (8 points) Interpreting Visualizations

A medical institute that specializes in sports medicine has recorded data on athletes with leg injuries. The
variables are the distance that the athlete achieved in a test called the triple hop, and how high the athlete
could jump vertically. Both distances were measured in centimeters.
The data are in a table called jump that has columns labeled Triple Hop and Vertical.

Triple Hop Vertical

443 59
481 62

(. . . 86 more rows)
(a) (3 pt) The histogram below shows the distribution of the triple hop distances, drawn using the following
code.
jump.hist('Triple Hop', bins=np.arange(300, 900, 50))

Histogram of triple hop distances

Complete the sentence with the correct option.
The percent of athletes whose triple hop distances were at least 400 centimeters but less than 500 centimeters
is equal to
# 0.7%
# 7%
# 30%
# 35%
# 40%
# some value that is none of the above or cannot be computed based on the information given
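Reading a percent off a histogram relies on the area principle: when the vertical axis is in percent per unit, the percent in a bin equals the bar's height times the bin's width. The sketch below illustrates the arithmetic, assuming the histogram is drawn on that density scale. The bar heights are hypothetical and are not read from the exam's figure.

```python
# Bin edges matching np.arange(300, 900, 50): 300, 350, ..., 850.
bin_edges = list(range(300, 900, 50))
bin_width = bin_edges[1] - bin_edges[0]

# Hypothetical bar heights in percent per centimeter
# (NOT read from the exam's figure).
height_400_to_450 = 0.25
height_450_to_500 = 0.15

# Area of a bar = height x width = percent of athletes in that bin.
# The interval from 400 to 500 spans two adjacent bins, so add both areas.
percent_400_to_500 = (height_400_to_450 + height_450_to_500) * bin_width
```

With these made-up heights the two bars together cover about 20 percent of the athletes; the same calculation applies to whatever heights the actual figure shows.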

Scatter plot of athlete data


(b) (5 pt) The scatter plot below has a point for each of the athletes. Pick all the conclusions that can be
drawn from the scatter plot. Make sure you pick all that apply.
2 More than half the athletes jumped less than 60 centimeters vertically.
2 Most of the athletes whose triple hop distances were longer than average also jumped higher than
average.
2 If athletes were to increase their triple hop distances then they would be able to jump higher.
2 If athletes were to increase the heights of their vertical jumps, they would be able to triple hop longer
distances.
2 None of the above conclusions can be drawn from the scatter plot.

10. (0 points) Final Words

(a) (0 pt) If there was any question on the exam that you thought was ambiguous and required clarification
to be answerable, please identify the question and state your assumptions. Be warned: We only plan to
consider this information if we agree that the question was erroneous or ambiguous and we consider your
assumption reasonable.

No more questions.
