Course Challenge - Coursera
Course Challenge - Coursera
Quiz • 50
You’ve just started min job as a data analyst for a midsized pharmacy chain with 38 stores in the American
a new
*Weekly challenge 5*
Southwest. Your supervisor shares a new data analysis project with you.
Review Learning Objectives
*Course challenge*
She explains that the pharmacy is considering discontinuing a bubble bath product called Splashtastic. Your
Reading: Test-taking strategies
supervisor wants you to analyze sales data and determine what percentage of each store’s total daily sales come
10 min
from that product. Then, you’ll present your findings to leadership.
Quiz: *Course challenge*
You know thatSubmit your assignment
it's important to follow each step of the data analysis process: ask, prepare, process, analyze,
10 questions Start assignment
share, and act.
Due So,Sep
you18,
begin
12:59by
AMdefining the problem
EEST Attempts and24making
3 every hours sure you fully understand stakeholder
Video: Congrats! Course wrap-up
expectations. You will have 50 minutes to finish
3 min
Reading: Coming up next... One of the questions you ask is where to find the dataset you’ll be working with. Your supervisor explains that the
10 min company database has
Receive all the information you need.
grade Your grade
To Passto 80
Next, you continue %prepare
the or higher step. You access the database and write a query to retrieve data about
-
Splashtastic. You notice that there are only 38 rows of data, representing the company’s 38 stores. In addition,
your dataset contains five columns: Store Number, Average Daily Customers, Average Daily Splashtastic Sales
(Units), Average Daily Splashtastic Sales (Dollars), and Average Total Daily Sales (All Products). You decide to use a
spreadsheet to work
Like with theDislike
data because Report
you know that spreadsheets work well for processing and analyzing a
an issue
small dataset, like the one you’re using.
Fill in the blank: To get the data from the database into a spreadsheet, you would first _____ the data as a
.CSV file, then import it into a spreadsheet.
download
You’ve downloaded the data from your company database and imported it into a spreadsheet. IMPORTANT: To
answer questions using this dataset for the scenario, click the link below and select the “Use Template” button
before answering the questions.
OR
If you don’t have a Google account, you can download the template directly from the attachment below.
Now, it’s time to process the data. As you know, this step involves finding and eliminating errors and inaccuracies
that can get in the way of your results. While cleaning the data, you notice there’s missing data in one of the
rows. What might you do to fix this problem? Select all that apply.
Ask a colleague on your team how they've handled similar issues in the past
Sort the spreadsheet so the row with missing data is at the bottom
Once you’ve found the missing information, you analyze your dataset. During analysis, you create a new column F.
You label the top of the column Average Percentage of Total Sales - Splashtastic.
Fill in the blank: The column label you add to column F is known as ______.
an observation
a title
an attribute
a reference
Next, you determine the average total daily sales over the past 12 months at all stores. The entire range of cells
that contain these sales are E2:E39. To do this, you type a formula. You input =AVERAGE(E2-E39), but this
returns an error. What is the correct command?
=AVERAGE(E2,E39)
=AVERAGE(E2+E39)
=AVERAGE(E2:E39)
=AVERAGE(E2;E39)
Next, you create a slideshow, which includes a data visualization to highlight the Splashtastic sales insights
you've discovered. You’ve reached which phase of the data analysis process?
Act
Analyze
Share
Manage
You’ve been working for the nonprofit National Dental Society (NDS) as a junior data analyst for about two
months. The mission of the NDS is to help its members advance the oral health of their patients. NDS members
include dentists, hygienists, and dental office support staff.
The NDS is passionate about patient health. Part of this involves automatically scheduling follow-up
appointments after crown replacement, emergency dental surgery, and extraction procedures. NDS believes the
follow-up is an important step to ensure patient recovery and minimize infection.
Unfortunately, many patients don’t show up for these appointments, so the NDS wants to create a campaign to
help its members learn how to encourage their patients to take follow-up appointments seriously. If successful,
this will help the NDS achieve its mission of advancing the oral health of all patients.
Your supervisor has just sent you an email saying that you’re doing very well on the team, and he wants to give
you some additional responsibility. He describes the issue of many missed follow-up appointments. You are
tasked with analyzing data about this problem and presenting your findings using data visualizations.
An NDS member with three dental offices in Colorado offers to share its data on missed appointments. So, your
supervisor uses a database query to access the dataset from the dental group. The query instructs the database to
retrieve all patient information from the member’s three dental offices, located in zip code 81137.
The table is dental_data_table, and the column name is zip_code. You write the following query, but get an
error. What statement will correct the problem?
SELECT *
FROM dental_data_table
WHERE zip code = 81137
zip_code = 81137
WHERE 81137
The dataset your supervisor retrieved and imported into a spreadsheet includes a list of patients, their
demographic information, dental procedure types, and whether they attended their follow-up appointment. To
use the dataset for this scenario, click the link below and select “Use Template.”
OR
If you don’t have a Google account, you can download the template directly from the attachment below.
The patient demographic information includes data such as age and gender. As you’re learning, it’s your
responsibility as a data analyst to make sure your analysis is fair. Which aspect of patient demographics might
get in the way of fairness?
The dataset indicates which dental procedure the patients had performed.
The dataset includes people who all live in the same zip code.
As you’re reviewing the dataset, you notice that there are a disproportionate number of senior citizens. So, you
investigate further and find out that this zip code represents a rural community in Colorado with about 800
residents. In addition, there’s a large assisted-living facility in the area. Nearly 300 of the residents in the 81137 zip
code live in the facility.
You recognize that’s a sizable number, so you want to find out if age has an effect on a patient’s likelihood to
attend a follow-up dental appointment. You analyze the data, and your analysis reveals that older people tend to
miss follow-ups more than younger people.
So, you do some research online and discover that people over the age 60 are 50% more likely to miss dentist
appointments. Sometimes this is because they’re on a fixed income. Also, many senior citizens lack
https://www.coursera.org/learn/foundations-data/exam/NGYrw/course-challenge/attempt 1/1