Course 2 Google
Course 2 Google
MODULE 1
Course 2 content
Each course is broken into modules. Here’s a quick overview of the skills you’ll gain in each of
the four Course 2 modules.
Data analysts are constantly asking questions in order to find solutions and identify business
potential. In this part of the course, you’ll learn about effective questioning techniques that will
help guide your analysis.
In analytics, data drives decision-making, and this is your opportunity to explore data of all kinds
and its impact on all sorts of business decisions. You’ll also learn how to effectively share your
data through reports and dashboards.
Spreadsheets are a key data analytics tool. Here you’ll learn both why and how data analysts use
spreadsheets in their work. You’ll also investigate how structured thinking helps analysts
understand problems and come up with solutions.
Successful data analysts balance the needs and expectations of their team and the stakeholders they
support. In this part of the course, you’ll learn strategies for managing stakeholder expectations
while establishing clear communication with your team.
Terms and definitions for Course 2, Module 1
Key takeaways
If you’re already familiar with the data analytics concepts and skills presented in the Google Data
Analytics Certificate, you may wish to proceed directly to our more advanced programs:
Google Advanced Data Analytics Certificate
If you’re not sure which program to take, feel free to explore each of them. You can return to the
Google Data Analytics Certificate program at any time if you decide it’s the best starting point for
you!
From issue to action: the six data analysis phases
There are six data analysis phases that will help you make seamless decisions: ask, prepare,
process, analyze, share, and act. Keep in mind, these are different from the data life cycle, which
describes the changes data goes through over its lifetime. Going through the steps will help you
solve all kinds of business problems that you might face on the job.
Step 1: Ask
It’s impossible to solve a problem if you don’t know what it is. These are some things to consider:
Define the problem you’re trying to solve
Make sure you fully understand the stakeholder’s expectations
Focus on the actual problem and avoid any distractions
Collaborate with stakeholders and keep an open line of communication
Take a step back and see the whole situation in context
Step 2: Prepare
You will decide what data you need to collect in order to answer your questions and how to
organize it so that it is useful. You might use your business task to decide:
What metrics to measure
Locate data in your database
Create security measures to protect that data
Step 3: Process
Clean data is the best data and you will need to clean up your data to get rid of any possible errors,
inaccuracies, or inconsistencies. This might mean:
Using spreadsheet functions to find incorrectly entered data
Using SQL functions to check for extra spaces
Removing repeated entries
Checking as much as possible for bias in the data
Questions to ask yourself in this step:
What data errors or inaccuracies might get in my way of getting the best possible answer
to the problem I am trying to solve?
How can I clean my data so the information I have is more consistent?
Step 4: Analyze
You will want to think analytically about your data. At this stage, you might sort and format your
data to make it easier to:
Perform calculations
Combine data from multiple sources
Create tables with your results
Step 5: Share
Everyone shares their results differently so be sure to summarize your results with clear and
enticing visuals of your analysis using data via tools like graphs or dashboards. This is your chance
to show the stakeholders you have solved their problem and how you got there. Sharing will
certainly help your team:
Make better decisions
Make more informed decisions
Lead to stronger outcomes
Successfully communicate your findings
Step 6: Act
Now it’s time to act on your data. You will take everything you have learned from your data
analysis and put it to use. This could mean providing your stakeholders with recommendations
based on your findings so they can make data-driven decisions.
When you are starting out in your career as a data analyst, it is normal to feel pulled in a few
different directions with your role and expectations. Following processes like the ones outlined
here and using structured thinking skills can help get you back on track, fill in any gaps and let
you know exactly what you need.
Making predictions
A company that wants to know the best advertising method to bring in new customers is an
example of a problem requiring analysts to make predictions. Analysts with data on location,
type of media, and number of new customers acquired as a result of past ads can't guarantee
future results, but they can help predict the best placement of advertising to reach the target
audience.
Categorizing things
An example of a problem requiring analysts to categorize things is a company's goal to improve
customer satisfaction. Analysts might classify customer service calls based on certain keywords
or scores. This could help identify top-performing customer service representatives or help
correlate certain actions taken with higher customer satisfaction scores.
Spotting something unusual
A company that sells smart watches that help people monitor their health would be interested in
designing their software to spot something unusual. Analysts who have analyzed aggregated
health data can help product developers determine the right algorithms to spot and set off alarms
when certain data doesn't trend normally.
Identifying themes
User experience (UX) designers might rely on analysts to analyze user interaction data. Similar
to problems that require analysts to categorize things, usability improvement projects might
require analysts to identify themes to help prioritize the right product features for improvement.
Themes are most often used to help researchers explore certain aspects of data. In a user study,
user beliefs, practices, and needs are examples of themes.
By now you might be wondering if there is a difference between categorizing things and
identifying themes. The best way to think about it is: categorizing things involves assigning
items to categories; identifying themes takes those categories a step further by grouping them
into broader themes.
Discovering connections
A third-party logistics company working with another company to get shipments delivered to
customers on time is a problem requiring analysts to discover connections. By analyzing the wait
times at shipping hubs, analysts can determine the appropriate schedule changes to increase the
number of on-time deliveries.
Finding patterns
Minimizing downtime caused by machine failure is an example of a problem requiring analysts
to find patterns in data. For example, by analyzing maintenance data, they might discover that
most failures happen if regular maintenance is delayed by more than a 15-day window.
Key takeaway
As you move through this program, you will develop a sharper eye for problems and you will
practice thinking through the problem types when you begin your analysis. This method of
problem solving will help you figure out solutions that meet the needs of all stakeholders.
More about smart questions
Companies in lots of industries today are dealing with rapid change and rising uncertainty. Even
well established businesses are under pressure to keep up with what is new and figure out what is
next. To do that, they need to ask questions. Asking the right questions can help spark the
innovative ideas that so many businesses are hungry for these days.
The same goes for data analytics. No matter how much information you have or how advanced
your tools are, your data won’t tell you much if you don’t start with the right questions. Think of
it like a detective with tons of evidence who doesn’t ask a key suspect about it. Coming up, you
will learn more about how to ask highly effective questions, along with certain practices you want
to avoid.
Specific:
Is the question specific? Does it address the problem? Does it have context? Will it uncover a lot
of the information you need?
Measurable: Will the question give you answers that you can measure?
Action-oriented: Will the answers provide information that helps you devise some type of
plan?
Relevant: Is the question about the particular problem you are trying to solve?
Time-bound: Are the answers relevant to the specific time being studied?
Closed-ended questions: questions that ask for a one-word or brief response only
END MODULE 1
MODULE 2
Introduction
A data analytics professional’s job is to provide the data necessary to inform key decisions. They
also need to frame their analysis in a way that helps business leaders make the best possible
decisions. In this reading, you’re going to explore the role of data in decision-making and the
reasons why data analytics professionals are so important to this process. You’ll compare data-
driven and data-inspired decisions to understand the difference between them. You’ll also check
out some examples where projects failed or succeeded based on how the data was applied.
Both data-driven and data-inspired approaches are rooted in the idea that data is inherently valuable
for making a decision. Well-curated data can provide information to decision-makers that
improves the quality of their decisions. Remember: Data does not make decisions, but it does
improve them.
Data-driven decisions
As you’ve been learning, data-driven decision-making means using facts to guide business
strategy. The phrase “data-driven decisions” means exactly that: Data is used to arrive at a
decision. This approach is limited by the quantity and quality of readily-available data. If the
quality and quantity of the data is sufficient, this approach can far improve decision-making. But
if the data is insufficient or biased, this can create problems for decision-makers. Potential dangers
of relying entirely on data-driven decision-making can include overreliance on historical data, a
tendency to ignore qualitative insights, and potential biases in data collection and analysis
Data-inspired decisions
Data-inspired decisions include the same considerations as data-driven decisions while adding
another layer of complexity. They create space for people using data to consider a broader range
of ideas: drawing on comparisons to related concepts, giving weight to feelings and experiences,
and considering other qualities that may be more difficult to measure. Data-inspired decision-
making can avoid some of the pitfalls that data-driven decisions might be prone to.
Example of a data-inspired decision
A customer support center gathers customer satisfaction data (often known as a “CSAT” score).
They use a simple 1–10 score along with a qualitative description in which the customer describes
their experience. The customer support center manager wants to improve customer experience, so
they set a goal to improve the CSAT score. They start by analyzing the CSAT scores and reading
each of the descriptions from the customers. Additionally, they interview the people working in
the customer support center. From there, the manager formulates a strategy and decides what needs
to improve the most in order to raise customer satisfaction scores. While the manager certainly
relies on the CSAT data in the decision-making process, input of support center representatives
and other qualitative information informs the approach as well.
When data is used strategically, businesses can transform and grow their revenue. Consider the
example below.
PepsiCo
Since the days of the New Coke launch, things have changed dramatically for beverage and other
consumer packaged goods (CPG) companies.
According to a Think with Google article by Shyam Venugopal, PepsiCo “hired analytical talent
and established cross-functional workflows around an infrastructure designed to put consumers’
needs first. Then [the company] set up the right processes to make critical decisions based on data
and technology use cases. Finally, [it] invested in the right technology stack and platforms so that
data could flow into a central cloud-based hub. This is critical. When data comes together, we
develop a holistic understanding of the consumer and their journeys."
In this data-inspired decision, PepsiCo is not just using its own set of data, but also employing
external sources to supplement its datasets and expand its market reach. Learn about how PepsiCo
is delivering a more personal and valuable experience to customers using data in How one of the
world’s biggest marketers ripped up its playbook and learned to anticipate intent.
Key takeaways
As a data analyst, you’ll rarely need to consider, “Am I being data-driven or data-inspired?” It’s
helpful to have some context for these two approaches, though your own skills and knowledge will
be the most important parts of any analysis project. So, keep a data-driven mindset and ask lots of
questions. Experiment with many different possibilities. And use both logic and creativity along
the way. Using this approach, you’ll be prepared to interpret your data with the highest levels of
care and accuracy.
Qualitative and quantitative data in business
Key takeaways
Data analysts will generally use both types of data in their work. Usually, qualitative data can help
analysts better understand their quantitative data by providing a reason or more thorough
explanation. In other words, quantitative data generally gives you the what, and qualitative data
generally gives you the why. By using both quantitative and qualitative data, you can learn when
people like to go to the movies and why they chose the theater. Maybe they really like the reclining
chairs, so your manager can purchase more recliners. Maybe the theater is the only one that serves
root beer. Maybe a later show time gives them more time to drive to the theater from where popular
restaurants are located. Maybe they go to matinees because they have kids and want to save money.
You wouldn’t have discovered this information by analyzing only the quantitative data for
attendance, profit, and showtimes.
In this course, you’ll work with Tableau and spreadsheets. Both of these tools have advantages
and disadvantages Often, data analysts will discover they need to use multiple tools, even on a
single project. What you use will largely be determined by the work you’re doing and your goals.
This reading explores two of the tools you might use to visualize and present data: spreadsheets
and Tableau.
Spreadsheets
Google Workspace and Microsoft Office Suite both offer spreadsheet applications. You’ve worked
with Google Sheets in this course, and it’s very similar in function to Microsoft Excel. If you want
to compare some of the features of Sheets to Excel, check out the Microsoft video Create a chart
from start to finish.
Both Sheets and Excel are go-to choices for creating static charts and graphs. They offer basic
data visualization capabilities that are often enough for simple visualizations. In addition, you can
use them to clean, sort, and filter data. And both offer a range of chart types, graphing tools, and
pivot tables for creating effective data visualizations. These charts are easy to manage; they update
when the source data is updated, so they don’t require much manual intervention once
implemented.
Sheets and Excel are connected to other apps in their product suites. Google Docs and Slides are
very similar to Microsoft Word and PowerPoint, for example. You can incorporate data
visualizations from Sheets or Excel into reports and documents in Docs and Word. Presentation
programs such as Slides and PowerPoint allow you to create engaging presentations that include
data visualizations so you can share insights in a presentation format. Learn more about the power
of this interconnectivity among Google tools in the article Link a chart, table, or slides to Google
Docs or Slides.
Tableau
Tableau is used to create powerful and interactive visualizations, making it an excellent choice
for data visualizations such as live dashboards. Tableau also makes it easy to create charts, graphs,
and dashboards in a drag-and-drop interface. The application supports a wide range of data sources
and provides advanced analytics capabilities. These features allow for in-depth exploration of data
trends and patterns.
Tableau is particularly useful for creating visualizations using huge datasets, like in this World
Happiness Report by Sustainable Development Solutions which uses global reporting data on
different countries' happiness ratings. Likewise, this visualization of Population and Housing State
Data from 2020 United States Census Data compares population rates in the United States and
available housing.
Tableau is widely known and used for its versatility and power, but it can take quite a bit of time
to learn to use Tableau effectively. Soon, you’ll begin practicing with Tableau. But if you’d like
to check it out now, there is a free environment you can access at Tableau Public.
Key takeaways
There are many visualization tools you will have the opportunity to use as a data professional.
Different tools have different advantages and disadvantages. Although Tableau ultimately has
more power than a basic spreadsheet application, it’s most often used for specific cases and to
work with large datasets. Don’t underestimate how much you can do with spreadsheets or how
powerful interconnectivity between apps can be!
Most of the time, especially for something like a quick report; you’re more likely to reach into
your toolkit for your spreadsheet app of choice. But your data career will definitely benefit from
Tableau, so as you progress take advantage of opportunities to work with it. With so many different
data analysis situations, familiarity with all of these tools will help you know which is the best for
each situation.
Design compelling dashboards
Dashboards are powerful visual tools that help you tell your data story. A dashboard is a tool that
monitors live, incoming data. It organizes information from multiple datasets into one central
location, offering huge time savings. Data analysts use dashboards to track, analyze, and
visualize data in order to answer questions and solve problems. For a basic idea of what
dashboards look like, refer to this article: “Real-world examples of business intelligence
dashboards.”
The following table summarizes the benefits of using a dashboard for both data analysts and
their stakeholders.
It’s important to remember that changed data is pulled into dashboards automatically only if the
data structure is the same. If the data structure changes, you have to update the dashboard design
before the data can update live.
Tableau
There are many different visualization tools available. One of the most powerful is Tableau, which
supports a range of data sources and has advanced analytics capabilities that allow for in-depth
exploration of data trends and patterns. Tableau can handle more data and larger datasets than
many other tools and offers real-time data availability.
It does take some time to learn to use Tableau, but your efforts can be well-rewarded, as Tableau
visualizations are pleasantly interactive. For a dashboard to be successful, it needs to engage users
and help them learn. Tableau has put in a lot of effort to ensure that its users have a great experience
and the platform is accessible to everyone.
Create a dashboard
Here’s a process you can follow to create a dashboard, whether in Tableau or another visualization
tool:
1. Identify the stakeholders who need to see the data and how they will use it
Begin by asking effective questions. Check out this dashboard requirements gathering worksheet
to explore a wide range of good questions you can use to identify relevant stakeholders and their
data needs. This is a great resource to help guide you through this process again and again.
Use these tips to help make your dashboard design clear and easy to follow:
A mockup is a simple draft of a visualization used for planning a dashboard and evaluating its
progress. This is optional, but a lot of data analysts like to sketch out their dashboards before
creating them.
You have a lot of options here. Which visualizations you select depends on the data story you are
telling. If you need to show a change in values over time, line charts or bar graphs might be the
best choice. If your goal is to show how each part contributes to the whole amount being reported,
a pie or donut chart is probably a better choice.
To learn more about choosing the right visualizations, check out Tableau’s galleries:
For more samples of area charts, column charts, and other visualizations, visit the Tableau
Dashboard Showcase. This gallery is full of great examples that were created using real data;
explore this resource on your own to get some inspiration.
Explore Tableau’s Viz of the Day to check out visualizations curated by the community. These are
visualizations created by Tableau users and are a great way to learn more about how other data
analysts are using data visualization tools.
Filters show certain data while hiding the rest of the data in a dashboard. This can be a big help to
identify patterns while keeping the original data intact. It’s common for data analysts to use and
share the same dashboard, but manage their part of it with a filter. To dig deeper into filters and
find an example of filters in action, visit Tableau’s page on Filter Actions. This is a useful resource
to save and come back to when you start practicing using filters in Tableau on your own.
Key takeaways
Just like how the dashboard on an airplane shows the pilot their flight path, your dashboard does
the same for your stakeholders. It helps them navigate the path of a project inside the data. If you
add clear markers and highlight important points on your dashboard, users will understand where
your data story is headed. Then, you can work together to make sure the business gets where it
needs to go.
Big and small data
As a data analyst, you will work with data both big and small. Both kinds of data are valuable,
but they play very different roles.
Whether you work with big or small data, you can use it to help stakeholders improve business
processes, answer questions, create new products, and much more. But there are certain
challenges and benefits that come with big data and the following table explores the differences
between big and small data.
Simple to collect, store, manage, sort, and Takes a lot of effort to collect, store, manage,
visually represent sort, and visually represent
Usually already a manageable size for Usually needs to be broken into smaller
analysis pieces in order to be organized and analyzed
effectively for decision-making
The amount of data The different kinds of How fast the data can The quality and
data be processed reliability of the data
END MODULE 2
MODULE 3
AVERAGE: A spreadsheet function that returns an average of the values from a selected
range
Borders: Lines that can be added around two or more cells on a spreadsheet
Cell reference: A cell or a range of cells in a worksheet typically used in formulas and
functions
COUNT: A spreadsheet function that counts the number of cells in a range that meet a
specific criteria
Equation: A calculation that involves addition, subtraction, multiplication, or division (also
called a math expression)
Fill handle: A box in the lower-right-hand corner of a selected spreadsheet cell that can be
dragged through neighboring cells in order to continue an instruction
Filtering: The process of showing only the data that meets a specified criteria while hiding
the rest
Header: The first row in a spreadsheet that labels the type of data in each column
Math expression: A calculation that involves addition, subtraction, multiplication, or
division (also called an equation)
Math function: A function that is used as part of a mathematical formula
MAX: A spreadsheet function that returns the largest numeric value from a range of cells
MIN: A spreadsheet function that returns the smallest numeric value from a range of cells
Open data: Data that is available to the public
Operator: A symbol that names the operation or calculation to be performed
Order of operations: Using parentheses to group together spreadsheet values in order to
clarify the order in which operations should be performed
Problem domain: The area of analysis that encompasses every activity affecting or affected
by a problem
Range: A collection of two or more cells in a spreadsheet
Report: A static collection of data periodically given to stakeholders
Return on investment (ROI): A formula that uses the metrics of investment and profit to
evaluate the success of an investment
Revenue: The total amount of income generated by the sale of goods or services
Scope of work (SOW): An agreed-upon outline of the tasks to be performed during a
project
Sorting: The process of arranging data into a meaningful order to make it easier to
understand, analyze, and visualize
SUM: A spreadsheet function that adds the values of a selected range of cells
Spreadsheets and the data life cycle
To better understand the benefits of using spreadsheets in data analytics, let’s explore how they
relate to each phase of the data life cycle: plan, capture, manage, analyze, archive, and destroy.
Plan for the users who will work within a spreadsheet by developing organizational standards.
This can mean formatting your cells, the headings you choose to highlight, the color scheme, and
the way you order your data points. When you take the time to set these standards, you will improve
communication, ensure consistency, and help people be more efficient with their time.
Capture data by the source by connecting spreadsheets to other data sources, such as an online
survey application or a database. This data will automatically be updated in the spreadsheet. That
way, the information is always as current and accurate as possible.
Manage different kinds of data with a spreadsheet. This can involve storing, organizing, filtering,
and updating information. Spreadsheets also let you decide who can access the data, how the
information is shared, and how to keep your data safe and secure.
Analyze data in a spreadsheet to help make better decisions. Some of the most common
spreadsheet analysis tools include formulas to aggregate data or create reports, and pivot tables for
clear, easy-to-understand visuals.
Archive any spreadsheet that you don’t use often, but might need to reference later with built-in
tools. This is especially useful if you want to store historical data before it gets updated.
Destroy your spreadsheet when you are certain that you will never need it again, if you have
better backup copies, or for legal or security reasons. Keep in mind, lots of businesses are required
to follow certain rules or have measures in place to make sure data is destroyed properly.
Now that you have learned some basic ways to avoid errors, you can focus on what to do when
that dreaded pop-up does appear. The following table is a reference you can use to look up
common spreadsheet errors and examples of each. Knowing what the errors mean takes some of
the fear out of getting them.
Context in data analytics is the condition and circumstances that surround and give meaning to the
data. Context is important in data analytics because it helps make disorganized data accessible and
understood. The fact is, data has little value if it is not paired with context.
Understanding the context behind the data can help us make it more meaningful at every stage of
the data analysis process. For example, you might be able to make a few guesses about what you're
looking at in the following table, but you couldn't be certain without more context.
2010 28000
2005 18000
2000 23000
1995 10000
On the other hand, if the first column was labeled to represent the years when a survey was
conducted, and the second column showed the number of people who responded to that survey,
then the table would start to make a lot more sense. Take this a step further, and you might notice
that the survey is conducted every 5 years. This added context helps you understand why there are
five-year gaps in the table.
2010 28000
2005 18000
2000 23000
1995 10000
Context can turn raw data into meaningful information. It is very important for data analysts
to contextualize their data. This means giving the data perspective by defining it. To do this, you
need to identify:
Who: The person or organization that created, collected, and/or funded the data collection
What: The things in the world that data could have an impact on
Understanding and including the context is important during each step of your analysis process,
so it is a good idea to get comfortable with it early in your career. For example, when you collect
data, you’ll also want to ask questions about the context to make sure that you understand the
business and business process. During organization, the context is important for your naming
conventions, how you choose to show relationships between variables, and what you choose to
keep or leave out. And finally, when you present, it is important to include contextual information
so that your stakeholders understand your analysis.
END MODULE 3
MODULE 4
Cloud: A place to keep data online, rather than a computer hard drive
Reframing: Restating a problem or challenge, then redirecting it toward a potential
resolution
Turnover rate: The rate at which employees voluntarily leave a company
Your data analysis project should answer the business task and create opportunities for data-driven
decision-making. That's why it is so important to focus on project stakeholders. As a data analyst,
it is your responsibility to understand and manage your stakeholders’ expectations while keeping
the project goals front and center.
You might remember that stakeholders are people who have invested time, interest, and resources
into the projects that you are working on. This can be a pretty broad group, and your project
stakeholders may change from project to project. But there are three common stakeholder groups
that you might find yourself working with: the executive team, the customer-facing team, and the
data science team.
Let’s get to know more about the different stakeholders and their goals. Then we'll learn some
tips for communicating with them effectively.
Executive team
The executive team provides strategic and operational leadership to the company. They set goals,
develop strategy, and make sure that strategy is executed effectively. The executive team might
include vice presidents, the chief marketing officer, and senior-level professionals who help plan
and direct the company’s work. These stakeholders think about decisions at a very high level and
they are looking for the headline news about your project first. They are less interested in the
details. Time is very limited with them, so make the most of it by leading your presentations with
the answers to their questions. You can keep the more detailed information handy in your
presentation appendix or your project documentation for them to dig into when they have more
time.
For example, you might find yourself working with the vice president of human resources on an
analysis project to understand the rate of employee absences. A marketing director might look to
you for competitive analyses. Part of your job will be balancing what information they will need
to make informed decisions with their busy schedule.
But you don’t have to tackle that by yourself. Your project manager will be overseeing the progress
of the entire team, and you will be giving them more regular updates than someone like the vice
president of HR. They are able to give you what you need to move forward on a project, including
getting approvals from the busy executive team. Working closely with your project manager can
help you pinpoint the needs of the executive stakeholders for your project, so don’t be afraid to
ask them for guidance.
Customer-facing team
The customer-facing team includes anyone in an organization who has some level of interaction
with customers and potential customers. Typically they compile information, set expectations, and
communicate customer feedback to other parts of the internal organization. These stakeholders
have their own objectives and may come to you with specific asks. It is important to let the data
tell the story and not be swayed by asks from your stakeholders to find certain patterns that might
not exist.
Let’s say a customer-facing team is working with you to build a new version of a company’s most
popular product. Part of your work might involve collecting and sharing data about consumers’
buying behavior to help inform product features. Here, you want to be sure that your analysis and
presentation focuses on what is actually in the data-- not on what your stakeholders hope to find.
Discuss goals. Stakeholder requests are often tied to a bigger project or goal. When they ask you
for something, take the opportunity to learn more. Start a discussion. Ask about the kind of results
the stakeholder wants. Sometimes, a quick chat about goals can help set expectations and plan the
next steps.
Feel empowered to say “no.” Let’s say you are approached by a marketing director who has a
“high-priority” project and needs data to back up their hypothesis. They ask you to produce the
analysis and charts for a presentation by tomorrow morning. Maybe you realize their hypothesis
isn’t fully formed and you have helpful ideas about a better way to approach the analysis. Or maybe
you realize it will take more time and effort to perform the analysis than estimated. Whatever the
case may be, don’t be afraid to push back when you need to.
Stakeholders don’t always realize the time and effort that goes into collecting and analyzing data.
They also might not know what they actually need. You can help stakeholders by asking about
their goals and determining whether you can deliver what they need. If you can’t, have the
confidence to say “no,” and provide a respectful explanation. If there’s an option that would be
more helpful, point the stakeholder toward those resources. If you find that you need to prioritize
other projects first, discuss what you can prioritize and when. When your stakeholders understand
what needs to be done and what can be accomplished in a given timeline, they will usually be
comfortable resetting their expectations. You should feel empowered to say no-- just remember to
give context so others understand why.
Plan for the unexpected. Before you start a project, make a list of potential roadblocks. Then,
when you discuss project expectations and timelines with your stakeholders, give yourself some
extra time for problem-solving at each stage of the process.
Know your project. Keep track of your discussions about the project over email or reports, and
be ready to answer questions about how certain aspects are important for your organization. Get
to know how your project connects to the rest of the company and get involved in providing the
most insight possible. If you have a good understanding about why you are doing an analysis, it
can help you connect your work with other goals and be more effective at solving larger problems.
Start with words and visuals. It is common for data analysts and stakeholders to interpret things
in different ways while assuming the other is on the same page. This illusion of agreement* has
been historically identified as a cause of projects going back-and-forth a number of times before a
direction is finally nailed down. To help avoid this, start with a description and a quick visual of
what you are trying to convey. Stakeholders have many points of view and may prefer to absorb
information in words or pictures. Work with them to make changes and improvements from there.
The faster everyone agrees, the faster you can perform the first analysis to test the usefulness of
the project, measure the feedback, learn from the data, and implement changes.
Communicate often. Your stakeholders will want regular updates on your projects. Share notes
about project milestones, setbacks, and changes. Then use your notes to create a shareable report.
Another great resource to use is a change-log, which is a tool that will be explored further
throughout the program. For now, just know that a change-log is a file containing a chronologically
ordered list of modifications made to a project. Depending on the way you set it up, stakeholders
can even pop in and view updates whenever they want.
Being able to communicate in multiple formats is a key skill for data analysts. Listening, speaking,
presenting, and writing skills will help you succeed in your projects and in your career. This
reading covers effective communication strategies, including examples of clearly worded emails
for common situations.
Here's an important first tip: Know your audience! When you communicate your analysis and
recommendations as a data analyst, it's vital to keep your audience in mind.
Be sure to answer these four important questions related to your audience:
Project example
As a data analyst, you'll get plenty of requests and questions through email. Let’s walk through an
example of how you might approach answering one of these emails. Assume you're a data analyst
working at a company that develops mobile apps. Let's start by reviewing answers to the four
audience questions we just covered:
Kiri received updates about our project from its planning stages, including the most recent project
report, sent two weeks ago.
• What they need to know
Kiri needs an update on the analysis project’s progress and needs to know that the executive team
approved changes to the data and timeline. You know that adding a new variable to the analysis
will impact the current project timeline. Kiri will need to change the project’s milestones and
completion date.
You can start by sending an email update to Kiri with the latest timeline for the project, but a
meeting might be necessary if she wants to talk through her concerns about missing a deadline.
After answering the audience questions, you have the key building blocks you need to write an
email to Kiri. Here's an example of how these questions can help organize the flow of the email
message:
After receiving your email, Kiri will have a clearer view of the changes to the analysis project and
will be able to make adjustments to work with the new timeline.
After the next report is completed, you can also send out a project update offering more
information. The email could look like this:
Good communication keeps stakeholders updated on progress and ultimately helps prevent
problems. Carefully worded responses are key. Whether you gather and address feedback using
email, meetings, or reports, everyone you work with will know what to expect. As a result, they
will be able to better manage their own schedules, resources, and teams.
Limitations of a data
Data is powerful, but it has its limitations. Has someone’s personal opinion found its way into
the numbers? Is your data telling the whole story? Part of being a great data analyst is knowing
the limits of data and planning for them. This reading explores how you can do that.
If you have incomplete or nonexistent data, you might realize during an analysis that you don't
have enough data to reach a conclusion. Or, you might even be solving a different problem
altogether! For example, suppose you are looking for employees who earned a particular
certificate but discover that certification records go back only two years at your company. You
can still use the data, but you will need to make the limits of your analysis clear. You might be
able to find an alternate source of the data by contacting the company that led the training. But to
be safe, you should be up front about the incomplete dataset until that data becomes available.
If you're collecting data from other teams and using existing spreadsheets, it is good to keep in
mind that people use different business rules. So one team might define and measure things in a
completely different way than another. For example, if a metric is the total number of trainees in
a certificate program, you could have one team that counts every person who registered for the
training, and another team that counts only the people who completed the program. In cases like
these, establishing how to measure things early on standardizes the data across the board for
greater reliability and accuracy. This will make sure comparisons between teams are meaningful
and insightful.
Dirty data refers to data that contains errors. Dirty data can lead to productivity loss, unnecessary
spending, and unwise decision-making. A good data cleaning effort can help you avoid this. As a
quick reminder, data cleaning is the process of fixing or removing incorrect, corrupted,
incorrectly formatted, duplicate, or incomplete data within a dataset. When you find and fix the
errors - while tracking the changes you made - you can avoid a data disaster. You will learn how
to clean data later in the training.
Tell a clear story
Avinash Kaushik, a Digital Marketing Evangelist for Google, has lots of great tips for data
analysts in his blog: Occam's Razor. Below are some of the best practices he recommends for
good data storytelling:
Compare the same types of data: Data can get mixed up when you chart it for visualization. Be
sure to compare the same types of data and double check that any segments in your chart
definitely display different metrics.
Visualize with care: A 0.01% drop in a score can look huge if you zoom in close enough. To
make sure your audience sees the full story clearly, it is a good idea to set your Y-axis to 0.
Leave out needless graphs: If a table can show your story at a glance, stick with the table
instead of a pie chart or a graph. Your busy audience will appreciate the clarity.
Test for statistical significance: Sometimes two datasets will look different, but you will need a
way to test whether the difference is real and important. So remember to run statistical tests to
see how much confidence you can place in that difference.
Pay attention to sample size: Gather lots of data. If a sample size is small, a few unusual
responses can skew the results. If you find that you have too little data, be careful about using it
to form judgments. Look for opportunities to collect more data, then chart those trends over
longer periods.
be the judge
In any organization, a big part of a data analyst’s role is making sound judgments. When you
know the limitations of your data, you can make judgment calls that help people make better
decisions supported by the data. Data is an extremely powerful tool for decision-making, but if it
is incomplete, misaligned, or hasn’t been cleaned, then it can be misleading. Take the necessary
steps to make sure that your data is complete and consistent. Clean the data before you begin
your analysis to save yourself and possibly others a great amount of time and effort.
Lead great meetings
One day soon, you might find yourself planning a meeting in your role as a data analyst. Great
things can happen when participants anticipate a well-executed meeting. Attendees show up on
time. They aren’t distracted by their laptops and phones. They feel like their time will be well
spent. It all comes down to good planning and communication of expectations. The following are
our best practical tips for leading meetings.
Identify your objective. Establish the purpose, goals, and desired outcomes of the
meeting, including any questions or requests that need to be addressed.
Acknowledge participants and keep them involved with different points of view and
experiences with the data, the project, or the business.
Organize the data to be presented. You might need to turn raw data into accessible
formats or create data visualizations.
A solid meeting agenda sets your meeting up for success. Here are the basic parts your agenda
should include:
Here's an example of an agenda for an analysis project that is just getting started:
Sharing your agenda ahead of time
After writing your agenda, it's time to share it with the invitees. Sharing the agenda with
everyone ahead of time helps them understand the meeting goals and prepare questions,
comments, or feedback. You can email the agenda or share it using another collaboration tool.
Ask for feedback (this is an effective way to figure out if you missed anything in your
recap)
END MODULE 4