Data Analytics - Review 1
Data Analytics - Review 1
approach tasks
Overview
Earlier you learned about how data analysts at one organization used data to improve employee retention.
Now, you’ll complete an entry in your learning log to track your thinking and reflections about those data
analysts' process and how they approached this problem. By the time you complete this activity, you will
have a stronger understanding of how the six phases of the data analysis process can be used to break
down tasks and tackle big questions. This will help you apply these steps to future analysis tasks and start
tackling big questions yourself.
Before you write your entry in your learning log, reflect on the case study from earlier. The data analysts
wanted to use data to improve employee retention. In order to do that, they had to break this larger
project into manageable tasks. The analysts organized those tasks and activities around the six phases of
the data analysis process:
1. Ask
2. Prepare
3. Process
4. Analyze
5. Share
6. Act
The analysts asked questions to define both the issue to be solved and what would equal a successful
result. Next, they prepared by building a timeline and collecting data with employee surveys that were
designed to be inclusive. They processed the data by cleaning it to make sure it was complete, correct,
relevant, and free of errors and outliers. They analyzed the clean employee survey data. Then the analysts
shared their findings and recommendations with team leaders. Afterward, leadership acted on the results
and focused on improving key areas.
Analysts use data-driven decision-making and follow a step-by-step process. You have learned that there
are six steps to this process:
Consider an example of a restaurant entrepreneur, partnering with a well known chef to develop a new
restaurant in a bustling part of the city’s central shopping district. The well known chef has several
restaurants across the city. Banking on their reputation, the restaurant entrepreneur and chef followed gut
instinct and created another uniquely themed restaurant. However, fundraising efforts fell short to fund
the opening of the restaurant after months of planning and preparation. The property will go back on the
market to be sold at a loss. Had the entrepreneur done more research, they would've found data showing
prospective customers in this new restaurant location were very different from the chef's other
restaurants.
The more you understand the data related to a project, the easier it will be to figure out what is required.
These efforts will also help you identify errors and gaps in your data so you can communicate your findings
more effectively. Sometimes past experience helps you make a connection that no one else would notice.
For example, a detective might be able to crack open a case because they remember an old case just like
the one they’re solving today. It's not just gut instinct.
In addition, try asking yourself these questions about a project to help find the perfect balance:
It is time to enter the data analysis life cycle—the process of going from data to decision. Data goes
through several phases as it gets created, consumed, tested, processed, and reused. With a life cycle
model, all key team members can drive success by planning work both up front and at the end of the data
analysis process. While the data analysis life cycle is well known among experts, there isn't a single defined
structure of those phases. There might not be one single architecture that’s uniformly followed by every
data analysis expert, but there are some shared fundamentals in every data analysis process. This reading
provides an overview of several, starting with the process that forms the foundation of the Google Data
Analytics Certificate.
The process presented as part of the Google Data Analytics Certificate is one that will be valuable to you as
you keep moving forward in your career:
1. Discovery
2. Pre-processing data
3. Model planning
4. Model building
5. Communicate results
6. Operationalize
EMC Corporation is now Dell EMC. This model, created by David Dietrich, reflects the cyclical nature of real-
world projects. The phases aren’t static milestones; each step connects and leads to the next, and
eventually repeats. Key questions help analysts test whether they have accomplished enough to move
forward and ensure that teams have spent enough time on each of the phases and don’t start modeling
before the data is ready. It is a little different from the data analysis life cycle this program is based on, but
it has some core ideas in common: the first phase is interested in discovering and asking questions; data
has to be prepared before it can be analyzed and used; and then findings should be shared and acted on.
For more information, refer to this e-book, Data Science & Big Data Analytics.
1. Ask
2. Prepare
3. Explore
4. Model
5. Implement
6. Act
7. Evaluate
The SAS model emphasizes the cyclical nature of their model by visualizing it as an infinity symbol. Their
life cycle has seven steps, many of which we have seen in the other models, like Ask, Prepare, Model, and
Act. But this life cycle is also a little different; it includes a step after the act phase designed to help analysts
evaluate their solutions and potentially return to the ask phase again.
For more information, refer to Managing the Analytics Life Cycle for Decisions at Scale.
For more information, refer to Understanding the data analytics project life cycle.
For more information, refer to Big Data Adoption and Planning Considerations.
Key takeaway
From our journey to the pyramids and data in ancient Egypt to now, the way we analyze data has evolved
(and continues to do so). The data analysis process is like real life architecture, there are different ways to
do things but the same core ideas still appear in each model of the process. Whether you use the structure
of this Google Data Analytics Certificate or one of the many other iterations you have learned about, we are
here to help guide you as you continue on your data journey.