0% found this document useful (0 votes)
7 views

Introduction To Data Analytics

Uploaded by

Delviano Prakoso
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

Introduction To Data Analytics

Uploaded by

Delviano Prakoso
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 46

#RintisKarirImpian

Intro to Data Analytics


Achmad Rozie
Data Analyst at Flip
Wednesday, 11th January 2022
Outline
Data Analytics

Steps in Data Analysis

#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
Data Analytics

#RintisKarirImpian
Data Analytics
(?)

#RintisKarirImpian
#RintisKarirImpian
Simple definition
Process of examining raw data to extract useful information

#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
Steps in Data Analysis
❏ Define problems

❏ Confirm the analysis point first! with your business user ofc.

❏ Find the data

❏ Preprocess the data

❏ Do the analysis

❏ Interpret the result

#RintisKarirImpian
Define Problems

#RintisKarirImpian
What is problem?

#RintisKarirImpian
Problem = Gap

#RintisKarirImpian
Example
❏ You are the manager of a cafe. The sales on Q4 2022 is decreased if compared
with with Q3 2022. The sales is only achieved IDR 100 Mio while the target is
150 Mio.

Ideal Condition:
Sales of cafe is achieved by
IDR 150 Mio on end Q4

Gap:
IDR 50 Mio
Existing Condition:
Sales of cafe is achieved by
IDR 100 Mio on end Q4

#RintisKarirImpian
Remember!

● Good gap / problem is able to be quantified.

● If you can’t measure it, you can’t improve it!

● How if the problem is subjective?


○ try to look the benchmark,
○ or set your own threshold!

● Redefine the problem with your own stakeholder!

#RintisKarirImpian
Define Problems
❏ Related to “why do you do this analysis”?

❏ Refine and realign the problems with all stakeholders

❏ Cutting as much assumptions as possible

❏ Stating the hypotheses

❏ Setting the priority and urgency

#RintisKarirImpian
What are problem statements of these cases?

❏ You are the director of a hospital. Lately you feel that the hospital is more
crowded than usual and the rooms are unusually full, but the number of patients
seems steady from the record. What problem will you need to address to your
analysts?

❏ You are the manager of a cafe. The sales of coffee is decreased 20% if
compared with last month. What problem will you address to your analysts?

#RintisKarirImpian
What metrics can you propose to these cases?

❏ How to rank best students in the class?

❏ How to categorize consumers loyalty?

❏ How to measure our delivery speed?

❏ How to know the users satisfaction?

#RintisKarirImpian
Confirm your analysis point first!

#RintisKarirImpian
Why?

In order to minimize your rework.


Remember, most of analysis is never ending process, we
need to minimize it.

#RintisKarirImpian
How? we can use / optimize through analysis framework!

After you got the problem (gap):


1. Define your analysis point. It can be:
a. Hypotheses,
b. Point that may be affected
c. Factor / event that may cause the problem

2. Discuss with business user


confirm is it enough or is there anything else that not inserted on your
analysis framework

3. Also confirm the constraints


It can be the:
a. the data should we used
b. The time period
c. the condition

#RintisKarirImpian
Analysis Framework Component
Analysis Framework that usually I used is contained of 8 steps problems and problem tree. I
usually divide to 3 things. Problem Setting, Problem Breakdown, Root Cause analysis

1. Problem Setting 3. Root cause Analysis / Problem Tree

2. Problem Breakdown (Optional)

#RintisKarirImpian
Example of Analysis Framework (1) - easier to be used
Problem Setting Root cause Analysis / Problem Tree

Ideal Condition
The target sales of Hypotheses 1: There is out of stock
Maxue, branch of Ice at the prime time (material)
Pulogebang is IDR 3
Billion rupiah
Gap
There is IDR 1 billion gap Hypotheses 2 : The online customer
that make this branch is
not reached the target
is slowly response (man, method)
Actual Condition
The sales of Maxue,
branch Pulogebang is Analysis
achieved only IDR 2
Billion rupiah
Hypotheses 3 : Customer receive
bad treatment from the employee
(method)

Hypotheses 4 : Our food quality is


bad (material)
Example of Analysis Framework (2) - more targeted
Problem Setting Root cause Analysis / Problem Tree

Ideal Condition
The target sales of Hypotheses 1: There is out of stock
Maxue, branch of Ice at the prime time (material)
Pulogebang is IDR 3
Billion rupiah
Gap
There is IDR 1 billion gap Hypotheses 2 : The online customer
that make this branch is
not reached the target
is slowly response (man, method)
Actual Condition
The sales of Maxue,
branch Pulogebang is Analysis
achieved only IDR 2
Billion rupiah
Hypotheses 3 : Customer receive
bad treatment from the employee
(method)
2.Food Beverage
Problem Breakdown (Optional)
The food is IDR 200 Mio The food is IDR 800 Mio
below the target below the target Hypotheses 4 : Our food quality of
ice cream is bad (material)
Juice & Latte Ice Cream
#RintisKarirImpian
This cluster is achieved 150
mio below the target
This cluster is achieved IDR
550 mio below the target
Additional tips for “confirm your analysis point first!”

1. At the beginning, usually i write it on a piece of


paper / tab first!
→ help to organize what we want to look for

2. Don’t make it too complex!


→ because the analysis will come and come continuously, we must
finish it quickly (of course still within our work pace)

#RintisKarirImpian
Continue your previous problem definition (5 mins)
At least 3 hypotheses

❏ You are the director of a hospital. Lately you feel that the hospital is more
crowded than usual and the rooms are unusually full, but the number of patients
seems steady from the record. What problem will you need to address to your
analysts?

❏ You are the manager of a cafe. The sales of coffee is decreased 20% if
compared with last month. What problem will you address to your analysts?

#RintisKarirImpian
Find the Data

#RintisKarirImpian
External
Data
Internal Data

❏ Google analytics
❏ Competitors data
❏ Vendors data

❏ App tracking Open


❏ Business logs
❏ Customers data Data
❏ Public holidays
❏ Postal codes
❏ Landmark locations

#RintisKarirImpian
This is what we do, daily …

Raw sources Analytics databases Analytics tools


(Big Query, Google
Sheets, Python, Redash,
Looker, or even your
note)

#RintisKarirImpian
Common issues

❏ Are the data available?

❏ Are the data ready to use?

❏ Are the data reliable?

❏ Are the data restricted?

❏ Are the data condition is right? (underrated but important)

❏ etc

#RintisKarirImpian
Preprocess the Data

#RintisKarirImpian
#RintisKarirImpian
Data preprocessing

❏ Remove duplicates

❏ Handle anomalies and dirty data

❏ Take action on missing data

❏ Standardize the format and types

#RintisKarirImpian
Do the analysis

#RintisKarirImpian
#RintisKarirImpian
SQL based Spreadsheet based Coding based

#RintisKarirImpian
Common statistics in data analysis

❏ Count
❏ Count distinct
❏ Sum
❏ Min
❏ Max
❏ Average
❏ Median
❏ Percentile
❏ etc

#RintisKarirImpian
Interpret the Result

#RintisKarirImpian
Problem = Gap. Know the gap first!

#RintisKarirImpian
Problem = Gap. Know the gap first!

#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian

You might also like