Introduction To Data Analytics
Introduction To Data Analytics
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
Data Analytics
#RintisKarirImpian
Data Analytics
(?)
#RintisKarirImpian
#RintisKarirImpian
Simple definition
Process of examining raw data to extract useful information
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
Steps in Data Analysis
❏ Define problems
❏ Confirm the analysis point first! with your business user ofc.
❏ Do the analysis
#RintisKarirImpian
Define Problems
#RintisKarirImpian
What is problem?
#RintisKarirImpian
Problem = Gap
#RintisKarirImpian
Example
❏ You are the manager of a cafe. The sales on Q4 2022 is decreased if compared
with with Q3 2022. The sales is only achieved IDR 100 Mio while the target is
150 Mio.
Ideal Condition:
Sales of cafe is achieved by
IDR 150 Mio on end Q4
Gap:
IDR 50 Mio
Existing Condition:
Sales of cafe is achieved by
IDR 100 Mio on end Q4
#RintisKarirImpian
Remember!
#RintisKarirImpian
Define Problems
❏ Related to “why do you do this analysis”?
#RintisKarirImpian
What are problem statements of these cases?
❏ You are the director of a hospital. Lately you feel that the hospital is more
crowded than usual and the rooms are unusually full, but the number of patients
seems steady from the record. What problem will you need to address to your
analysts?
❏ You are the manager of a cafe. The sales of coffee is decreased 20% if
compared with last month. What problem will you address to your analysts?
#RintisKarirImpian
What metrics can you propose to these cases?
#RintisKarirImpian
Confirm your analysis point first!
#RintisKarirImpian
Why?
#RintisKarirImpian
How? we can use / optimize through analysis framework!
#RintisKarirImpian
Analysis Framework Component
Analysis Framework that usually I used is contained of 8 steps problems and problem tree. I
usually divide to 3 things. Problem Setting, Problem Breakdown, Root Cause analysis
#RintisKarirImpian
Example of Analysis Framework (1) - easier to be used
Problem Setting Root cause Analysis / Problem Tree
Ideal Condition
The target sales of Hypotheses 1: There is out of stock
Maxue, branch of Ice at the prime time (material)
Pulogebang is IDR 3
Billion rupiah
Gap
There is IDR 1 billion gap Hypotheses 2 : The online customer
that make this branch is
not reached the target
is slowly response (man, method)
Actual Condition
The sales of Maxue,
branch Pulogebang is Analysis
achieved only IDR 2
Billion rupiah
Hypotheses 3 : Customer receive
bad treatment from the employee
(method)
Ideal Condition
The target sales of Hypotheses 1: There is out of stock
Maxue, branch of Ice at the prime time (material)
Pulogebang is IDR 3
Billion rupiah
Gap
There is IDR 1 billion gap Hypotheses 2 : The online customer
that make this branch is
not reached the target
is slowly response (man, method)
Actual Condition
The sales of Maxue,
branch Pulogebang is Analysis
achieved only IDR 2
Billion rupiah
Hypotheses 3 : Customer receive
bad treatment from the employee
(method)
2.Food Beverage
Problem Breakdown (Optional)
The food is IDR 200 Mio The food is IDR 800 Mio
below the target below the target Hypotheses 4 : Our food quality of
ice cream is bad (material)
Juice & Latte Ice Cream
#RintisKarirImpian
This cluster is achieved 150
mio below the target
This cluster is achieved IDR
550 mio below the target
Additional tips for “confirm your analysis point first!”
#RintisKarirImpian
Continue your previous problem definition (5 mins)
At least 3 hypotheses
❏ You are the director of a hospital. Lately you feel that the hospital is more
crowded than usual and the rooms are unusually full, but the number of patients
seems steady from the record. What problem will you need to address to your
analysts?
❏ You are the manager of a cafe. The sales of coffee is decreased 20% if
compared with last month. What problem will you address to your analysts?
#RintisKarirImpian
Find the Data
#RintisKarirImpian
External
Data
Internal Data
❏ Google analytics
❏ Competitors data
❏ Vendors data
#RintisKarirImpian
This is what we do, daily …
#RintisKarirImpian
Common issues
❏ etc
#RintisKarirImpian
Preprocess the Data
#RintisKarirImpian
#RintisKarirImpian
Data preprocessing
❏ Remove duplicates
#RintisKarirImpian
Do the analysis
#RintisKarirImpian
#RintisKarirImpian
SQL based Spreadsheet based Coding based
#RintisKarirImpian
Common statistics in data analysis
❏ Count
❏ Count distinct
❏ Sum
❏ Min
❏ Max
❏ Average
❏ Median
❏ Percentile
❏ etc
#RintisKarirImpian
Interpret the Result
#RintisKarirImpian
Problem = Gap. Know the gap first!
#RintisKarirImpian
Problem = Gap. Know the gap first!
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian
#RintisKarirImpian