DATA01 - Data Science Primer
DATA01 - Data Science Primer
Big Data
Because of internet and connectivity,
we generate so much data now!
V’s of Big Data
Volume Variety Velocity
https://blog.unbelievable-machine.com/en/what-is-big-data-definition-five-vs
V’s of Big Data
Veracity Value
https://blog.unbelievable-machine.com/en/what-is-big-data-definition-five-vs
Accelerating Factors of DS
Data storage is
cheaper
Accelerating Factors of DS
Estimated by Statista
Accelerating Factors of DS
Inferential Is the daily change of cases in NCR the same as the change of cases in
Region 4?”
Predictive How many COVID cases will happen in the next month?
Causal If mask wearing was not implemented in the Philippines, how will it affect
the number of cases in the country?
Descriptiv
e
Explorator
y
Inferentia
l
Causal
Predictive
The “trend” of commute time is going up.
Mechanist
ic
Types of Analysis
Descripti
ve
Explorato
ry
Inferenti
al
Causal
Predictiv
e
What is the correlation of height and weight? People who
Mechanis are taller are generally observed to be heavier on average.
tic
Types of Analysis
Descriptiv
e
Explorato
ry
Inferentia
l
Causal
Predictive
What is the GDP Next year?
Mechanist
ic
Types of Analysis
Descriptiv
e
Explorato
ry
Inferentia
l
Causal
Predictive
Mechanist Are Males paid more on average as compared to Females?
Is gender gap real? Whats the difference between the two
ic groups?
Types of Analysis
Descriptiv
e
Explorator
y
Inferential
Causal
Predictive
Mechanisti
c What is the total number of covid patients?
Pipeline and Roles
who does what?
Analytics
Analytics Manager
Data Steward Data Data
Engineer Scientist
data governance
data privacy data data analytics
data security infrastructure data mining
data quality
data ETL machine learning
(extract, transform,
load) statistical
data management modeling
data warehousing
Functional Analyst Analytics Manager