0% found this document useful (0 votes)
57 views

PSD02 - Data Science Overview

The document discusses definitions and concepts related to data science. It provides several definitions of data science from different sources that describe it as a process of extracting insights from data, validating hypotheses, uncovering trends, and translating data into stories. It also discusses some fundamentals of data science, including its data analysis component, vast data sources, computing power, and how data scientists work to solve business problems by analyzing data from many sources.

Uploaded by

Eren Yeager
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
57 views

PSD02 - Data Science Overview

The document discusses definitions and concepts related to data science. It provides several definitions of data science from different sources that describe it as a process of extracting insights from data, validating hypotheses, uncovering trends, and translating data into stories. It also discusses some fundamentals of data science, including its data analysis component, vast data sources, computing power, and how data scientists work to solve business problems by analyzing data from many sources.

Uploaded by

Eren Yeager
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 64

DATA

SCIENCE
DATA SCIENTIST
Ganjil 2020/2021
Capaian Pembelajaran Mata Kuliah
CPMK - 2

Mampu mendeskripsikan dan menjelaskan tentang sains data


• Mampu menjelaskan definisi sains data
• Mampu menjelaskan komponen-komponen utama di bidang sains
data
• Mampu menjelaskan mengapa sains data menarik dan saintis data
sangat dibutuhkan
• Mampu menjelaskan beberapa aplikasi sains data
Capaian Pembelajaran Mata Kuliah
Outline
DEFINING DATA SCIENCE
What is Data Science?
What is Data Science?
IBM Developers

• Data Science is a process,


not an event.
• It is the process of using
data to understand
different things, to
understand the world.
What is Data Science?
IBM Developers

• Data science is when you


have a model or hypothesis
of a problem, and you try
to validate that hypothesis
or model with your data.

Rafael B. Da Silva
What is Data Science?
IBM Developers

• Data science is the art of


uncovering the insights and
trends that are hiding
behind data.

Diana Zarate Diaz


What is Data Science?
IBM Developers

• It's when you translate data


into a story.
• So use storytelling to
generate insight.
• And with these insights,
you can make strategic
choices for a company or
an institution.
What is Data Science?
IBM Developers

• Data science is a field about


processes and systems to
extract data from various
forms of whether it is
unstructured or structured
form.

Mandeep Kaur
What is Data Science?
IBM Developers

• Data science is the study of data.


• Like biological sciences is a study of
biology, physical sciences, it's the
study of physical reactions.
• Data is real, data has real properties,
and we need to study them if we're
going to work on them.
Stephen Sherman
What is Data Science?
Academia

• Data Science involves data and


some science.
• I'd see data science as one's
attempt to work with data, to find
answers to questions that they are
exploring.
What is Data Science?
Academia

• In a nutshell, it's more about data


than it is about science.
• If you have data, and you have
curiosity, and you're working with
data, and you're manipulating it,
you're exploring it, analyzing it,
trying to get some answers from
it.
What is Data Science?
Past vs Today

• Data science is very relevant today


• Past: we worried about the lack of data, Now: we have a data deluge.
• Past: we didn't have algorithms, Now: we have algorithms.
• Past: the software was expensive, Now: it's open-source and free.
• Past: we couldn't store large amounts of data, Now: we can have
gazillions of datasets for a very low cost.
• There's never been a better time to be a data scientist
• Tools to work with data, the very availability of data, and the ability to
store and analyze data, it's all cheap, it's all available
DEFINING DATA SCIENCE
Fundamentals of Data Science
Fundamentals of Data Science
Data Analysis Component

Data Science has a significant data analysis component


Fundamentals of Data Science
Vast Quantitiy of Data

The new thing is the vast quantity of data available from


massively varied sources
Fundamentals of Data Science
Computing Power

At the same time, we have the computing power


Fundamentals of Data Science
Data Science Task
Fundamentals of Data Science
Data Science Task

Data scientists exploring the best way to provide value to


the business
Fundamentals of Data Science
Data Science Task

Data science focus on a specific problem to clarify the


question that the organization wants to answer
Fundamentals of Data Science
Data Science Task

Good data scientists are curious people who ask questions


to clarify the business need
Fundamentals of Data Science
Data Science Task

Data scientists can analyze structured and unstructured


data from many sources
• Sometimes, it will confirm what the organization suspects, but
sometimes it will be completely new knowledge
Fundamentals of Data Science
Data Science Task

Data scientists becomes a storyteller, communicating


the results to the project stakeholders
Fundamentals of Data Science
Data Science Task
Fundamentals of Data Science
Data Science Role
WHAT DATA SCIENTISTS DO
Old Problems, New Solutions
Old Problems New Solutions
Introduction

All organizations ultimately use data science to


discover optimum solutions to existing problems
Old Problems New Solutions
Introduction

three examples of data science providing innovative


solutions for old problems
Old Problems New Solutions
Uber Case

Uber collects real-time user data to discover many


things for better solutions
Old Problems New Solutions
Toronto Transportation Case

Toronto Transportation Commission has made great


strides in solving an old problem with traffic flows
Old Problems New Solutions
Toronto Transportation Case

traffic performance

customer complaints

data scientists team


streetcar operations
Old Problems New Solutions
Toronto Transportation Case

Traffic Congestion Dropped


Old Problems New Solutions
Environment Case

Freshwater lakes supply a variety of human and


ecological needs
Old Problems New Solutions
Environment Case

There are many projects and studies to solve this


long-existing dilemma
Old Problems New Solutions
Environment Case

In the US, a team of scientists is developing and deploying


high-tech tools to explore cyanobacteria in lakes
Old Problems New Solutions
Environment Case

In the US, a team of scientists is developing and deploying


high-tech tools to explore cyanobacteria in lakes
Old Problems New Solutions
The Solution

The project is also building new algorithmic models to


assess the findings
Old Problems New Solutions
The Solution

The information collected will lead to better predictions


Old Problems New Solutions
The Solution

gathering a lot of data analyzing it

cleaning and preparing it develop better solutions


Old Problems New Solutions
The Solution

How do you get a better solution that is efficient?


WHAT DATA SCIENTISTS DO
Advices for Data Scientists
Advices for Data Scientists
The Advices

• Curious
• Extremely Argumentative
• Judgmental
• Ability to tell a story
• See your competitive advantage
• Some proficiency in the tools
DATA SCIENCE APPLICATION
Data Science Approaches
Data Science Approaches
Algorithmn Helps

• Regression: helped us understand data.


• Data visualization: a key element for people to get
across their message to people that don't understand
that well what data science is.
• Artificial neural networks: we have a lot to learn with
nature
• Nearest neighbor: it's the simplest but it just gets the
best results so many more times than some
overblown
Data Science Approaches
Cloud is Beautiful

• Cloud is the central storage system


• Cloud allows you to deploy the analytics and storage capacities
of advanced machines
• Cloud allows you to deploy very advanced computing
algorithms and the ability to do high-performance computing
• Cloud enables you to get instant access to open source
technologies
• Cloud gives you access to the most up-to-date tools and
libraries
Data Science Approaches
Cloud is Beautiful

• Cloud allows multiple entities to work with same data


at the same time
• You can use cloud-based technologies from your
laptop, from your tablet, and even from your phone,
enabling collaboration more easily than ever before
• IBM offers the IBM Cloud, Amazon offers Amazon
Web Services or AWS, and Google offers Google
Cloud Platform
DATA SCIENCE APPLICATION
Application At Glance
Application At Glance
Big Data and Data Science

Data Science and Big Data are making an undeniable


impact on businesses
Application At Glance
Big Data and Data Science

sometimes it’s hard to see exactly how


Application At Glance
Big Data and Data Science

In this era of Big Data almost everyone generates


masses of data every day
Application At Glance
Big Data and Data Science

Recommendation Engine is a common application


of Data Science
Application At Glance
Impact on Business

Amazon, Netflix, and Spotify use algorithms to make


specific recommendations
Application At Glance
Impact on Business

Siri on Apple devices and Google use Data Science


Application At Glance
Impact on Business

Wearable devices add information about your activity levels,


sleep patterns, and heart rate to the data you generate
Application At Glance
Impact on Business

Data Science is impacting business


Application At Glance
Impact on Business
Application At Glance
Netflix Case

Netflix collects and analyzes massive amounts of data


from millions of users
Application At Glance
Netflix Case

Netflix can be confident that a show will be a hit


before filming even begins
Application At Glance
Netflix Case

Netflix knew that significant numbers of


people who liked Fincher also liked Wright
Application At Glance
Netflix Case

All this information would be a good investment


for the company
Application At Glance
Netflix Case

Thanks to Data Science


Netflix knows what people want before they do
SUMMARY
Summary of The Course

• Data science is the study of large quantities of data, which


can reveal insights that help organizations make strategic
choices.
• New data scientists need to be curious, judgemental and
argumentative.
• Some ways that data is generated by consumers.
Thank You

Credit by:

IBM

You might also like