1.2 Introduction To Applied Data Science
1.2 Introduction To Applied Data Science
Assistant Professor
Department of Electronics and Computer Engineering
K. J. Somaiya College of Engineering
Somaiya Vidyavihar University
07/06/2024 1
Module No. 1
Introduction to Applied Data
Science and Data Scraping
07/06/2024 2
Contents
• Datafication- Data everywhere
• Big Data
• What is Data Science?
• Big Data and Data Science
• Current landscape of perspectives
• Data Scientist Skill sets
• Challenges and skill Sets needed and various applications
areas.
• Impact of applying Data Science in business scenario
• Estimation and validation for added value due to data
science
07/06/2024 3
Data is every where
07/06/2024 4
Datafication
07/06/2024 5
Datafication
Definition
Example
• Datafication is a
technological trend • Quantify friends with ‘likes’
turning many aspects of • Googles augmented reality glass to
our life into data which is quantify the gaze
subsequently transferred • Twitter datify the thoughts
into information realized • LinkedIn datify our professional
networks
as a new form of value
• Browsing web, unintentionaly with
cookies
• Walk in store, street we are
datafied via sensors, cameras,
google glasses
• Taking part of social media
experiment
07/06/2024 6
Big Data
07/06/2024 7
Additional V- Veracity
07/06/2024 8
Big Data Definition
07/06/2024 9
07/06/2024 10
07/06/2024 11
07/06/2024 12
07/06/2024 13
07/06/2024 14
What is Data Science
07/06/2024 15
What is Data Science?
• Data Science is a science of
analyzing raw data using statistics
and machine learning with the
purpose of drawing conclusion
about the information.
07/06/2024 16
07/06/2024 17
Drew Conway’s Venn diagram of data science
07/06/2024 18
Big Data vs Data science
07/06/2024 19
A Data Science Profile
• Computer Science
• Math
• Statistics
• Machine learning
• Domain expertise
• Communication and presentation skills
• Data Visualization
07/06/2024 20
07/06/2024 21
Data Science team profile
07/06/2024 22
Data Scientist
07/06/2024 23
07/06/2024 24
Data Science Process/Life Cycle
07/06/2024 25
Role of Data scientist in the process
07/06/2024 26
What does Data Scientist do Really?
In Industry:
More generally someone who knows:
• How to design experiments?
• Knows the process of collecting, cleaning and munging
data
• Skills that are necessary for understanding the biases in
the data and for debugging logging output from code
• Exploratory data analysis which combines visualization
and data sense.
• Finding patterns, build models and algorithms
• Use analysis for decision making
07/06/2024 27
Data Science Applications
07/06/2024 28
Question
?
07/06/2024 29
Thank
You!!
07/06/2024 30