Introduction to Data Science
Data Science – Why all the excitement?
Data Analysis Has Been Around for a While…
Howard Dresner
Data Science: Why all the Excitement?
Exciting new, effective applications of data analytics, e.g.:
• Google Flu Trends: detecting outbreaks two weeks ahead of CDC data
• Predicting political campaign and election outcomes
PageRank: The web as a behavioral dataset
Sponsored search
• Google revenue is around $50 bn/year from marketing, 97% of the company's revenue.
• Text data, social media data, product reviews and consumer satisfaction (Facebook, Twitter, LinkedIn), e-discovery
“Big Data” Sources
It’s All Happening On-line
User Generated (Web & Mobile)
Every:
Click
Ad impression
Billing event
…
Fast forward, pause, …
Server request
Transaction
Network message
Fault
…
to produce:
“Data Science”: an Emerging Field
Data Science – A Definition
Data science is the discipline that draws on computer science, statistics, machine learning, visualization, and human-computer interaction to collect, clean, integrate, analyze, visualize, and interact with data in order to create data products.
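To make the verbs in the definition above concrete, here is a minimal sketch of a collect, clean, integrate, analyze, visualize pipeline. Everything in it is assumed for illustration: the records, the field names (`user`, `seconds`, `region`), and the helper functions are hypothetical and do not come from the slides. A real pipeline would typically use dedicated libraries rather than plain Python.

```python
from statistics import mean

# Collect: hypothetical raw click records (illustrative values only).
raw_clicks = [
    {"user": "a", "seconds": "12"},
    {"user": "b", "seconds": ""},   # missing duration
    {"user": "c", "seconds": "30"},
]
# A hypothetical second data source to integrate with.
user_region = {"a": "north", "b": "south", "c": "north"}


def clean(records):
    """Drop records with a missing duration and convert the rest to int."""
    return [
        {"user": r["user"], "seconds": int(r["seconds"])}
        for r in records
        if r["seconds"].strip()
    ]


def integrate(records, regions):
    """Attach each user's region from the second source."""
    return [{**r, "region": regions.get(r["user"], "unknown")} for r in records]


def analyze(records):
    """Compute the average duration per region."""
    by_region = {}
    for r in records:
        by_region.setdefault(r["region"], []).append(r["seconds"])
    return {region: mean(vals) for region, vals in by_region.items()}


def visualize(summary):
    """Render a crude text bar chart, one '#' per 5 seconds."""
    return [
        f"{region:<6} {'#' * int(avg // 5)} {avg}"
        for region, avg in sorted(summary.items())
    ]


summary = analyze(integrate(clean(raw_clicks), user_region))
for line in visualize(summary):
    print(line)
```

The resulting per-region summary is a very small "data product": something derived from raw events that a person or another program can act on.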
Goal of Data Science