
CS481 - Data Science: Muhammad Sohail Afzal

This document discusses data science and the data science process. It defines data science as being about creating impact through data. It then lists the main steps in the data science process as setting a research goal, gathering data, preparing data, exploring the data through analysis, modeling, and presenting and automating results. It also mentions different data types and forms and provides a link about how data analytics helped Obama win the 2012 election.


CS481 - Data Science

Muhammad Sohail Afzal


What is data science?
In one sentence, you may say data science is about:

How much impact you can create through data!


Data Science
Big Data and the 4 V’s of Big Data
Data Analysis has been around for a while
Why Data Science!

Link on how data analytics helped Obama win the 2012 election:
https://www.scmp.com/yp/learn/college-uni-life/university-programmes/article/3071524/how-data-analytics-helped-obama-win
Why Data Science!

• 24,000 sq. m, housing 400 containers
• Each container holds 2,500 servers
• Integrated computing, networking, power, and cooling systems
• 300 MW supplied from two power substations situated on opposite sides of the datacenter
• Dual water-based cooling systems circulate cold water to the containers, eliminating the need for air-conditioned rooms
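
To get a feel for the scale these datacenter figures imply, the short Python sketch below multiplies them out. The four input numbers come from the slide. The per-server power budget is only an upper bound and rests on an assumption the slide does not state: that the full 300 MW supply is divided evenly across servers, ignoring cooling and networking loads.

```python
# Figures quoted on the slide.
AREA_SQ_M = 24_000
CONTAINERS = 400
SERVERS_PER_CONTAINER = 2_500
POWER_SUPPLY_MW = 300

# Total number of servers in the facility.
total_servers = CONTAINERS * SERVERS_PER_CONTAINER

# Assumption (not from the slide): the whole supply is split evenly
# across servers, so this is an upper bound on the per-server budget.
watts_per_server = POWER_SUPPLY_MW * 1_000_000 / total_servers

# Average server density over the floor area.
servers_per_sq_m = total_servers / AREA_SQ_M

print(f"Total servers: {total_servers:,}")
print(f"Power budget per server (upper bound): {watts_per_server:.0f} W")
print(f"Servers per square metre: {servers_per_sq_m:.1f}")
```

A single facility on the order of a million servers is one illustration of why data at this scale needs dedicated tools and methods.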
Data Forms/Types
Data Types
Data Science Process

The data science process can be divided into the following steps:


• Setting research goal
• Gathering data
• Preparing data
• Data exploration (EDA – exploratory data analysis)
• Modeling
• Presentation & Automation

Recommended viewing: “Moneyball” (2011), an application of statistics and data science to baseball.
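
The six process steps listed above can be sketched end to end in a few lines of Python. This is a minimal illustration, not the course's own code: the research goal, the records, and every function name (`gather`, `prepare`, `explore`, `fit_line`, `present`) are hypothetical, and the data is made up purely to show how the steps fit together. The modeling step uses a plain least-squares line as one possible choice.

```python
from statistics import mean

# Step 1: set the research goal (hypothetical example).
GOAL = "Estimate how exam score changes with hours studied"

def gather():
    # Step 2: gather data. Made-up records standing in for a real source.
    return [
        {"hours": 1.0, "score": 52.0},
        {"hours": 2.0, "score": 54.0},
        {"hours": None, "score": 70.0},  # record with a missing value
        {"hours": 3.0, "score": 56.0},
        {"hours": 4.0, "score": 58.0},
    ]

def prepare(rows):
    # Step 3: prepare data by dropping records with missing fields.
    return [r for r in rows if r["hours"] is not None and r["score"] is not None]

def explore(rows):
    # Step 4: exploratory data analysis via simple summary statistics.
    return {
        "n": len(rows),
        "mean_hours": mean(r["hours"] for r in rows),
        "mean_score": mean(r["score"] for r in rows),
    }

def fit_line(rows):
    # Step 5: modeling. Least-squares fit of score = intercept + slope * hours.
    xs = [r["hours"] for r in rows]
    ys = [r["score"] for r in rows]
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

def present(summary, slope, intercept):
    # Step 6: presentation. A function like this can be rerun on new data,
    # which is the simplest form of automation.
    return (
        f"{GOAL}\n"
        f"Records used: {summary['n']}\n"
        f"Fitted model: score = {intercept:.1f} + {slope:.1f} * hours"
    )

rows = prepare(gather())
summary = explore(rows)
slope, intercept = fit_line(rows)
print(present(summary, slope, intercept))
```

In practice the steps are iterative: exploration often sends you back to gather or clean more data before any modeling is worthwhile.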
