Big Data - An Introduction
Outline
Big Data definition
The 3 Imperatives
Talent Creation
Demand, Education, Professional Development
Open Data
Open Innovation
Big Data applications
Acknowledgement
Malaysia Digital Economy Corporation Sdn. Bhd., Malaysia
Asia Pacific Centre of Analytics (APCA), APU, Malaysia
Other online resources incl.: EMC, Gartner, Edureka, Active Informatics, Revolution Analytics, Labour Insight, LinkedIn, Binary Briyani, IEEE Spectrum, etc.
R. Logeswaran
What is Big Data?
Oxford English Dictionary:
data of a very large size, typically to the extent that its manipulation and management present significant logistical challenges
2011 big data study by McKinsey:
datasets whose size is beyond
the
Big Data: expanding on the 4 fronts
VOLUME
Broadening data VARIETY
Increasing data VELOCITY: 175,000 tweets per second
Establishing the VERACITY of big data sources: Big Data technology allows us to establish quality and accuracy, especially in unstructured data
Turning big data into Value
ECONOMIC BENEFITS
GOVERNMENT BENEFITS
SOCIETAL BENEFITS
Focus @ My #1
Talent
TALENT CREATION
2014
2020 (Malaysia)
4,088
Source: IDC
3. Professional Development
Game-changing 8-week intense data scientist acceleration programme. Top Data Accelerator globally (backed by Cornell Uni.)
Massive Open Online Course (MOOC) with blended approach: highest sign-up for Coursera Data Science MOOC globally for 2015.
Skills of a Data Scientist
Curious & Creative
Technical
Quantitative
Skeptical
Communicative & Collaborative
Local demand (Malaysia)
Big Data Jobs
Source: EMC
Business, Statistics, etc.
Tools
Dashboard Visualisation
Focus @ My #2
Open Data
[Figure: open data rankings and dataset counts, 2013 versus 2020 (Target). Countries shown: UK, US, MY. Dataset counts shown: 15,000; 157,000; 117. Rank figures shown: 41; 30.]
Open Data
Malaysia Government Open Data Portal (2014): Data.gov.my
United States Open Government Initiative (2009): Data.gov
United Kingdom (2010): Data.gov.uk
Kenya Open Data Portal (2011)
Ghana Open Data Initiative (2012)
Japan Open Data Initiative (2013)
Others: United Nations, World Bank, EU Open Data Portal, etc.
DATA.GOV.MY
Number of Datasets
As at 12/12/15
As at 05/09/16
Focus @ My #3
Open Innovation
ACCELERATING INDUSTRY-DRIVEN COEs FOR IMPACTFUL USE CASES
4 CENTRES OF EXCELLENCE formed with MDeC to create national high-impact BDA solutions
PRGS 2015
BIG App Challenge 2015
Teradata CTO Roadshow 2015
CONCLUSION
Huge opportunities
Very good job prospects: employment, research, development, etc.
Government & industry willingness, effort and funds required
More open data initiatives required for community-based applications
Increased activities to create more awareness