Big Data Analytics
LEARNING OBJECTIVES
• Learn what Big Data is and how it is changing the world of analytics
• Understand the motivation for and business drivers of Big Data
analytics
• Become familiar with the wide range of enabling technologies for
Big Data analytics
• Learn about Hadoop, MapReduce, and NoSQL as they relate to Big
Data analytics
• Understand the role of the data scientist, and the capabilities and
skills the role requires, as a new analytics profession
5 Vs OF BIG DATA
• Variety: the mix of structured and unstructured data, generated either by humans or by machines
• Velocity: how fast the data is coming in
• Veracity
• Variability
• Value
• …
A HIGH-LEVEL CONCEPTUAL ARCHITECTURE FOR BIG DATA SOLUTIONS
(by AsterData / Teradata)
UNIFIED DATA ARCHITECTURE: SYSTEM CONCEPTUAL VIEW
[Figure: data flows from sources to users through three stages: MOVE, MANAGE, ACCESS]
• Sources: ERP, SCM, CRM, images, audio and video, machine logs, text, web and social
• Platforms: data platform, integrated data warehouse, discovery platform, event processing
• Access tools: marketing, applications, business intelligence, data mining, math and stats, languages
• Users: marketing, executives, operational systems, customers, partners, frontline workers, business analysts, data scientists, engineers
KEYS TO SUCCESS WITH BIG DATA ANALYTICS
• Alignment between the business and IT strategy
• A fact-based decision-making culture
• A strong data infrastructure
• The right analytics tools
ENABLERS OF BIG DATA ANALYTICS
• In-memory analytics: storing and processing the complete data set in RAM
• In-database analytics: placing analytic procedures close to where the data is stored
• Appliances: combining hardware, software, and storage in a single unit for performance and scalability
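The difference between the first two enablers can be sketched in a few lines. This is a minimal illustration, not a benchmark: SQLite (from the Python standard library) stands in for a data warehouse, and the `sales` table, its rows, and the function names are all hypothetical.

```python
import sqlite3

def build_demo_db():
    """Create a small SQLite database with hypothetical sales rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    rows = [("east", 100.0), ("east", 50.0), ("west", 70.0), ("west", 30.0)]
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    conn.commit()
    return conn

def totals_in_memory(conn):
    """In-memory style: load the complete data set into RAM, then aggregate."""
    all_rows = conn.execute("SELECT region, amount FROM sales").fetchall()
    totals = {}
    for region, amount in all_rows:
        totals[region] = totals.get(region, 0.0) + amount
    return totals

def totals_in_database(conn):
    """In-database style: the engine aggregates; only the results leave it."""
    query = "SELECT region, SUM(amount) FROM sales GROUP BY region"
    return dict(conn.execute(query).fetchall())

conn = build_demo_db()
print(totals_in_memory(conn))
print(totals_in_database(conn))
```

Both functions return the same totals. What differs is where the work happens: the first moves every row to the application, while the second moves only one row per region.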
CHALLENGES OF BIG DATA ANALYTICS
• Data volume: the ability to capture, store, and process a huge volume of data in a timely manner
• Data integration: the ability to combine data quickly and cost-effectively
• Processing capabilities: the ability to process the data quickly, as it is captured (i.e., stream analytics)
BUSINESS PROBLEMS ADDRESSED BY BIG DATA ANALYTICS
• Process efficiency and cost reduction
• Brand management
• Revenue maximization, cross-selling/up-selling
• Enhanced customer experience
• Churn identification, customer recruiting
• Improved customer service
• Identifying new products and market opportunities
• Risk management
• Regulatory compliance
• Enhanced security capabilities
• …
APPLICATION EXAMPLE CASE 1
Moving from many old systems to a unified new system
[Figure: the system landscape before and after the move]
DATA SCIENTIST
[Figure: skills of a data scientist; labels recovered from the figure:]
• Curiosity and creativity
• Programming, scripting, and hacking
Source: Teradata
BIG DATA VENDORS
[Figure: bar chart of Big Data vendors; vertical axis runs from $0 to $60, units not shown]
HOW TO SUCCEED WITH BIG DATA
1. Simplify
2. Coexist
3. Visualize
4. Empower
5. Integrate
6. Govern
7. Evangelize
BIG DATA AND STREAM ANALYTICS
• Also called data-in-motion analytics or real-time data analytics
• Addresses one of the Vs of Big Data: velocity
• The analytic process of extracting actionable information from
continuously flowing/streaming data
• Why stream analytics?
  • It may not be feasible to store the data
  • The data may lose its value if not processed immediately
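The two reasons above suggest processing each reading as it arrives and keeping only a small window of recent values. The following is a minimal sketch under stated assumptions: `sensor_feed`, the window size, and the threshold are hypothetical, and a short list stands in for a stream that would not end in practice.

```python
from collections import deque

def rolling_alerts(readings, window=3, threshold=10.0):
    """Yield (value, mean) when a reading exceeds the mean of the
    previous `window` readings by more than `threshold`."""
    recent = deque(maxlen=window)  # only the last few readings are kept
    for value in readings:
        if len(recent) == window:
            mean = sum(recent) / window
            if value - mean > threshold:
                yield value, mean
        recent.append(value)

def sensor_feed():
    """Hypothetical stand-in for a continuous sensor stream."""
    yield from [20.0, 21.0, 19.0, 45.0, 20.0, 22.0]

alerts = list(rolling_alerts(sensor_feed()))
print(alerts)
```

Memory use stays constant no matter how long the stream runs, because nothing older than the window is retained, and each alert is raised at the moment the reading arrives.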
STREAM ANALYTICS: A USE CASE IN THE ENERGY INDUSTRY
[Figure: stream analytics around an energy production system (traditional and/or renewable)]
• Inputs: sensor data (energy production system status); meteorological data (wind, light, temperature, etc.); usage data (smart meters, smart grid devices)
• Processing: data integration and temporary staging, then streaming analytics (predicting usage, production, and anomalies)
• Outputs: capacity decisions; permanent storage area
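The data integration step in this use case can be sketched as merging several time-ordered feeds into one snapshot per timestamp. Everything here is an illustrative assumption: the feed contents, the field names (`production_kw`, `wind_mps`, `usage_kw`), and the decision rule, which is a toy comparison rather than a real forecast.

```python
import heapq

# Hypothetical feeds of (timestamp, field, value), each sorted by timestamp.
sensor = [(1, "production_kw", 500.0), (2, "production_kw", 480.0)]
weather = [(1, "wind_mps", 8.0), (2, "wind_mps", 3.0)]
usage = [(1, "usage_kw", 450.0), (2, "usage_kw", 520.0)]

def integrate(*feeds):
    """Merge sorted feeds and yield one (timestamp, snapshot) per timestamp."""
    current_ts, snapshot = None, {}
    for ts, field, value in heapq.merge(*feeds):
        if ts != current_ts and snapshot:
            yield current_ts, snapshot
            snapshot = {}
        current_ts = ts
        snapshot[field] = value
    if snapshot:
        yield current_ts, snapshot

def capacity_decisions(snapshots):
    """Toy rule: ask for more capacity when usage exceeds production."""
    for ts, snap in snapshots:
        shortfall = snap["usage_kw"] - snap["production_kw"]
        action = "add capacity" if shortfall > 0 else "hold"
        yield ts, action

decisions = list(capacity_decisions(integrate(sensor, weather, usage)))
print(decisions)
```

`heapq.merge` reads the feeds lazily, so the same structure would work with generators that never end. The weather values are carried in each snapshot but not used by the toy rule; a predictive model would take them as inputs.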