Big Data: Introduction To Terms, Concepts and Tools
Big Data: Introduction To Terms, Concepts and Tools
Big data burst upon the scene in the first decade of the
21st century.
BRIEF OVERVIEW
Big Data
○ ‘Big Data’ is similar to ‘Small Data’, but bigger in size but having data bigger it
requires different approaches:
○ An aim to solve new problems or old problems in a better way.
○ Big Data generates value from the storage and processing of very large quantities
of digital information that cannot be analyzed with traditional computing
techniques.
ORGANIZATIONS EMBRACING BIG DATA
CHARACTERISTICS OF BIG DATA
OPERATIONAL ANALYTICAL
Overview of Big Data stores
Data models
○ Key-value
○ Graph
○ Document
○ Column-family
Hadoop Distributed File System
○ Hadoop is ideal for storing large amounts of data, like terabytes and
petabytes, and uses HDFS as its storage system. HDFS lets you
connect nodes (commodity personal computers) contained within
clusters over which data files are distributed. You can then access and
store the data files as one seamless file system. Access to data files is
handled in a streaming manner, meaning that applications or
commands are executed directly using the MapReduce processing
model
Hadoop
○ The data processing framework is the tool used to work with the data
itself. By default, this is the Java-based system known
as MapReduce. You hear more about MapReduce than the HDFS
side of Hadoop for two reasons: It's the tool that actually gets data
processed.
Selecting Big Data Stores
Structured
○ Most traditional data sources
Semi-structured
○ Many sources of big data
Unstructured
○ Video data
○ Audio data
The Structure of Big Data
Maintenance in Aerospace
Social Media Analysis
○ 90% of the data in the world today has been created in the
last two years alone
BIG DATA SOURCES
○ Mobile Devices
○ Microphones
○ Readers / Scanners
○ Science Facilities
○ Programs / Softwares
○ Social Media
○ Cameras
SUMMARY
○ Big Data
○ Organizations embracing Big Data
○ Characteristics of Big Data
○ Big Data Technologies
○ Big Data Stores
○ Processing Big Data
○ Structure of Big Data
○ Why Big Data
○ Big Data Sources
BIG DATA
INTRODUCTION TO TERMS, CONCEPTS AND TOOLS