Unit I
Unit I
• https://www.montecarlodata.com/blog-da
ta-pipeline-architecture-explained/
Hadoop Architecture and
Components
Type of Digital Data
Type of Digital Data
Type of Digital Data
Type of Digital Data
a1 … an
1 -- n
2 -- n
. . .
. . .
. . .
Type of Digital Data
Definition of Big Data
• Big Data refers to
• large, complex datasets that are difficult to
process, store, and analyze using traditional
data management tools due to their:
• Volume
• Velocity
• Variety
• Veracity
• value
5 – V’s of Big Data
Volume: Massive amounts of data
generated daily.
Transfor
m
Data Pipeline Data Pipeline
Extract Load Transfor
Load
m
IoT
Transfor
m
SQL
Web
Dashboard
Serv
er
Data Lake Data Science
Analysis
Transfor
m
APIs Data Warehouse
Analytical Data Plane
Data Source
Hadoop Distributions
Year Event