Hadoop Platform & Services

The document outlines the objectives and system overview of a data engineering platform designed to solve data-related problems for businesses using an optimal technology stack. It details the architecture and components of the Hadoop ecosystem, including HDFS, YARN, and various data storage and processing technologies. Additionally, it discusses the design principles of distributed file systems, resource management, and the application lifecycle in Hadoop.

Data Engineering

Data & Compute Platform and Services ...


Objective
➔ Build a platform to address data-related problems for businesses

➔ Use an optimal technology stack

➔ Develop the right platforms based on requirements

➔ Optimize overall cost: usage-based cost optimization


System Overview

➔ Compute: ~50,000 applications running per day on average

➔ Storage: (~1.4 PB) Tiered Storage


● SSD for latency-critical processes (HBase)
● Magnetic disk for historical storage & scan-oriented processes
● Live data kept in hot storage with the aim of providing high throughput
● Older data is periodically moved to cold storage (decreased replication, lower throughput)
Technology Stack
➔ HADOOP (Distributed Data Warehouse)
◆ HDFS (Storage Layer)

◆ HBase (Distributed key-value Store)

◆ YARN (Compute Engine)

◆ Hive/Spark (Querying Engines)

◆ Oozie (Scheduling Engine)

➔ Kafka (Messaging Queue: Primary Source for Data Ingestion)

➔ Elasticsearch (ES), Druid (other data stores used for analytics and reporting)

➔ Hue/Zeppelin/Jupyter (Data Access Platforms)


Hadoop Distributed
File System (HDFS)
Distributed File System

▪ How to store data?

‣ Transaction-based systems vs. event-based systems

‣ Storage mechanism and scale:

■ Single node
■ Multi node (data partitioning, consistency, availability)
Distributed File System
▪ Unit of data that can be read or written?
‣ Folders, files, blocks ... ?

▪ What should be the optimal size of a block?
‣ Unit of access

▪ How to ensure data availability?
‣ Replicate blocks
‣ Separate data from metadata

                    Single Node Storage          Distributed Storage (e.g. HDFS)
Storage Unit        File system block            HDFS block
Block Size          4 KB                         128 MB
Data Availability   Using RAID                   Blocks replicated across nodes
Storage Policy      Reserves at least 1 block    Need not reserve a complete block
                    on disk (i.e. 4 KB)          for smaller data sizes
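As a quick illustration of the table above, a minimal sketch of the block arithmetic; the file sizes are invented for the example:

// Illustrative block arithmetic; file sizes here are invented for the example.
public class BlockMath {
    static final long HDFS_BLOCK = 128L * 1024 * 1024; // 128 MB

    public static void main(String[] args) {
        long oneGB = 1024L * 1024 * 1024;
        long blocks = (oneGB + HDFS_BLOCK - 1) / HDFS_BLOCK; // ceiling division
        System.out.println("1 GB file -> " + blocks + " HDFS blocks"); // 8
        // A 1 KB file still gets one block entry in metadata, but occupies
        // only ~1 KB on disk: HDFS does not reserve the full 128 MB for it.
        System.out.println("1 KB file -> 1 block, ~1 KB on disk");
    }
}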
Hadoop Distributed File System (HDFS)
▪ Based on Google File System (GFS)
▪ Optimized for huge files
▪ Write once, read many
‣ Create new data. Never update-in-place, only append.
‣ No write locks (only 1 writer!).

▪ Optimized for sequential reads


‣ Typically, start at a point and read to completion.

▪ Throughput favoured over low latency


‣ Minimize total time to read all data, rather than latency per small file.

▪ Survive high disk/node failures


HDFS Design
▪ Master-slave architecture
‣ Master manages namespace, directory/file names/tree structure, metadata, block ids,
permissions

‣ Slave manages blocks containing data


Master: Name Node
▪ Persists names, trees, metadata, permissions
‣ Namespace image (fsimage), cached in-memory
‣ Edit log of deltas (rename, permission, create)
• Transaction persisted on disk, then applied to in-memory fsimage
‣ fsimage and edit log merged on disk when HDFS restarted
‣ Mapping from files to list of blocks

▪ Block location not persistent, kept in-memory


‣ Mapping from blocks to locations is dynamic
• Why? Nodes fail and blocks get re-replicated/rebalanced, so persisted locations would go stale
‣ Reconstructs location of blocks from data nodes
‣ ~150 bytes of in-memory metadata per block/file/dir
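A rough sizing sketch from the ~150 bytes/object figure above; the 1.4 PB total comes from the System Overview, while the replication factor and the one-file-object-plus-one-block-object simplification are assumptions:

// Back-of-the-envelope NameNode heap estimate. The replication factor and the
// "one file object + one block object per block" simplification are assumptions.
public class NameNodeHeap {
    public static void main(String[] args) {
        double rawPB = 1.4;                               // from the System Overview
        long blockSize = 128L * 1024 * 1024;              // 128 MB blocks
        int replication = 3;                              // assumed HDFS default
        long logicalBytes = (long) (rawPB * Math.pow(1024, 5)) / replication;
        long blocks = logicalBytes / blockSize;           // unique (pre-replication) blocks
        long heapBytes = blocks * 2 * 150;                // ~150 bytes per object
        System.out.printf("~%d blocks, ~%.1f GB NameNode heap%n",
                blocks, heapBytes / Math.pow(1024, 3));
    }
}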
Master: Name Node
▪ Detects health of FS
‣ Is data node alive?
‣ Is data block under-replicated?
‣ Rebalances block allocation across data nodes for improved disk utilization

▪ Coordinates file operations


‣ Directs application clients to datanodes for reads
‣ Allocates blocks on datanodes for writes

▪ Security is not a priority


‣ Basic file and dir permissions (rwx)
‣ Default enforcement relies on client machine ‘username’
Master: Name Node
▪ File system does no work if NameNode not accessible!

▪ Single Point of failure! (Hadoop 1.x)


‣ Cold start → ~10 min to load the FS image, ~1 hr to rebuild the block list for every file
‣ Host recovery → copy the FS image, reconfigure data nodes

▪ Sync atomic writes to multiple disk file systems


‣ Local disk + NFS

▪ Secondary NameNode
‣ Merge FS image with edit log periodically … avoids downtime
when merging
‣ Serves as stale copy of FS image … data loss possible

http://blog.cloudera.com/blog/2012/03/high-availability-for-the-hadoop-distributed-file-system-hdfs/
[Figure: Secondary NameNode — Hadoop: The Definitive Guide, Tom White, 4th Edition, 2015]


Master: Name Node
▪ NameNode High Availability (2.x)
‣ Reliable shared NFS for edit log
‣ Hot standby loads FS image in-memory
‣ Constantly reads edit logs from disk
‣ DataNodes send heartbeat, block list to both
• But ops received only from active

‣ On NameNode failover, standby can takeover immediately

http://blog.cloudera.com/blog/2012/03/high-availability-for-the-hadoop-distributed-file-system-hdfs/
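A minimal client-side sketch of addressing an HA pair; the nameservice ID "mycluster", the NameNode IDs, and the hostnames are placeholders (real clusters set these in hdfs-site.xml):

import org.apache.hadoop.conf.Configuration;

public class HaClientConfig {
    public static void main(String[] args) {
        // Placeholder nameservice and hosts; production values live in hdfs-site.xml.
        Configuration conf = new Configuration();
        conf.set("dfs.nameservices", "mycluster");
        conf.set("dfs.ha.namenodes.mycluster", "nn1,nn2");
        conf.set("dfs.namenode.rpc-address.mycluster.nn1", "nn-host1:8020");
        conf.set("dfs.namenode.rpc-address.mycluster.nn2", "nn-host2:8020");
        // Lets clients retry against the standby when the active fails over.
        conf.set("dfs.client.failover.proxy.provider.mycluster",
                "org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider");
        conf.set("fs.defaultFS", "hdfs://mycluster");
    }
}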
Slave/Worker: Data Node
▪ Store & retrieve blocks
▪ Respond to client and master requests for block operations
▪ Sends heartbeat every 3 secs for liveliness
▪ Periodically sends list of block IDs and location on that node
‣ Piggyback on heartbeat message
‣ e.g., send block list every hour
▪ Caches blocks in-memory using cache directives per file, on a single data node
‣ E.g. index, lookup table, etc.
‣ Can be used by schedulers
Network Topology
▪ Same Node, Same Rack, Same Data Center, Different Data Centers
▪ Distance function between two logical nodes provided in config
‣ /dc/rack/node … default is “flat”, i.e. same distance

Hadoop: The Definitive Guide, Tom White, 4th Edition, 2015
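A minimal sketch of the /dc/rack/node distance convention above; the path strings are invented for the example:

// HDFS-style network distance over /dc/rack/node paths.
// Distance = hops from each node up to the closest common ancestor
// (same node = 0, same rack = 2, same data center = 4, different DCs = 6).
public class NetworkDistance {
    static int distance(String a, String b) {
        String[] pa = a.split("/"), pb = b.split("/");
        int common = 0;
        while (common < pa.length && common < pb.length
                && pa[common].equals(pb[common])) {
            common++;
        }
        return (pa.length - common) + (pb.length - common);
    }

    public static void main(String[] args) {
        System.out.println(distance("/d1/r1/n1", "/d1/r1/n1")); // 0: same node
        System.out.println(distance("/d1/r1/n1", "/d1/r1/n2")); // 2: same rack
        System.out.println(distance("/d1/r1/n1", "/d1/r2/n3")); // 4: same data center
        System.out.println(distance("/d1/r1/n1", "/d2/r3/n4")); // 6: different DCs
    }
}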


File Reads
▪ Client–DataNode direct transfer … not through the NameNode

▪ Client gets the data node list for each block from the NameNode
‣ First few blocks returned initially, sorted by distance

▪ Blocks read in order (see the read sketch below)
‣ Connection opened and closed to the nearest DataNode for each block
‣ Tries alternate data nodes on network failure or checksum failure
‣ Remembers & reports failures/corrupt blocks to the NameNode

▪ Allows scaling to many concurrent clients

Hadoop: The Definitive Guide, Tom White, 4th Edition, 2015
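A minimal read sketch using the HDFS Java API; the cluster URI and file path are placeholders:

import java.io.BufferedReader;
import java.io.InputStreamReader;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

// The client asks the NameNode for block locations, then streams each
// block directly from a DataNode; path and URI are placeholders.
public class HdfsRead {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.set("fs.defaultFS", "hdfs://namenode:8020");
        try (FileSystem fs = FileSystem.get(conf);
             BufferedReader in = new BufferedReader(new InputStreamReader(
                     fs.open(new Path("/data/events/part-00000"))))) {
            String line;
            while ((line = in.readLine()) != null) {
                System.out.println(line);
            }
        }
    }
}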


File Writes
▪ Write once … append, truncate … strictly one writer at a time, per file
▪ Clients get a list of data nodes on which to store a block’s replicas
‣ First copy on the same data node as the client, or a random node.
‣ Second is off-rack. Third is on the same rack as the second.
▪ Blocks written in order. Forwarded in a pipeline. Acks from all replicas expected before the next block is written. (See the write sketch below.)

Hadoop: The Definitive Guide, 4th Edition, 2015
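A minimal write sketch; create() obtains a replica pipeline from the NameNode and bytes are forwarded DataNode-to-DataNode down that pipeline. Path, URI, and the tuning values are illustrative:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsWrite {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.set("fs.defaultFS", "hdfs://namenode:8020"); // placeholder URI
        try (FileSystem fs = FileSystem.get(conf);
             FSDataOutputStream out = fs.create(
                     new Path("/data/out/events.log"),
                     true,                  // overwrite
                     4096,                  // client buffer size
                     (short) 3,             // replicas: 1 local/random, 1 off-rack, 1 on that rack
                     128L * 1024 * 1024)) { // block size
            out.writeBytes("event-1\nevent-2\n");
        }
    }
}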


Hadoop YARN
Yet Another Resource Negotiator
[Figure: Fig-1 and Fig-2 — master/slave cluster topology; MRv1 vs MRv2 application lifecycle]

Apache Hadoop YARN, Arun C. Murthy, et al, HortonWorks, Addison Wesley, 2014
MapReduce v1 → MapReduce v2 (YARN)

Apache Hadoop YARN, Arun C. Murthy, et al, HortonWorks, Addison Wesley, 2014
YARN
▪ Designed for scalability
‣ 10k nodes, 400k tasks
▪ Designed for availability
‣ Separate application management from resource management

▪ Improve utilization
‣ Flexible slot allocation. Slots not bound to Map or Reduce types.

▪ Go beyond MapReduce
YARN
▪ ResourceManager for cluster
‣ Keeps track of nodes, capacities, allocations
‣ Failure and recovery (heartbeats)
▪ Coordinates scheduling of jobs on the cluster
‣ Decides which node to allocate to a job
‣ Ensures load balancing

▪ Used by programming frameworks to schedule distributed applications


‣ MapReduce, Spark, etc.
▪ NodeManager
‣ Offers slots with given capacity on a host to schedule tasks
‣ Container maps to one or more slots…Container can be a Unix process or cgroup
Application Master
▪ Coordinates
‣ resource acquisition,
‣ scheduling,
‣ monitoring progress,
‣ and termination
‣ for a specific application type
▪ E.g. MapReduce, MPI, Spark, etc.
▪ The AppMaster runs in its own container
‣ May launch additional containers for its compute tasks
‣ Or may run the job locally in its JVM for “small” applications
YARN Application Lifecycle

Apache Hadoop YARN, Arun C. Murthy, et al, HortonWorks, Addison Wesley, 2014
[Figure: containers heartbeat status to the AppMaster]

Apache Hadoop YARN, Arun C. Murthy, et al, HortonWorks, Addison Wesley, 2014
Hadoop: The Definitive Guide, 4th Edition, 2015
MapReduce AppMaster
▪ First requests Map containers
‣ As many as the number of splits

▪ Reduce containers requested after 5% of Map tasks complete

‣ Number of reducers is user specified; 1 by default!
▪ Map containers try for data locality with their “split”
‣ Same node, same rack

▪ Containers have CPU and memory resource requirements (see the config sketch below)

‣ Config per job, or default for the cluster

▪ The AppMaster asks the NodeManager to start a container

‣ The container task fetches the jar and config locally, executes, and commits
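A hedged sketch of the per-job knobs behind these bullets; the values are illustrative, not recommendations:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class MrResourceConfig {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Container memory requests per task (values illustrative):
        conf.setInt("mapreduce.map.memory.mb", 2048);
        conf.setInt("mapreduce.reduce.memory.mb", 4096);
        // Start reducers after 5% of maps finish (the default mentioned above):
        conf.setFloat("mapreduce.job.reduce.slowstart.completedmaps", 0.05f);
        Job job = Job.getInstance(conf, "example");
        job.setNumReduceTasks(1); // 1 reducer by default
    }
}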
Scheduling in YARN
▪ FIFO

▪ Capacity
‣ Uses different queues, with a minimum capacity per queue
‣ Allocates excess resources to the more loaded queues

▪ Fair
‣ Gives a job all available resources
‣ Redistributes as new jobs arrive
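With the Capacity scheduler, each job is submitted to a named queue. A minimal sketch, assuming a hypothetical "etl" queue (queues themselves are defined by the cluster admin, e.g. in capacity-scheduler.xml):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class QueueSubmit {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // "etl" is a placeholder queue name.
        conf.set("mapreduce.job.queuename", "etl");
        Job job = Job.getInstance(conf, "nightly-aggregation");
    }
}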
Hadoop MapReduce
Mapping tasks to blocks
▪ FileInputFormat converts blocks to “splits”
‣ Typically, 1 split per block … balances task-creation overhead against overwhelming a single task
‣ Can specify splits smaller/larger than a block size (see the sketch below)
‣ Affects locality if spanning blocks
‣ Affects performance with many small files (combine!)

▪ Each split is handled by a single Mapper task

‣ Records read from each split form the key-value pair input to the Map function
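A sketch of forcing splits larger than one block; the input path and sizes are illustrative, and as noted above, splits spanning blocks can lose data locality:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

public class SplitSizing {
    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "split-sizing");
        FileInputFormat.addInputPath(job, new Path("/data/events")); // placeholder path
        // Min split of 256 MB -> each split covers two 128 MB blocks:
        FileInputFormat.setMinInputSplitSize(job, 256L * 1024 * 1024);
        FileInputFormat.setMaxInputSplitSize(job, 512L * 1024 * 1024);
        // For many small files, CombineFileInputFormat packs them into fewer splits.
    }
}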
Mapping tasks to blocks

Hadoop: The Definitive Guide, 4th Edition, 2015


Resource Mapping
▪ Resource acquisition either at the beginning (Map tasks) or during (Reduce tasks) the application lifetime
‣ Higher priority for Map container requests

▪ The AppMaster can specify locality constraints to YARN

‣ Compute tasks are moved to the data block location
‣ Location of one of the three replicas of the block
‣ Prefer same node, followed by rack, then cluster
Local Disk

▪ A background thread “spills” to disk when the circular memory buffer (100 MB) reaches a threshold (80%)
‣ Asynchronous; avoids blocking unless the thread writes slower than the Map task produces
▪ Divides the data into in-memory partitions, one for each reducer
‣ Performs a sort by key
‣ Runs the combiner on the sorted outputs
‣ Writes to a local directory, accessible by reducers over HTTP (not HDFS!)
Local Disk

▪ Output files are merged, partitioned, and sorted into a single file on disk
‣ If there are multiple spill files (≥ 3) once the Map task is done, the combiner runs again.
‣ Optionally compressed
▪ Map task output is always written to disk … recovery! (Tuning knobs sketched below.)
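A sketch of the job properties behind the spill/merge behaviour above; the values shown are the usual defaults, but treat the names and defaults as assumptions to verify against your Hadoop version:

import org.apache.hadoop.conf.Configuration;

public class SpillTuning {
    public static void main(String[] args) {
        Configuration conf = new Configuration();
        conf.setInt("mapreduce.task.io.sort.mb", 100);            // circular buffer size (MB)
        conf.setFloat("mapreduce.map.sort.spill.percent", 0.80f); // spill threshold
        conf.setInt("mapreduce.task.io.sort.factor", 10);         // files merged per round
        conf.setInt("mapreduce.map.combine.minspills", 3);        // rerun combiner if >= 3 spills
    }
}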
Local Disk

▪ The Reducer copies files as soon as they are available from any Map task

‣ Copied into reducer memory if small,
‣ On threshold: merged, combined, then spilled to disk
▪ Incremental merge sort takes place in a background thread
Local Disk

▪ When output from all Map tasks is available, a final merge-sort runs over all spilled files before the reduce method is called
‣ Multiple rounds, 10 files merged per round
‣ Input to the reducer comes from the sorted files and the trailing in-memory sorted key-value pairs
Liveliness
▪ A Hadoop job or task is alive as long as it is making progress
‣ Reading/writing an input record
‣ Setting status or incrementing a counter (see the sketch below)
▪ Progress reported to the AppMaster by tasks every ~3 secs
▪ Client polls the AppMaster
‣ ~1 sec
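A sketch of a Mapper that stays “alive” while doing long per-record work; the counter group/name are invented for the example:

import java.io.IOException;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Setting status or incrementing a counter counts as progress,
// so the framework does not consider the task hung.
public class SlowMapper extends Mapper<LongWritable, Text, Text, LongWritable> {
    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        context.setStatus("processing offset " + key.get());
        // ... expensive per-record work would go here ...
        context.getCounter("app", "records").increment(1); // placeholder counter
        context.write(value, new LongWritable(1));
    }
}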
Reading
▪ Hadoop: The Definitive Guide, 4th Edition, 2015
‣ Chapters 3, 4, 7

Additional Resources
▪ Apache Hadoop YARN: Moving Beyond MapReduce and Batch Processing with
Apache Hadoop, 2015
‣ Chapters 1, 3, 4, 7
