Lecture 5 MapReduce Working
By
Dr. Aditya Bhardwaj
MapReduce
• What is MapReduce
• Working
• Architecture
• Job Tracker Role
• Task Tracker Role
Functional Architecture of Hadoop
• The core components of Hadoop are HDFS and MapReduce.
Working of MapReduce
• The basic unit of information used by MapReduce is a key-value pair.
• The Map task takes a set of data and converts it into
another set of data, where individual elements are
broken down into tuples (key-value pairs).
Example- To Demonstrate MapReduce Working
• For example, consider a MapReduce job that counts the number of times each
word is used across a set of documents.
• Note: The framework sorts all intermediate key-value pairs by key, not by value.
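The word-count job described above can be sketched in plain Python. This is a minimal single-process illustration, not Hadoop code: the function names `map_words`, `reduce_counts`, and `word_count`, and the sample documents, are assumptions made for the example.

```python
from collections import defaultdict

def map_words(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]

def reduce_counts(word, counts):
    """Reduce step: sum all the counts emitted for one word."""
    return word, sum(counts)

def word_count(documents):
    # Map: every document yields intermediate (word, 1) pairs.
    intermediate = [pair for doc in documents for pair in map_words(doc)]
    # Group the intermediate values under their key.
    grouped = defaultdict(list)
    for word, count in intermediate:
        grouped[word].append(count)
    # Reduce: one call per distinct key, visited in sorted key order
    # (ordering is by key, not by value).
    return [reduce_counts(word, grouped[word]) for word in sorted(grouped)]

# Hypothetical sample input.
documents = ["the quick brown fox", "the lazy dog", "the fox"]
result = word_count(documents)
print(result)
```

Note that the output is ordered alphabetically by word, even though "the" has the highest count, which mirrors the sort-by-key behaviour in the note above.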
How Does MapReduce Work? High-Level Architecture
• The Shuffle stage and the Reduce stage are together referred to as
the Reduce stage.
• Shuffling: the second phase of MapReduce, which sorts, groups,
and shuffles the output coming from the Mapper function.
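The sort-and-group behaviour of the shuffle phase can be illustrated with the standard library. This is a sketch under the assumption that all mapper output fits in one list on one machine; the `shuffle` function and the sample pairs are hypothetical, and a real cluster would also move the data between nodes.

```python
from itertools import groupby
from operator import itemgetter

def shuffle(mapper_output):
    """Sort intermediate pairs by key, then collect the values of each key."""
    # Sort on the key only; values keep their relative order (stable sort).
    ordered = sorted(mapper_output, key=itemgetter(0))
    # groupby merges runs of equal keys, which are adjacent after sorting.
    return [(key, [value for _, value in group])
            for key, group in groupby(ordered, key=itemgetter(0))]

# Hypothetical mapper output, in arrival order.
mapper_output = [("fox", 1), ("the", 1), ("dog", 1), ("the", 1), ("fox", 1)]
shuffled = shuffle(mapper_output)
print(shuffled)
```

Each `(key, list_of_values)` entry in `shuffled` is what a single Reducer call would receive.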
MapReduce Wordcount Realtime Applications
Application 1: Break down movie ratings by rating score
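A rating breakdown follows the same pattern as word count, with the rating score as the key. The sketch below assumes each input record is a `(user_id, movie_id, rating)` tuple; that record layout, the sample data, and the `mapper` and `reducer` names are illustrative assumptions, not taken from a specific dataset.

```python
from collections import defaultdict

# Hypothetical (user_id, movie_id, rating) records.
ratings = [(1, 50, 5), (2, 50, 3), (3, 61, 5), (4, 61, 4), (5, 72, 3), (6, 72, 5)]

def mapper(record):
    """Emit (rating, 1) so that the rating score becomes the key."""
    _user_id, _movie_id, rating = record
    yield rating, 1

def reducer(rating, ones):
    """Count how many times one rating score was given."""
    yield rating, sum(ones)

# Stand-in for the framework: group mapper output by key.
groups = defaultdict(list)
for record in ratings:
    for rating, one in mapper(record):
        groups[rating].append(one)

# Run the reducer once per rating score, in sorted key order.
breakdown = dict(pair
                 for rating in sorted(groups)
                 for pair in reducer(rating, groups[rating]))
print(breakdown)
```

The user and movie identifiers are discarded in the mapper because only the score matters for this breakdown.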
JobTracker and TaskTracker in Hadoop MapReduce Architecture
Functions of JobTracker
JobTracker –
• JobTracker receives the requests for MapReduce execution from the
client.
Functions of TaskTracker
TaskTracker –
• TaskTracker runs the tasks and reports the status of each task to the
JobTracker. TaskTrackers run on DataNodes. They follow the orders of
the JobTracker and periodically update it with their progress status.
Features of MapReduce
1. Simplicity – MapReduce jobs are easy to run. Applications
can be written in any language, such as Java, C++, and
Python.
2. Scalability – The MapReduce framework is built in such a way
that it can accommodate more machines as and when
required.
3. Synchronization – Execution of several concurrent processes
requires synchronization. The MapReduce framework tracks all
the tasks along with their mapping timings and starts the
reduction process after the completion of the mapping phase.
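The synchronization point between the two phases can be sketched with a thread pool: several map tasks run concurrently, and the reduce step starts only once all of them have returned. This is a single-machine analogy using Python's `concurrent.futures`, not the Hadoop scheduler; `map_task`, `reduce_task`, and the input splits are hypothetical.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

def map_task(split):
    """One map task: count the words in a single input split."""
    return Counter(split.split())

def reduce_task(partial_counts):
    """Reduce: merge the per-split counts into one total."""
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total

# Hypothetical input splits, one per map task.
splits = ["a b a", "b c", "a c c"]

with ThreadPoolExecutor(max_workers=3) as pool:
    # list(...) consumes every result, so this line blocks until
    # all map tasks have finished: the barrier before reduction.
    partial_counts = list(pool.map(map_task, splits))

# The reduce phase runs only after the mapping phase is complete.
totals = reduce_task(partial_counts)
print(totals)
```

Collecting all map results before calling `reduce_task` plays the role of the framework tracking task completion.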
Thanks Note