MapReduce Architecture
MapReduce Architecture
33
Trending Now DSA Web Tech Foundational Courses Data Science Practice Problem Python
MapReduce Architecture
Last Updated : 10 Sep, 2020
MapReduce Architecture:
1. Client: The MapReduce client is the one who brings the Job to the
MapReduce for processing. There can be multiple clients available
https://www.geeksforgeeks.org/mapreduce-architecture/ 1/8
11/12/24, 8:10 PM MapReduce Architecture - GeeksforGeeks
The MapReduce task is mainly divided into 2 phases i.e. Map phase
and Reduce phase.
https://www.geeksforgeeks.org/mapreduce-architecture/ 2/8
11/12/24, 8:10 PM MapReduce Architecture - GeeksforGeeks
1. Map: As the name suggests its main use is to map the input data in
key-value pairs. The input to the map may be a key-value pair
where the key can be the id of some kind of address and value is
the actual value that it keeps. The Map() function will be executed
in its memory repository on each of these input key-value pairs and
generates the intermediate key-value pair which works as input for
the Reducer or Reduce() function.
How Job tracker and the task tracker deal with MapReduce:
1. Job Tracker: The work of Job tracker is to manage all the resources
and all the jobs across the cluster and also to schedule each map
on the Task Tracker running on the same data node since there can
be hundreds of data nodes available in the cluster.
D diksh… 37
https://www.geeksforgeeks.org/mapreduce-architecture/ 3/8