
MapReduce Architecture
Last Updated : 10 Sep, 2020

MapReduce and HDFS are the two major components of Hadoop that make
it so powerful and efficient to use. MapReduce is a programming model
for processing large data-sets in parallel across a distributed
cluster. The input data is first split, processed, and then combined
to produce the final result. MapReduce libraries have been written in
many programming languages, each with its own optimizations. The
purpose of MapReduce in Hadoop is to map each job into smaller,
equivalent tasks and then reduce their outputs, which lowers the
overhead on the cluster network and reduces the processing power
required. A MapReduce task is mainly divided into two phases: the Map
phase and the Reduce phase.

MapReduce Architecture:

Components of MapReduce Architecture:

1. Client: The MapReduce client is the one who brings the job to
   MapReduce for processing. There can be multiple clients that
   continuously submit jobs for processing to the Hadoop MapReduce
   Master.
2. Job: The MapReduce job is the actual work the client wants to
   perform, comprising many smaller tasks that the client wants to
   process or execute.
3. Hadoop MapReduce Master: It divides a particular job into
   subsequent job-parts.
4. Job-Parts: The tasks or sub-jobs obtained after dividing the main
   job. The results of all the job-parts are combined to produce the
   final output.
5. Input Data: The data set that is fed to MapReduce for processing.
6. Output Data: The final result obtained after processing.

In MapReduce, the client submits a job of a particular size to the
Hadoop MapReduce Master. The MapReduce Master then divides this job
into equivalent job-parts, which are made available to the Map and
Reduce tasks. These Map and Reduce tasks contain the program written
for the use case that the particular company is solving; the
developer writes the logic to fulfill the industry's requirement. The
input data is fed to the Map task, and the Map generates intermediate
key-value pairs as its output. These key-value pairs are then fed to
the Reducer, and the final output is stored on HDFS. Any number of
Map and Reduce tasks can be made available for processing the data,
as the requirement dictates. The algorithms for Map and Reduce are
written in an optimized way so that their time and space complexity
is kept to a minimum.
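
To make this flow concrete, here is a minimal sketch of the classic
word-count job written against the Hadoop MapReduce Java API. This is
an illustration added for clarity, not part of the original article;
the input and output HDFS paths are assumed to arrive as command-line
arguments.

```java
import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Map phase: emit an intermediate (word, 1) pair for every token.
  public static class TokenizerMapper
      extends Mapper<Object, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, ONE); // intermediate key-value pair
      }
    }
  }

  // Reduce phase: sum the counts per word after shuffle and sort.
  public static class IntSumReducer
      extends Reducer<Text, IntWritable, Text, IntWritable> {
    private final IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values,
        Context context) throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result); // final output, written to HDFS
    }
  }

  // Driver: the client configures the Job and submits it to the cluster.
  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class); // optional local pre-aggregation
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));   // input on HDFS
    FileOutputFormat.setOutputPath(job, new Path(args[1])); // output on HDFS
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

The combiner line is an optional optimization: it pre-aggregates the
map output locally before it crosses the network, which serves exactly
the goal of reducing overhead on the cluster network mentioned above.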

Let’s discuss the MapReduce phases to get a better understanding of
its architecture. The MapReduce task is mainly divided into two
phases, i.e., the Map phase and the Reduce phase.


1. Map: As the name suggests, its main use is to map the input data
   into key-value pairs. The input to a map task may itself be a
   key-value pair, where the key can be the id of some kind of
   address and the value is the actual data it holds. The Map()
   function is executed in its memory repository on each of these
   input key-value pairs and generates intermediate key-value pairs,
   which serve as the input for the Reducer or Reduce() function.

2. Reduce: The intermediate key-value pairs that serve as input for
   the Reducer are shuffled, sorted, and sent to the Reduce()
   function. The Reducer aggregates or groups the data based on its
   key, as per the reducer algorithm written by the developer (a
   simplified trace follows this list).
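
For example, assuming the word-count job sketched above is run on the
two input lines "deer bear river" and "car car river", the data flows
through the two phases roughly as follows (a simplified trace,
ignoring how keys are partitioned across multiple reducers):

```
map output (intermediate pairs): (deer,1) (bear,1) (river,1) (car,1) (car,1) (river,1)
after shuffle and sort:          (bear,[1]) (car,[1,1]) (deer,[1]) (river,[1,1])
reduce output (stored on HDFS):  (bear,1) (car,2) (deer,1) (river,2)
```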

How the Job Tracker and the Task Tracker deal with MapReduce:

1. Job Tracker: The work of the Job Tracker is to manage all the
   resources and all the jobs across the cluster, and to schedule
   each map task on a Task Tracker running on the same data node
   (there can be hundreds of data nodes available in the cluster).

2. Task Tracker: The Task Tracker can be considered the actual slave
   that works on the instructions given by the Job Tracker. A Task
   Tracker is deployed on each of the nodes available in the cluster
   and executes the Map and Reduce tasks as instructed by the Job
   Tracker.

There is also one more important component of the MapReduce
architecture known as the Job History Server. The Job History Server
is a daemon process that saves and stores historical information
about the task or application; the logs generated during or after job
execution are stored on the Job History Server.
