0% found this document useful (0 votes)

11 views

Lecture 5 MapReduce Working

Uploaded by

BHAWANI KUMARI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

Lecture 5 MapReduce Working

Uploaded by

BHAWANI KUMARI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Processing Big Data

With Hadoop Map

Reduce Technology

By
Dr. Aditya Bhardwaj

[email protected]

Big Data Analytics and Business Intelligence (CSET/CMCA-580)

Learning Objectives

MapReduce
What is Job Tracker Task Tracker
Working
MapReduce Role Role
Architecture
Functional Architecture of Hadoop
• The core component of Hadoop includes HDFS and MapReduce.
Working of MapReduce
The basic unit of information used by MapReduce is a key-
value pair.
• The Map task takes a set of data and converts it into
another set of data, where individual elements are
broken down into tuples (key-value pairs).

• The Reduce task takes the output from the Map as an

input and the aggregates those intermediate key-value
pair which is the final output.

5/24
Example- To Demonstrate MapReduce Working
• For example, consider a MapReduce job that counts the number of times each
word is used across a set of documents.

• Note: Framework sorts all intermediate key-value pair by key, not by value 6/24
How does MapReduce Works High-Level Architecture?

• The Shuffle stage and the Reduce stage together are called
the Reduce stage.
• Shuffling: It is second phase of MapReduce used to sort, group
and shuffle the output coming from the Mapper function.
7/24
MapReduce Wordcount Realtime Applications
Application 1. Break down movie ratings by rating
score

Application 2. Log analysis from a web server

8/24
Job and Task Tracker in Hadoop Map Reduce Architecture

•There are two types of nodes for job execution

•One Master -JobTracker
•Multiple Slaves –TaskTracker
What is JobTracker?

• JobTracker is a node which can run on the NameNode

(MasterNode) to allocates the job to task trackers.

• It tracks resource availability and task life cycle

management, tracking its progress, fault tolerance etc.

10/24
Functions of JobTracker

Job Tracker –
• JobTracker receives the requests for MapReduce execution from the
client.

• JobTracker talks to the NameNode to determine the location of the

data.

• JobTracker finds the best TaskTracker nodes to execute tasks based on

the data locality (proximity of the data) and the available slots to
execute a task on a given node.

• JobTracker monitors the individual TaskTrackers and the submits back

the overall status of the job back to the client.

11/24
Functions of TaskTracker

TaskTracker –

• TaskTracker runs on DataNode.

• Map and Reduce functions are executed on DataNodes using TaskTrackers.

• TaskTracker run the tasks and report the status of task to JobTracker.
TaskTracker run on DataNodes. It has function of following the orders of
the job tracker and updating the job tracker with its progress status
periodically.

12/24
Features of MapReduce
1. Simplicity – MapReduce jobs are easy to run. Applications
can be written in any language such as java, C++,
andFeatures
python. of MapReduce
2. Scalability – MapReduce framework are built in such a way
that they can accommodate more machines as and when
required.
3. Synchronization: Execution of several concurrent processes
requires synchronization. The MapReduce framework tracks all
the tasks along with their mapping timings and start the
reduction process after the completion of mapping phase.

4. Fault Tolerance – MapReduce takes care of failures. If one

copy of data is unavailable, another machine has a copy of
the same key pair which can be used for solving the same
subtask.
Conclusion
• The functioning of MapReduce like we just went through is
a sequential flow.

• MapReduce Shuffling and Sorting occurs simultaneously to

summarize the Mapper intermediate output.

14/2
Thanks Note

15
tungal/presentations/ad2012

Manhattan WMS Training MaxMunus
0% (2)
Manhattan WMS Training MaxMunus
7 pages
Unit Iv Mapreduce Applications
No ratings yet
Unit Iv Mapreduce Applications
70 pages
BDA UNIT -4 notes
No ratings yet
BDA UNIT -4 notes
28 pages
BDA Unit 3 Notes
No ratings yet
BDA Unit 3 Notes
11 pages
Big Data Analytics-4
No ratings yet
Big Data Analytics-4
26 pages
Notes Bug Data and of Apache
No ratings yet
Notes Bug Data and of Apache
4 pages
What Is MapReduce in Hadoop - Architecture - Example
No ratings yet
What Is MapReduce in Hadoop - Architecture - Example
7 pages
Unit 3
No ratings yet
Unit 3
13 pages
UNIT 4 Notes by ARUN JHAPATE
No ratings yet
UNIT 4 Notes by ARUN JHAPATE
20 pages
UNIT -4 PPT
No ratings yet
UNIT -4 PPT
50 pages
MapReduce Architecture
No ratings yet
MapReduce Architecture
5 pages
P.Prabu (23x61c) CCS334-BDA - Unit-3
No ratings yet
P.Prabu (23x61c) CCS334-BDA - Unit-3
23 pages
Bda Unit-3
No ratings yet
Bda Unit-3
20 pages
BDA UNIT-3 (1) - Merged
No ratings yet
BDA UNIT-3 (1) - Merged
98 pages
1 UNIT-1
No ratings yet
1 UNIT-1
59 pages
Mapreduce
No ratings yet
Mapreduce
5 pages
Lec 5
No ratings yet
Lec 5
5 pages
Mapreduce Lifecycle
No ratings yet
Mapreduce Lifecycle
8 pages
MapReduce Architecture
No ratings yet
MapReduce Architecture
3 pages
Unit 5 - Mapreduce
No ratings yet
Unit 5 - Mapreduce
8 pages
Introduction To MapReduce
No ratings yet
Introduction To MapReduce
26 pages
3.1.How Map Reduce Works & 3.2 Anatomy
No ratings yet
3.1.How Map Reduce Works & 3.2 Anatomy
11 pages
Module 4 BDA Solutions
No ratings yet
Module 4 BDA Solutions
22 pages
BIG DATA UNIT -3
No ratings yet
BIG DATA UNIT -3
7 pages
Big Data BCA Unit4
No ratings yet
Big Data BCA Unit4
9 pages
Unit 3 Handouts
No ratings yet
Unit 3 Handouts
11 pages
BDA Unit 2 Notes
No ratings yet
BDA Unit 2 Notes
32 pages
Unit 3-1
No ratings yet
Unit 3-1
65 pages
Lec 5
No ratings yet
Lec 5
6 pages
Bda Mod2
No ratings yet
Bda Mod2
8 pages
MapReduce Arch
No ratings yet
MapReduce Arch
29 pages
2 BDA MapReduce
No ratings yet
2 BDA MapReduce
30 pages
Big data unit 3 own
No ratings yet
Big data unit 3 own
20 pages
2inceptez Hadoop Processing
No ratings yet
2inceptez Hadoop Processing
16 pages
Module 4
No ratings yet
Module 4
37 pages
Notes - Unit 3 - Map Reduce Applications
No ratings yet
Notes - Unit 3 - Map Reduce Applications
11 pages
HadoopMapreduce Summerization
No ratings yet
HadoopMapreduce Summerization
24 pages
Big Data Unit 4
No ratings yet
Big Data Unit 4
14 pages
Big Data notes (1)
No ratings yet
Big Data notes (1)
13 pages
What Is MapReduce in Hadoop
No ratings yet
What Is MapReduce in Hadoop
5 pages
3-MapReduce Different Phases-13-01-2025
No ratings yet
3-MapReduce Different Phases-13-01-2025
23 pages
132 P16cse5a-P16ite3a 2020052706582977
No ratings yet
132 P16cse5a-P16ite3a 2020052706582977
15 pages
Chapter 4 MapReduce and New Software Stack
No ratings yet
Chapter 4 MapReduce and New Software Stack
48 pages
Unit-4
No ratings yet
Unit-4
19 pages
A Weather Dataset. Understanding Hadoop API for MapReduce Framework
No ratings yet
A Weather Dataset. Understanding Hadoop API for MapReduce Framework
9 pages
Unit 3
No ratings yet
Unit 3
27 pages
MapReduce Architecture
No ratings yet
MapReduce Architecture
27 pages
BDA_UNIT_2
No ratings yet
BDA_UNIT_2
48 pages
Top Answers To Map Reduce Interview Questions
No ratings yet
Top Answers To Map Reduce Interview Questions
6 pages
BDA-U4
No ratings yet
BDA-U4
25 pages
Bda Assignment
No ratings yet
Bda Assignment
7 pages
BDA U2 - copy
No ratings yet
BDA U2 - copy
79 pages
Hadoop Karunesh
No ratings yet
Hadoop Karunesh
14 pages
How Map Reduce Work
No ratings yet
How Map Reduce Work
99 pages
Unit V Cloud Technologies and Advancements
No ratings yet
Unit V Cloud Technologies and Advancements
33 pages
MapReduce
No ratings yet
MapReduce
14 pages
Hadoop - MapReduce
No ratings yet
Hadoop - MapReduce
5 pages
Unit - III
No ratings yet
Unit - III
37 pages
3 Mapreduce Notes
No ratings yet
3 Mapreduce Notes
25 pages
BDA Module 3 - Part 1 (Mapreduce and HBase) 2023
No ratings yet
BDA Module 3 - Part 1 (Mapreduce and HBase) 2023
15 pages
Hadoop Beginner's Guide
From Everand
Hadoop Beginner's Guide
Garry Turkington
4/5 (7)
Tom
No ratings yet
Tom
20 pages
C o A G U L A: Installation How It Works Painting Tools Making Sound Keyboard Shortcuts Contact
No ratings yet
C o A G U L A: Installation How It Works Painting Tools Making Sound Keyboard Shortcuts Contact
15 pages
8 Qam
100% (1)
8 Qam
16 pages
Split System Air Conditioners: Inverter Series
No ratings yet
Split System Air Conditioners: Inverter Series
428 pages
Delhi Public School, R.K.Puram, New Delhi
No ratings yet
Delhi Public School, R.K.Puram, New Delhi
2 pages
Novedades Gerber 8.5 AE
No ratings yet
Novedades Gerber 8.5 AE
56 pages
Cbip Transmission Line Manualpdf PDF Free
100% (1)
Cbip Transmission Line Manualpdf PDF Free
404 pages
AstroMethods Assignement1 2024
No ratings yet
AstroMethods Assignement1 2024
2 pages
Worksheet:: Physics::: MS-9C
No ratings yet
Worksheet:: Physics::: MS-9C
2 pages
Anisah, A., Winanda, V., & Herawan, E. (2023)
No ratings yet
Anisah, A., Winanda, V., & Herawan, E. (2023)
10 pages
Fov (Field of View) Optimization To Image Quality
No ratings yet
Fov (Field of View) Optimization To Image Quality
5 pages
Tutorial Letter 104/2/2021: Interactive Programming
No ratings yet
Tutorial Letter 104/2/2021: Interactive Programming
11 pages
Unit 2 - Database Management System - WWW - Rgpvnotes.in
No ratings yet
Unit 2 - Database Management System - WWW - Rgpvnotes.in
16 pages
NTU FYP Presentation
No ratings yet
NTU FYP Presentation
46 pages
Deld Unit 1
No ratings yet
Deld Unit 1
25 pages
Methods of Solving Quadratic Equations: Freebie
No ratings yet
Methods of Solving Quadratic Equations: Freebie
8 pages
ANSYS Example 1DHeat
No ratings yet
ANSYS Example 1DHeat
3 pages
Appendix 2 Serva PCTLR 721a Double Pump Cementer
No ratings yet
Appendix 2 Serva PCTLR 721a Double Pump Cementer
2 pages
Android Vulnerability: Analysis With Mercury Framework
No ratings yet
Android Vulnerability: Analysis With Mercury Framework
8 pages
Proportional Directional Valve Amplifier AMP08D: Ring Injenering
No ratings yet
Proportional Directional Valve Amplifier AMP08D: Ring Injenering
5 pages
CSC311 Lecture 4
No ratings yet
CSC311 Lecture 4
15 pages
Hydro Multi-E: Grundfos Hydro Multi-E Booster Systems With 2 To 4 CRE, CRIE or CME Pumps
No ratings yet
Hydro Multi-E: Grundfos Hydro Multi-E Booster Systems With 2 To 4 CRE, CRIE or CME Pumps
52 pages
Animal Tissues Mcq...
No ratings yet
Animal Tissues Mcq...
23 pages
FW102C-SoftwareManual
No ratings yet
FW102C-SoftwareManual
56 pages
SAP PP Job Interview Preparation Guide
No ratings yet
SAP PP Job Interview Preparation Guide
7 pages
Information & Communication Technology: Student Seminar Series - 201 - 2018
No ratings yet
Information & Communication Technology: Student Seminar Series - 201 - 2018
16 pages
Technical Proposal_HDD Onshore
No ratings yet
Technical Proposal_HDD Onshore
4 pages
Manual
No ratings yet
Manual
18 pages
Drill Operations
No ratings yet
Drill Operations
12 pages

Lecture 5 MapReduce Working

Uploaded by

Lecture 5 MapReduce Working

Uploaded by

Processing Big Data

With Hadoop Map

Big Data Analytics and Business Intelligence (CSET/CMCA-580)

• The Reduce task takes the output from the Map as an

Application 2. Log analysis from a web server

•There are two types of nodes for job execution

• JobTracker is a node which can run on the NameNode

• It tracks resource availability and task life cycle

• JobTracker talks to the NameNode to determine the location of the

• JobTracker finds the best TaskTracker nodes to execute tasks based on

• JobTracker monitors the individual TaskTrackers and the submits back

• TaskTracker runs on DataNode.

• Map and Reduce functions are executed on DataNodes using TaskTrackers.

4. Fault Tolerance – MapReduce takes care of failures. If one

• MapReduce Shuffling and Sorting occurs simultaneously to

You might also like