Distributed Memory Architecture

This document discusses distributed multiprocessor architectures, including tightly coupled and loosely coupled architectures. It provides details on the key aspects of tightly coupled architectures, including models both with and without private caches. Issues like memory conflicts and solutions like adding caches are covered. Loosely coupled architectures are also summarized, including how they use local memory and inter-process communication through message passing over different modules. Specific examples like the Cm* architecture are briefly mentioned.


Unit 5

Distributed Multiprocessor
Architectures

Distributed Multiprocessor Architectures

• Loosely coupled and tightly coupled architectures
• Cluster computing as an application of loosely coupled architecture. Examples: Cm* and Hadoop.
Some Basics….
• When working on a project, several people coordinating together often produce a better solution than one person trying to piece things together alone.
• This is the idea behind multiprocessing.
• Multiprocessing means two or more processors working and operating concurrently.
• A multiprocessing system is a system configuration that contains more than one central processing unit (CPU).
Why use a multiprocessing system?
• First, a multiprocessing system increases overall system performance, i.e. the amount of work accomplished per unit time, also referred to as throughput.
• By working together, a problem can be divided up among processors for faster completion, an approach also called "divide and conquer".
• Another reason for using multiprocessing systems is to increase system availability.
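The divide-and-conquer idea above can be sketched in a few lines: split the work into chunks, process the chunks concurrently, and combine the partial results. This is an illustrative sketch, not tied to any particular multiprocessor in the slides.

```python
from concurrent.futures import ThreadPoolExecutor

def parallel_sum(data, workers=4):
    """Divide and conquer: split data into one chunk per worker,
    sum the chunks concurrently, then combine the partial results."""
    chunk = (len(data) + workers - 1) // workers
    parts = [data[i:i + chunk] for i in range(0, len(data), chunk)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = pool.map(sum, parts)   # each worker sums one chunk
    return sum(partials)                  # combine the partial sums

print(parallel_sum(list(range(1000))))    # same result as sum(range(1000))
```

The answer is identical to the sequential one; only the throughput changes, which is exactly the point made on the slide.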
Introduction
• Key attributes of "multiprocessors":
  – A single computer that includes multiple processors
  – Processors may communicate at various levels
    • Message passing or shared memory
• Multiprocessor and multicomputer systems
  – A multicomputer system consists of several autonomous computers which may or may not communicate with each other.
  – A multiprocessor system is controlled by a single operating system which provides mechanisms for interaction among processors.
• Architectural models
  – Tightly coupled multiprocessor
  – Loosely coupled multiprocessor
Tightly coupled multiprocessor (Basics)
• Processors communicate via shared memory.
• There is complete connectivity between processors and memory.
• This connectivity is accomplished by an interconnection network.
• Drawback: performance degradation due to memory conflicts.
Tightly Coupled Architecture (Details)
• A tightly coupled multiprocessor system may be used in cases where speed is the main concern.
• Models:
  – Without private cache
  – With private cache
Architecture (Without Private Cache)
• This model consists of p processors, l memory modules, and d I/O channels.
• Everything is connected using a processor-memory interconnection network (PMIN).
• The PMIN is a switch that can connect every processor to every memory module.
• A memory module can satisfy only one processor's request in a given memory cycle; this conflict is arbitrated by the PMIN.
Tightly Coupled Architecture
• In this system, the best way to prevent such conflicts is to make l equal to p (i.e. the number of memory modules equal to the number of processors).
• Another way of reducing conflicts is to use unmapped local memory (ULM), a reserved memory area for each processor.
• By adding the ULM we reduce the amount of traffic through the PMIN and thereby reduce conflicts to and from memory.
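The effect of the module count l on conflicts can be seen with a small simulation. Each cycle, every processor requests a uniformly random module, and a module serves at most one request per cycle (as the PMIN arbitration above states); the simulation parameters are illustrative assumptions, not from the slides.

```python
import random

def conflict_rate(processors, modules, cycles=10_000, seed=0):
    """Fraction of requests that conflict when each processor
    issues one random memory-module request per cycle and a
    module can serve only one request per cycle."""
    rng = random.Random(seed)
    conflicts = 0
    for _ in range(cycles):
        hits = {}
        for _ in range(processors):
            m = rng.randrange(modules)
            hits[m] = hits.get(m, 0) + 1
        # every request beyond the first to a module is a conflict
        conflicts += sum(c - 1 for c in hits.values())
    return conflicts / (cycles * processors)

# More modules per processor -> fewer conflicts, matching the l = p advice
assert conflict_rate(8, 16) < conflict_rate(8, 4)
```

Even with l = p a substantial fraction of requests still collide under uniform random access, which is why the slides also add unmapped local memory and, later, private caches.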
Tightly coupled multiprocessor contd.
[Figure: p processors connected through the interrupt signal interconnection network (ISIN); the input/output interconnection network (IOPIN) linking them to d I/O channels and disks; and the processor-memory interconnection network (PMIN) linking them to l shared memory modules. Each processor also has mapped and unmapped local memory.]
Problem
• In this architecture, the memory references made by the processors usually go to main memory.
• Memory references common to all processors will cause conflicts.
• The PMIN will resolve these conflicts, but doing so delays operations, which increases instruction cycle time and therefore decreases throughput.
Solution
• The delay can be reduced by giving each processor a private cache that holds its memory references.
• But the cache coherence problem must then be taken care of.
• Refer to the diagram.
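The coherence problem mentioned above can be illustrated with a toy write-invalidate scheme: on a write, every other private cache's copy of the line is invalidated so a later read must re-fetch the fresh value from shared memory. This is a minimal sketch of one coherence policy, not the protocol of any specific machine in the slides.

```python
class Cache:
    """A trivial private cache: address -> cached value."""
    def __init__(self):
        self.lines = {}

    def read(self, mem, addr):
        if addr not in self.lines:
            self.lines[addr] = mem[addr]   # miss: fetch from shared memory
        return self.lines[addr]

def write(mem, caches, writer, addr, value):
    """Write-invalidate: update shared memory and the writer's cache,
    and invalidate the line in every other private cache."""
    mem[addr] = value
    writer.lines[addr] = value
    for c in caches:
        if c is not writer:
            c.lines.pop(addr, None)        # drop the stale copy

mem = {0x10: 1}
c0, c1 = Cache(), Cache()
c1.read(mem, 0x10)                # c1 caches the old value 1
write(mem, [c0, c1], c0, 0x10, 2) # c0 writes 2; c1's copy is invalidated
print(c1.read(mem, 0x10))         # c1 misses and re-fetches: prints 2
```

Without the invalidation step, c1 would keep returning the stale value 1, which is exactly the hazard the slide warns about.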
Tightly coupled multiprocessor contd.
[Figure: the same organization as before, with a private cache added between each processor and the PMIN, alongside the mapped and unmapped local memory.]
Tightly coupled multiprocessor
• The ISIN permits each processor to interrupt any other processor.
• The ISIN is also used by a failing processor to broadcast a message.
• The IOPIN permits processors to communicate with the I/O channels.
Tightly coupled multiprocessor contd.
• Processor types
  – Homogeneous, if all processors perform the same function
  – Heterogeneous, if processors perform different functions
Note: Two functionally identical processors may still differ along other parameters such as I/O or memory size, i.e. they are asymmetric.
Loosely Coupled Architecture
• Each processor has its own set of I/O devices and memory, where it accesses most of its instructions and data.
• Computer module: a processor (P), local memory (LM), and an input/output (I/O) interface, attached to a channel and arbiter switch (CAS).
Loosely coupled multiprocessor contd.
• Inter-process communication across modules happens by exchange of messages, using a message transfer system (MTS).
• The system is distributed; the degree of coupling is loose.
• The degree of memory conflict is lower.
[Figure: N computer modules (each with P, LM, I/O and a CAS) attached to the message transfer system (MTS).]
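The message-passing style above can be sketched with two queues standing in for the MTS: the modules share no memory and interact only by sending and receiving messages. The module names and message contents are illustrative.

```python
import queue
import threading

def module(name, inbox, outbox):
    """A computer module: no shared memory; it only exchanges
    messages over the transfer system (here, thread-safe queues)."""
    msg = inbox.get()                      # receive over the MTS
    outbox.put(f"{name} got: {msg}")       # reply over the MTS

mts_a, mts_b = queue.Queue(), queue.Queue()
t = threading.Thread(target=module, args=("Cm1", mts_a, mts_b))
t.start()
mts_a.put("hello")                         # Cm0 sends a message
t.join()
print(mts_b.get())                         # prints: Cm1 got: hello
```

Note that contention here is on the message channel, not on memory modules, which is why the slide says the degree of memory conflict is lower.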
Loosely coupled multiprocessor
• Inter-module communication
  – Channel and arbiter switch (CAS)
  – The arbiter decides when requests from two or more computer modules collide in accessing a physical segment of the MTS.
  – It is also responsible for delaying other requests until the request being serviced is completed.
Loosely coupled multiprocessor
• Message Transfer System (MTS)
  – Either a time-shared bus or shared memory.
  – The latter can be implemented with a set of memory modules and a processor-memory interconnection network, or with multiported main memory.
  – The MTS determines the performance of the multiprocessor system.
Loosely coupled multiprocessor
• For an LCS that uses a single time-shared bus, performance is limited by the message arrival rate on the bus, the message length, and the bus capacity.
• For an LCS with shared memory, the limiting factor is the memory conflict problem imposed by the processor-memory interconnection network.
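The three bus limits named above combine into a single offered-load figure: utilization = arrival rate × message length / bus capacity. The numbers plugged in below are hypothetical, chosen only to show the calculation.

```python
def bus_utilization(arrival_rate, msg_bits, capacity_bps):
    """Offered load on a time-shared bus: (messages/sec x bits/message)
    divided by bus capacity in bits/sec. Near 1.0 the bus saturates
    and message queueing delay grows sharply."""
    return arrival_rate * msg_bits / capacity_bps

# Hypothetical numbers: 10,000 msgs/s of 1 KiB each over a 100 Mb/s bus
u = bus_utilization(10_000, 8 * 1024, 100_000_000)
print(round(u, 3))   # 0.819: close to saturation
```

Doubling either the arrival rate or the message length here would push the load past 1.0, i.e. the bus, not the processors, becomes the bottleneck.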
Cm* Architecture
• A project at Carnegie Mellon University.
• What is a computer module here?
• A computer module (Cm) consists of a processor (P), an Slocal switch (S), local memory (LM), and I/O.
• The Slocal is similar to the CAS in the loosely coupled architecture.
Cluster of Computer Modules
[Figure: computer modules Cm1 … Cm10 on a shared Map Bus, each containing P, S, LM and I/O; the KMAP connects the Map Bus to the inter-cluster bus.]
Role of Slocal
• The Slocal receives and interprets the processor's requests for access to local memory and I/O.
• It allows a local processor to access resources external to its Cm: it distinguishes local references from external ones, provides translation of local addresses, and forwards non-local references to the Kmap.
Address Translation
[Figure: the Cm* address-translation data path (not reproduced).]
Kmap Components
• The Slocal uses the 4 high-order address bits along with 1 PSW bit to access a map table.
• The map table determines whether the referenced memory is local or not.
• If the memory is non-local, control is given to the Kmap via the map bus.
• Each Cm is connected to the Kmap via the map bus.
• The Kmap is responsible for routing data between Slocals.
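The routing decision just described can be sketched as a bit-field lookup: the 4 high-order address bits and the PSW bit together form a 5-bit index into a map table whose entries say "local" or "forward to Kmap". The 16-bit address width and the table layout are assumptions for illustration, not details given in the slides.

```python
def route(addr16, psw_bit, map_table):
    """Cm*-style routing sketch: 4 high-order address bits plus one
    PSW bit index a 32-entry map table; True means the reference is
    local, False means it must be forwarded to the Kmap."""
    index = ((addr16 >> 12) & 0xF) | (psw_bit << 4)   # 5-bit index
    return "local" if map_table[index] else "kmap"

table = [True] * 32
table[0x1F] = False                 # mark one entry as non-local
assert route(0x3000, 0, table) == "local"
assert route(0xF000, 1, table) == "kmap"
```

The point of the sketch is that locality is decided purely from a few address bits plus the table, so the common local case never touches the Kmap.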
Kmap Components
[Figure: the Kmap comprises the Kbus, the Pmap (with RUN and OUT queues and SERVICE/RETURN ports), and the Link, whose send ports connect to intercluster buses 1 and 2; the Kbus drives the Map Bus to the cluster's Cms.]
Kmap Components
• A request for non-local memory arrives at the Kbus via the map bus.
• The Link manages communication between this Kmap and other Kmaps.
• The Pmap is the mapping processor, which responds to requests between the Kbus and the Link.
Kmap Components
• A Kmap can handle 8 processor requests simultaneously.
• The Pmap uses queues to handle requests.
• A service request is signaled to the Kbus whenever a non-local memory reference is made; the computer module making the reference is called the master Cm.
Kmap Components
• The Kmap fetches the virtual address via the map bus and allocates a context in the Pmap.
• It places the virtual address in the Pmap's RUN queue.
• The Pmap performs virtual-to-physical address translation.
• Using the physical address, it can initiate a memory access in any Cm.
Kmap Components
• The Kmap services the OUT request by sending the physical address of the memory request via the map bus.
• When the destination Cm completes the memory access, it sends a return signal to the Kmap.
Intra-cluster Communication
[Figure: master Cm and slave Cm on the Map Bus; inside the KMAP, the Kbus and the Pmap with its RUN and OUT queues. Steps 1-5 below are numbered in the figure.]
1. The master Cm initiates a non-local memory access.
2. The Kbus fetches the master Cm's virtual address.
3. The Kbus activates a context (creating the transaction's data structure) on the Pmap's RUN queue.
4. The Pmap processes the context and performs the address translation.
5. The Pmap places on the OUT queue a request for a memory cycle in the slave Cm of the current cluster.
Intra-cluster Communication
[Figure: the same organization, with steps 6-9 numbered in the figure.]
6. The Kbus sends the physical address to the slave Cm over the Map Bus.
7. The slave Cm performs the local memory access cycle.
8. The Kbus forwards the result of the memory access operation to the master Cm.
9. The master Cm takes the data and continues execution.
Inter-cluster Communication
[Figure: master Cm → master KMAP → intercluster bus → slave KMAP → slave Cm; steps 1-5 below are numbered in the figure.]
1. The master Cm sends a transfer request to the master KMAP.
2. The master KMAP prepares and encodes the intercluster message/request packet.
3. The intercluster message is transmitted over the intercluster bus using its routing algorithms.
4. The slave KMAP decodes the incoming request and dispatches it within its cluster.
5. The memory cycle request is sent to the slave Cm.
Inter-cluster Communication
[Figure: the same path in reverse, with address formats (Cop/Segment/Offset; R/W, Cm#, Page, Offset; K/U, R/W, Cm#, Page, Offset) annotated; steps 5-10 below are numbered in the figure.]
5. The slave Cm transmits the result to the slave KMAP.
6. The slave KMAP prepares the intercluster reply message (i.e. reactivates the context).
7. The slave KMAP transmits the result to the master KMAP.
8. The master KMAP receives and interprets the message.
9. The result is sent to the master Cm.
10. The result is received by the master Cm.
BIGDATA Facts
• Data-intensive applications work with petabytes of data.
• Web pages: 20+ billion web pages × 20 KB = 400+ terabytes.
• One computer can read 30-35 MB/sec from disk: roughly four months to read the web.
• The same problem with 1000 machines: less than 3 hours.
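The back-of-the-envelope numbers above can be checked directly. The calculation below uses 32.5 MB/s (the middle of the 30-35 MB/s range) and binary terabytes; the slide's "four months" and "< 3 hours" are the same arithmetic with slightly rougher figures.

```python
def read_time_days(total_tb, mb_per_sec, machines=1):
    """Days needed to scan total_tb terabytes when each of
    `machines` machines reads at mb_per_sec from disk."""
    total_mb = total_tb * 1024 * 1024        # TB -> MB (binary units)
    seconds = total_mb / (mb_per_sec * machines)
    return seconds / 86_400                  # seconds -> days

print(round(read_time_days(400, 32.5)))             # about 149 days
print(round(read_time_days(400, 32.5, 1000) * 24, 1))  # about 3.6 hours
```

The single-machine figure lands around five months and the 1000-machine figure around three and a half hours: the same order of magnitude as the slide, and the same conclusion: parallelism, not faster single disks, is what makes the problem tractable.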
Single-thread performance doesn't matter
• We have large problems, and total throughput/price matters more than peak performance.
Stuff breaks: we need more reliability
• If you have one server, it may stay up three years (1,000 days).
• If you have 10,000 servers, expect to lose ten a day.
"Ultra-reliable" hardware doesn't really help
• At large scale, super-fancy reliable hardware still fails, albeit less often.
  – Software still needs to be fault-tolerant.
  – Commodity machines without fancy hardware give better performance/price.
What is Hadoop?
• It is a framework for running applications on large clusters of commodity hardware that produce huge amounts of data and need to process it.
• Hadoop provides distributed processing of big data that is stored at different physical locations.
• The Apache Hadoop software library is a framework that allows
for the distributed processing of large data sets across clusters
of computers using simple programming models.
• It is designed to scale up from single servers to thousands of
machines, each offering local computation and storage.
• Rather than rely on hardware to deliver high availability, the library itself is designed to detect and handle failures at the application layer, thereby delivering a highly available service on top of a cluster of computers, each of which may be prone to failure.
Hadoop Includes
• HDFS: a distributed filesystem.
• MapReduce: a programming model implemented on top of HDFS; it is an offline computing engine.
Hadoop HDFS
• Hardware failure is the norm rather than the exception.
• Moving computation is cheaper than moving data.
HDFS
• Runs on commodity hardware.
• HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware.
• Provides high-throughput access to application data.
• Suitable for applications that have large data sets.
NameNode and DataNodes
• HDFS has a master/slave architecture.
• NameNode: manages the file system namespace and regulates access to files by clients.
• DataNodes, usually one per node in the cluster, manage storage attached to the nodes that they run on.
• A file is split into one or more blocks.
• These blocks are stored in a set of DataNodes.
• The NameNode executes file system namespace operations like opening, closing, and renaming files and directories.
• It also determines the mapping of blocks to DataNodes.
• The DataNodes are responsible for serving read and write requests from the file system's clients.
• The DataNodes also perform block creation, deletion, and replication upon instruction from the NameNode.
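The block mechanics above can be sketched in a few lines: split a file into fixed-size blocks and assign each block's replicas to DataNodes. The 128 MB block size matches HDFS defaults, but the round-robin placement is a deliberate simplification (real HDFS placement is rack-aware), and the node names are made up.

```python
def split_into_blocks(file_size, block_size=128 * 1024 * 1024):
    """Split a file into fixed-size blocks, HDFS-style; the last
    block may be shorter. Returns (offset, length) pairs."""
    blocks, off = [], 0
    while off < file_size:
        blocks.append((off, min(block_size, file_size - off)))
        off += block_size
    return blocks

def place_replicas(num_blocks, datanodes, replication=3):
    """Toy placement: round-robin each block's replicas over the
    DataNodes. Real HDFS is rack-aware; this only shows the
    block -> nodes mapping the NameNode maintains."""
    n = len(datanodes)
    return {b: [datanodes[(b + r) % n] for r in range(replication)]
            for b in range(num_blocks)}

blocks = split_into_blocks(300 * 1024 * 1024)   # 128 MB + 128 MB + 44 MB
print(len(blocks))                              # 3
```

The NameNode holds only this kind of metadata (which blocks exist and where their replicas live); the block bytes themselves never pass through it.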
HDFS Internal
[Figure: HDFS internal architecture (not reproduced).]
Hadoop MapReduce
• A software framework for easily writing applications which process vast amounts of data (multi-terabyte data sets) in parallel on large clusters (thousands of nodes) of commodity hardware in a reliable, fault-tolerant manner.
• A MapReduce job usually splits the input data set into independent chunks which are processed by the map tasks in a completely parallel manner.
• The framework sorts the outputs of the maps, which are then input to the reduce tasks.
• Typically the compute nodes and the storage nodes are the same.
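The map → shuffle/sort → reduce pipeline just described can be shown with a minimal in-process version. This is a sketch of the programming model only; it ignores everything that makes Hadoop useful at scale (distribution, HDFS, fault tolerance).

```python
from collections import defaultdict
from itertools import chain

def map_reduce(inputs, mapper, reducer):
    """Minimal in-process MapReduce: map each input split to (key, value)
    pairs, group the pairs by key (the shuffle/sort), then reduce each
    group to a final value."""
    pairs = chain.from_iterable(mapper(x) for x in inputs)
    groups = defaultdict(list)
    for k, v in pairs:                       # shuffle: group values by key
        groups[k].append(v)
    return {k: reducer(k, vs) for k, vs in sorted(groups.items())}

# Classic word count over two input "splits"
splits = ["the quick brown fox", "the lazy dog the end"]
counts = map_reduce(splits,
                    mapper=lambda line: [(w, 1) for w in line.split()],
                    reducer=lambda k, vs: sum(vs))
print(counts["the"])    # 3
```

Because each split is mapped independently and each key is reduced independently, both phases parallelize trivially, which is exactly the property the slide's "completely parallel manner" refers to.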
Hadoop MapReduce
• The MapReduce framework consists of a single master JobTracker and one slave TaskTracker per cluster node.
• The master is responsible for scheduling the jobs' component tasks on the slaves, monitoring them, and re-executing failed tasks.
• The slaves execute the tasks as directed by the master.
Hadoop MapReduce
• Applications specify the input/output locations.
• They supply map and reduce functions via implementations of appropriate interfaces and/or abstract classes.
• The Hadoop job client then submits the job and configuration to the JobTracker.
• The JobTracker assumes responsibility for distributing the software/configuration to the slaves, scheduling tasks, monitoring them, and providing status and diagnostic information.
