Multi-Processor-Parallel Processing PDF

- Parallel processing involves executing multiple instructions simultaneously on different data using multiple processors. This can improve performance over sequential processing.
- There are three main categories of parallel processing architectures: Single Instruction Multiple Data (SIMD), Multiple Instruction Single Data (MISD), and Multiple Instruction Multiple Data (MIMD).
- Symmetric multiprocessors (SMPs) and non-uniform memory access (NUMA) systems are two subcategories of MIMD systems. SMPs have uniform memory access times, while memory access times can vary across processors in NUMA systems.

Uploaded by

Barnali Dutta

mywbut.com

Multi-Processor / Parallel Processing

Parallel Processing:
Traditionally, the computer has been viewed as a sequential machine. Most computer
programming languages require the programmer to specify algorithms as sequences of
instructions.
A processor executes a program by executing machine instructions in sequence, one at a
time.
Each instruction is executed as a sequence of operations (fetch instruction, fetch operands,
perform operation, store result).
At the micro-operation level, however, multiple control signals are generated at the
same time.
Instruction pipelining, at least to the extent of overlapping fetch and execute operations, has
been around for a long time.
Observing these phenomena, researchers investigated whether some operations can be
performed in parallel.
As computer technology has evolved, and as the cost of computer hardware has dropped,
computer designers have sought more and more opportunities for parallelism, usually to
enhance performance and, in some cases, to increase availability.
The taxonomy first introduced by Flynn is still the most common way of categorizing systems
with parallel processing capability. Flynn proposed the following categories of computer
systems:

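Single Instruction, Single Data (SISD) system: A single processor executes a single
instruction stream to operate on data stored in a single memory. Uniprocessors fall into
this category.

Single Instruction, Multiple Data (SIMD) system: A single machine instruction controls
the simultaneous execution of a number of processing elements on a lockstep basis.
Each processing element has an associated data memory, so that each instruction is
executed on a different set of data by the different processors. Vector and array
processors fall into this category.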

Multiple Instruction, Single Data (MISD) system: A sequence of data is transmitted to a
set of processors, each of which executes a different instruction sequence. This
structure has not been commercially implemented.

Multiple Instruction, Multiple Data (MIMD) system: A set of processors simultaneously
execute different instruction sequences on different data sets. SMPs, clusters, and
NUMA systems fit into this category.

With the MIMD organization, the processors are general purpose; each is able to process all of
the instructions necessary to perform the appropriate data transformation.
MIMD systems in which the processors share memory can be further subdivided into two main
categories:

Symmetric multiprocessor (SMP): In an SMP, multiple processors share a single
memory or a pool of memory by means of a shared bus or other interconnection
mechanism. A distinguishing feature is that the memory access time to any region of
memory is approximately the same for each processor.
Nonuniform memory access (NUMA): The memory access time to different regions of
memory may differ for a NUMA processor.

The design issues relating to SMPs and NUMA systems are complex, involving physical
organization, interconnection structures, interprocessor communication, operating system
design, and application software techniques.

Symmetric Multiprocessors:
A symmetric multiprocessor (SMP) can be defined as a standalone computer system with the
following characteristics:
1. There are two or more similar processors of comparable capability.
2. These processors share the same main memory and I/O facilities and are interconnected
by a bus or other internal connection scheme.
3. All processors share access to I/O devices, either through the same channels or through
different channels that provide paths to the same device.
4. All processors can perform the same functions.
5. The system is controlled by an integrated operating system that provides interaction
between processors and their programs at the job, task, file, and data element levels.
The operating system of an SMP schedules processes or threads across all of the processors. An
SMP has the following potential advantages over a uniprocessor architecture:

Performance: A system with multiple processors will yield greater performance than one
with a single processor of the same type, provided the work can be organized so
that some portions of it can be done in parallel.

Availability: Since all the processors can perform the same functions in a symmetric
multiprocessor, the failure of a single processor does not stop the machine. Instead, the
system can continue to function at a reduced performance level.
Incremental growth: A user can enhance the performance of a system by adding an
additional processor.
Scaling: Vendors can offer a range of products with different price and performance
characteristics based on the number of processors configured in the system.
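
The Performance point above depends on how much of the work can actually be done in parallel. One standard way to quantify this, which the text does not name, is Amdahl's law. The following is a minimal sketch under that assumption; the function name and the 90% parallel fraction are illustrative choices, not values from the source.

```python
def amdahl_speedup(parallel_fraction: float, processors: int) -> float:
    """Speedup predicted by Amdahl's law for a fixed-size workload.

    parallel_fraction: share of the work that can run in parallel (0.0 to 1.0).
    processors: number of identical processors available.
    """
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / processors)


if __name__ == "__main__":
    # Hypothetical workload in which 90% of the work can be parallelized.
    for count in (1, 2, 4, 8):
        print(count, round(amdahl_speedup(0.9, count), 2))
```

Under this model the serial portion bounds the achievable speedup, which is why adding processors helps only to the extent that the task can be organized for parallel execution.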

Organization:
The organization of a multiprocessor system is shown in Figure 10.1.

Figure 10.1: Block diagram of tightly coupled multiprocessors

There are two or more processors. Each processor is self-sufficient, including a control
unit, ALU, registers, and cache.
Each processor has access to a shared main memory and the I/O devices through an
interconnection network.
The processors can communicate with each other through memory (messages and status
information left in common data areas).
It may also be possible for processors to exchange signals directly.
The memory is often organized so that multiple simultaneous accesses to separate
blocks of memory are possible.
In some configurations, each processor may also have its own private main memory and
I/O channels in addition to the shared resources.
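
Communication through common data areas, combined with a direct signal, can be illustrated in software. The sketch below is only an analogy: Python threads stand in for processors, a lock-protected dictionary stands in for the shared memory, and a `threading.Event` stands in for the direct signal. The class `SharedMemory`, the mailbox name "P1", and the message text are hypothetical.

```python
import threading


class SharedMemory:
    """Common data area that every simulated processor can read and write."""

    def __init__(self):
        self._lock = threading.Lock()   # serializes access to the shared area
        self._mailboxes = {}            # destination name -> list of messages

    def post(self, dest, message):
        """Leave a message in the common area for processor `dest`."""
        with self._lock:
            self._mailboxes.setdefault(dest, []).append(message)

    def collect(self, dest):
        """Remove and return all messages waiting for processor `dest`."""
        with self._lock:
            return self._mailboxes.pop(dest, [])


def run_demo():
    memory = SharedMemory()
    posted = threading.Event()          # models a direct processor-to-processor signal
    received = []

    def producer():
        memory.post("P1", "status: block 7 ready")
        posted.set()                    # signal that a message is waiting

    def consumer():
        posted.wait()                   # wait for the signal, then read shared memory
        received.extend(memory.collect("P1"))

    threads = [threading.Thread(target=consumer), threading.Thread(target=producer)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return received


if __name__ == "__main__":
    print(run_demo())
```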

The organization of a multiprocessor system can be classified as follows:

Time-shared or common bus
Multiport memory
Central control unit

Time shared Bus:

The time-shared bus is the simplest mechanism for constructing a multiprocessor system. The bus
consists of control, address, and data lines. The block diagram is shown in Figure 10.2.

Figure 10.2: Time shared bus

The following features are provided in the time-shared bus organization:

Addressing: It must be possible to distinguish modules on the bus to determine the
source and destination of data.
Arbitration: Any I/O module can temporarily function as master. A mechanism is
provided to arbitrate competing requests for bus control, using some sort of priority
scheme.
Time sharing: When one module is controlling the bus, other modules are locked out
and, if necessary, suspend operation until bus access is achieved.
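
The arbitration and time-sharing features above can be sketched as a fixed-priority arbiter. The text says only "some sort of priority scheme", so the fixed ordering used here is an assumption, and the module names ("DMA", "CPU0", "CPU1") are hypothetical.

```python
def arbitrate(requests, priority):
    """Return the requesting module with the highest priority, or None.

    requests: set of module names currently asking for the bus.
    priority: list of module names, highest priority first (assumed fixed).
    """
    for module in priority:
        if module in requests:
            return module
    return None


def run_bus(requests, priority):
    """Grant the bus to one master at a time until every request is served.

    While one module holds the bus, the others stay locked out and wait.
    """
    waiting = set(requests)
    grant_order = []
    while waiting:
        master = arbitrate(waiting, priority)
        if master is None:
            raise ValueError("a requesting module has no assigned priority")
        grant_order.append(master)
        waiting.remove(master)
    return grant_order


if __name__ == "__main__":
    print(run_bus({"CPU1", "DMA", "CPU0"}, ["DMA", "CPU0", "CPU1"]))
```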

The bus organization has several advantages compared with other approaches:

Simplicity: This is the simplest approach to multiprocessor organization. The physical
interface and the addressing, arbitration, and time-sharing logic of each processor
remain the same as in a single-processor system.
Flexibility: It is generally easy to expand the system by attaching more processors to the
bus.
Reliability: The bus is essentially a passive medium, and the failure of any attached
device should not cause failure of the whole system.

The main drawback to the bus organization is performance. All memory references pass through
the common bus, so the speed of the system is limited by the bus cycle time.
To improve performance, each processor can be equipped with local cache memory.
The use of caches leads to a new problem, known as the cache coherence problem. Each
local cache contains an image of a portion of main memory. If a word is altered in one cache, the
copy of that word held in another cache may become stale. To prevent processors from reading
stale data, the other caches must be notified of the change so that they can update or invalidate
their own copies.
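
The cache coherence problem can be made concrete with a toy model. The sketch below assumes a write-through, write-invalidate policy, which is one common approach and is not a protocol named in the text; the class `CoherentSystem` and the address 0x10 are hypothetical.

```python
class CoherentSystem:
    """Toy model of several private caches in front of one shared main memory."""

    def __init__(self, num_processors, memory):
        self.memory = dict(memory)                        # address -> value
        self.caches = [dict() for _ in range(num_processors)]

    def read(self, cpu, address):
        """Read through the cache of `cpu`, loading from memory on a miss."""
        cache = self.caches[cpu]
        if address not in cache:
            cache[address] = self.memory[address]
        return cache[address]

    def write(self, cpu, address, value):
        """Write a word and invalidate every other cached copy of it."""
        for other, cache in enumerate(self.caches):
            if other != cpu:
                cache.pop(address, None)                  # invalidate the stale copy
        self.caches[cpu][address] = value
        self.memory[address] = value                      # write-through (assumed)


if __name__ == "__main__":
    system = CoherentSystem(2, {0x10: 1})
    system.read(0, 0x10)
    system.read(1, 0x10)          # both caches now hold the word
    system.write(0, 0x10, 5)      # CPU 1's copy is invalidated
    print(system.read(1, 0x10))   # CPU 1 misses and reloads the new value
```

Without the invalidation loop in `write`, the second processor would keep returning its old cached copy, which is exactly the stale-data situation the paragraph describes.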

Multiport Memory:

The multiport memory approach allows the direct, independent access of main memory
modules by each processor and I/O module. The multiport memory system is shown in
Figure 10.3.

Figure 10.3: Multiport memory

The multiport memory approach is more complex than the bus approach, requiring a fair
amount of logic to be added to the memory system. Logic associated with memory is required
for resolving conflicts. The method often used to resolve conflicts is to assign permanently
designated priorities to each memory port.

Non-uniform Memory Access (NUMA):
In a NUMA architecture, all processors have access to all parts of main memory using loads and
stores. The memory access time of a processor differs depending on which region of main
memory is accessed. This is true for all processors; however, which memory regions are slower
and which are faster differs from processor to processor.
A NUMA system in which cache coherence is maintained among the caches of the various
processors is known as cache-coherent NUMA (CC-NUMA).
A typical CC-NUMA organization is shown in Figure 10.4.

Figure 10.4: CC- NUMA Organization

There are multiple independent nodes, each of which is, in effect, an SMP organization.

Each node contains multiple processors, each with its own L1 and L2 caches, plus main
memory.
The node is the basic building block of the overall CC-NUMA organization.
The nodes are interconnected by means of some communication facility, which could be a
switching mechanism, a ring, or some other networking facility.

Each node in the CC-NUMA system includes some main memory.

From the point of view of the processors, there is only a single addressable memory, with each
location having a unique system-wide address.
When a processor initiates a memory access, if the requested memory location is not in that
processor's cache, then the L2 cache initiates a fetch operation.
If the desired line is in the local portion of the main memory, the line is fetched across the local
bus.
If the desired line is in a remote portion of the main memory, then an automatic request is sent
out to fetch that line across the interconnection network, deliver it to the local bus, and then
deliver it to the requesting cache on that bus.
All of this activity is automatic and transparent to the processor and its cache.
In this configuration, cache coherence is a central concern. For that reason, each node must
maintain some sort of directory that gives it an indication of the location of various portions of
memory and also cache status information.
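
The fetch sequence described above can be traced with a small model. This is a sketch under stated assumptions: the single system-wide address space is split into equal contiguous ranges, one per node, and the constant `NODE_MEMORY_SIZE` and the function names are hypothetical. Directory and coherence handling are left out.

```python
NODE_MEMORY_SIZE = 1024  # hypothetical number of addresses held by each node


def home_node(address, node_size=NODE_MEMORY_SIZE):
    """Return the node whose local main memory holds this system-wide address."""
    return address // node_size


def fetch_path(cpu_node, address, cached_addresses, node_size=NODE_MEMORY_SIZE):
    """List, in order, the components a memory access passes through."""
    if address in cached_addresses:
        return ["cache"]                                   # hit: no fetch needed
    owner = home_node(address, node_size)
    if owner == cpu_node:
        return ["cache", "local bus", "local memory"]      # local miss
    # Remote miss: the request crosses the interconnection network.
    return ["cache", "local bus", "interconnect", f"memory of node {owner}"]


if __name__ == "__main__":
    print(fetch_path(0, 100, {100}))    # cache hit
    print(fetch_path(0, 100, set()))    # line is in the local node's memory
    print(fetch_path(0, 2500, set()))   # line lives on another node
```

The longer path for the remote case is the source of the non-uniform access time: the processor issues the same load in all three cases, but the number of components involved differs.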

Interconnection Networks:
In a multiprocessor system, the interconnection network must allow information transfer
between any pair of modules in the system. The traffic in the network consists of requests (such
as read and write), data transfers, and various commands.

Single Bus:
The simplest and most economical means of interconnecting a number of modules is to use a
single bus.
Since several modules are connected to the bus and any module can request a data transfer at
any time, it is essential to have an efficient bus arbitration scheme.
In a simple mode of operation, the bus is dedicated to a particular source-destination pair for
the full duration of the requested transfer. For example, when a processor issues a read request
on the bus, it holds the bus until it receives the desired data from the memory module.
Since the memory module needs a certain amount of time to access the data, the bus will
be idle until the memory is ready to respond with the data.
Then the data is transferred to the processor. When this transfer is completed, the bus can be
assigned to handle another request.
A scheme known as the split-transaction protocol makes it possible to use the bus during the
idle period to serve another request.
Consider the following method of handling a series of read requests, possibly from different
processors.
After transferring the address involved in the first request, the bus may be reassigned to
transfer the address of the second request, assuming that this request is to a different memory
module.
At this point, we have two modules proceeding with read access cycles in parallel.
If neither module has finished with its access, the bus may be reassigned to a third request, and
so on.
Eventually, the first memory module completes its access cycle and uses the bus to transfer the
data to the corresponding source.
As other modules complete their cycles, the bus is needed to transfer their data to the
corresponding sources.
The split-transaction protocol allows the bus and the available bandwidth to be used more
efficiently. The performance improvement achieved with this protocol depends on the
relationship between the bus transfer time and the memory access time.
In the split-transaction protocol, performance is improved at the cost of increased bus complexity.

There are two reasons why complexity increases:

Since a memory module needs to know which source initiated a given read request,
a source identification tag must be attached to the request.
Complexity also increases because all modules, not just the processors, must be
able to act as bus master.
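
The gain from the split-transaction protocol can be estimated with a simple cycle count. The sketch below rests on assumptions that are not in the text: every address transfer and every data transfer takes one bus cycle, each read goes to a different memory module, all modules have the same access time, and a ready response takes precedence over sending a new address. The function names are hypothetical.

```python
def simple_bus_cycles(num_reads, mem_cycles):
    """Cycles needed when the bus is held for the whole of each read."""
    return num_reads * (1 + mem_cycles + 1)   # address + memory access + data


def split_transaction_cycles(num_reads, mem_cycles):
    """Cycles needed when the bus is released while a module accesses its data."""
    pending = num_reads     # reads whose address has not been sent yet
    ready_at = []           # cycle in which each outstanding response becomes ready
    completed = 0
    cycle = 0
    while completed < num_reads:
        if ready_at and ready_at[0] <= cycle:
            ready_at.pop(0)                          # bus carries a data response
            completed += 1
        elif pending:
            pending -= 1                             # bus carries the next address
            ready_at.append(cycle + 1 + mem_cycles)
        # otherwise the bus is idle during this cycle
        cycle += 1
    return cycle


if __name__ == "__main__":
    for reads in (1, 3, 8):
        print(reads, simple_bus_cycles(reads, 4), split_transaction_cycles(reads, 4))
```

With a single read the two schemes take the same time, since there is nothing to overlap. The benefit grows with the number of outstanding reads and with the ratio of memory access time to bus transfer time.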

The main limitation of a single bus is that only a limited number of modules can be connected
to it before contention for the bus degrades performance. Networks that allow multiple
independent transfer operations to proceed in parallel can provide a significantly increased
data transfer rate.

Crossbar Network:
A crossbar switch is a versatile switching network. It is basically a network of switches. Any
module can be connected to any other module by closing the appropriate switch. Such
networks, where there is a direct link between all pairs of nodes, are called fully connected
networks.
In a fully connected network, many simultaneous transfers are possible. If n sources need to
send data to n distinct destinations, then all of these transfers can take place concurrently.
Since no transfer is prevented by the lack of a communication path, the crossbar is called a
nonblocking switch.
In Figure 10.5, which shows a crossbar interconnection network, a single switch is shown at
each cross point. In an actual multiprocessor system, the paths through the crossbar network
are much wider.

Figure 10.5: Crossbar Interconnection Network

If there are $n$ modules in a network, then $n^2$ cross points are needed to interconnect
the $n$ modules. The total number of switches becomes large as $n$ increases.
In a crossbar switch, conflicts occur when two or more concurrent requests are made to the
same destination device. These conflicting requests are usually handled on a predetermined
priority basis.
The crossbar switch has the potential for the highest bandwidth and system efficiency.
However, because of its complexity and cost, it may not be cost effective for a large
multiprocessor system.
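
Conflict handling in a crossbar can be sketched as follows. The sketch assumes that each source issues at most one request at a time and that the predetermined priority is a fixed ordering of the sources. The function name and the labels "P0" and "M0" are hypothetical.

```python
def schedule_crossbar(requests, priority):
    """Decide which concurrent requests a crossbar can serve at once.

    requests: list of (source, destination) pairs made at the same time.
    priority: list of sources, highest priority first (assumed fixed).
    Returns (granted, blocked): granted maps destination -> source,
    blocked lists the requests that must wait.
    """
    rank = {source: position for position, source in enumerate(priority)}
    granted = {}
    blocked = []
    for source, destination in sorted(requests, key=lambda req: rank[req[0]]):
        if destination in granted:
            blocked.append((source, destination))   # destination already in use
        else:
            granted[destination] = source            # close this cross point
    return granted, blocked


if __name__ == "__main__":
    order = ["P0", "P1", "P2"]
    # Distinct destinations: every transfer proceeds concurrently.
    print(schedule_crossbar([("P0", "M2"), ("P1", "M0"), ("P2", "M1")], order))
    # Two sources want M1: the lower-priority request is blocked.
    print(schedule_crossbar([("P0", "M1"), ("P1", "M0"), ("P2", "M1")], order))
```

Note that a request is only ever blocked because its destination is busy, never because a path is unavailable, which is what makes the crossbar nonblocking.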

Multistage Network:
The bus and crossbar systems use a single stage of switching to provide a path from a source to
a destination.
In a multistage network, multiple stages of switches are used to set up a path between a source
and a destination.
Such networks are less costly than the crossbar structure, yet they provide a reasonably large
number of parallel paths between sources and destinations.
Figure 10.6 shows a three-stage network, called a shuffle network, that interconnects eight
modules.
The term "shuffle" describes the pattern of connections from the outputs of one stage to the
inputs of the next stage.
Each switchbox in Figure 10.6 is a $2 \times 2$ switch that can route either input to either output.

If the inputs request distinct outputs, they can both be routed simultaneously in the
straight-through or crossed pattern.

If both inputs request the same output, only one request can be satisfied. The other one is
blocked until the first request finishes using the switch.

Figure 10.6: Multistage Shuffle Network

A network consisting of $s$ stages can be used to interconnect $2^s$ modules. In this case,
there is exactly one path through the network from any module $P_i$ to any module $P_j$.
Therefore, this network provides full connectivity between sources and destinations.

Many request patterns cannot be satisfied simultaneously. For example, the connection from P2
to P7 cannot be provided at the same time as the connection from P3 to P6.
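
The blocking example above can be reproduced with a small routing model. Figure 10.6 is not reproduced in the text, so the wiring used here is an assumption: three columns of $2 \times 2$ switchboxes, with a perfect shuffle between consecutive columns, and each switchbox choosing its upper or lower output from one bit of the destination number (most significant bit first). The function names are hypothetical.

```python
def rotate_left(line, bits):
    """Perfect shuffle: rotate a `bits`-wide line number left by one bit."""
    mask = (1 << bits) - 1
    return ((line << 1) | (line >> (bits - 1))) & mask


def route(src, dst, stages=3):
    """Return the (stage, output line) pairs used on the path from src to dst."""
    line = src
    used = []
    for stage in range(stages):
        bit = (dst >> (stages - 1 - stage)) & 1    # destination bit for this stage
        line = (line & ~1) | bit                   # upper output (0) or lower output (1)
        used.append((stage, line))
        if stage < stages - 1:
            line = rotate_left(line, stages)       # shuffle wiring to the next stage
    return used


def conflict(path_a, path_b):
    """Two paths conflict if they need the same switchbox output at any stage."""
    return bool(set(path_a) & set(path_b))


if __name__ == "__main__":
    print(route(2, 7))
    print(route(3, 6))
    print(conflict(route(2, 7), route(3, 6)))
```

Under this assumed wiring, P2 and P3 enter the same first-stage switchbox and both need its lower output, so only one of the two connections can be set up at a time. A different wiring of the figure could block at a different stage.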
A multistage network is less expensive to implement than a crossbar network. If $n$ nodes are
to be interconnected using this scheme, then we must use $s = \log_2 n$ stages with $n/2$
switchboxes per stage. Since each switchbox contains four switches, the total number of
switches is $4 \times \frac{n}{2} \times \log_2 n = 2n \log_2 n$, which, for a large network, is
considerably less than the $n^2$ switches needed in a crossbar network.
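
The two switch counts can be compared numerically. The sketch below simply evaluates $n^2$ for the crossbar and $4 \times (n/2) \times \log_2 n$ for the multistage network; the function names are hypothetical, and $n$ is assumed to be a power of two.

```python
def crossbar_switches(n):
    """Switches in a crossbar connecting n modules: one per cross point."""
    return n * n


def multistage_switches(n):
    """Switches in a shuffle network of 2x2 switchboxes connecting n modules."""
    if n < 2 or n & (n - 1) != 0:
        raise ValueError("n must be a power of two, at least 2")
    stages = n.bit_length() - 1          # log2(n) for a power of two
    switchboxes_per_stage = n // 2
    return 4 * switchboxes_per_stage * stages


if __name__ == "__main__":
    for n in (8, 64, 1024):
        print(n, crossbar_switches(n), multistage_switches(n))
```

For the eight-module network the saving is modest (48 switches against 64), but the gap widens quickly as the number of modules grows.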

Multistage networks are less capable of providing concurrent connections than crossbar
switches. The connection path between one pair of modules is indicated by red lines in
Figure 10.6.
