ch.4 and 5

The document discusses interconnection bus architecture and various bus structures used in parallel processing systems, including time-shared common buses, multiport memory, and crossbar switches. It explains the advantages and disadvantages of each structure, highlighting their impact on system performance and communication efficiency. Additionally, it introduces array processors as a type of SIMD system, detailing their operation, organization, and applications in enhancing instruction processing speed.

Uploaded by

alitheengineer02


Parallel Processing

4th Year
Term2

Ch4. Interconnection Bus Architecture

Dr. Fatemah Al Assfor

Interconnection Bus Structures
Bus: a set of communication lines connecting two or more devices (or components).
• It is a shared transmission medium.
• A bus consists of multiple lines.
Each line is capable of transmitting a single binary value, "0" or "1".
A bus that connects major components (such as CPU, memory, and I/O) is called the system bus (or internal system bus).

[Figure: the system bus consists of address, data, and control lines]

- The number of lines in the address bus (n bits) determines the maximum size of the main memory (RAM): memory size = $2^n$ locations.
- The number of lines in the data bus determines how many bits each main memory location can store.
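The two rules above can be checked with a few lines of Python. This is only an illustrative sketch; the function names `addressable_locations` and `memory_capacity_bits` are hypothetical and do not come from the lecture.

```python
def addressable_locations(address_lines: int) -> int:
    """Number of distinct memory locations an n-line address bus can select (2^n)."""
    return 2 ** address_lines


def memory_capacity_bits(address_lines: int, data_lines: int) -> int:
    """Total capacity in bits, assuming each location is as wide as the data bus."""
    return addressable_locations(address_lines) * data_lines


if __name__ == "__main__":
    # Example: 16 address lines and 8 data lines.
    print(addressable_locations(16))    # 65536 locations
    print(memory_capacity_bits(16, 8))  # 524288 bits
```

For example, 16 address lines select 65536 locations, and with an 8-line data bus this corresponds to 64 KB of memory.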
Interconnection Bus Structures (cont.)
Control signals (control bus): control lines used to control the operation of the system memory, I/O devices, instruction execution, the ALU, etc.
Some of the control signals are:
- Memory read: RD
- Memory write: WR
- I/O read
- I/O write
- Clock
- Reset
- Interrupt
- Request
Several physical (hardware) techniques are available for establishing an interconnection network. Some of these schemes are presented in this section:
1. Time-shared common bus
2. Multiport memory
3. Crossbar switch
4. Multistage switching network
5. Hypercube system
1) Time-Shared Common Bus
A common-bus multiprocessor system consists of several processors and I/O devices connected through a common path (bus) to the main memory unit.
- Example: A time-shared common bus for three processors and two I/O devices is shown in Fig. 4.1.

Fig. 4.1: Time-shared common bus

- Any processor that wants to transfer information must first check the availability of the bus. When the bus is available, that processor addresses the destination unit to start the transfer, and the receiving unit responds to the control signals from that processor.
1) Time-Shared Common Bus (cont.)
Only one processor can communicate with the memory or another processor at any given time. All other processors or units are therefore either busy with internal operations or idle, waiting for the bus.

Disadvantages
- Only one transfer can take place on the bus at any given time.
- Consequently, the total transfer rate (bandwidth) within the system is limited by the speed of the single path.
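The bandwidth limit can be illustrated with a small simulation. The sketch below assumes a round-robin arbitration policy and one transfer per bus cycle; neither assumption comes from the lecture, and the function name `simulate_common_bus` is hypothetical.

```python
def simulate_common_bus(pending):
    """Serve one transfer per cycle on a single shared bus (assumed round-robin).

    pending: dict mapping a unit name to the number of transfers it wants.
    Returns (schedule, waiting): the unit served in each cycle, and the number
    of cycles each unit spent waiting while it still had transfers pending.
    """
    pending = dict(pending)               # do not modify the caller's dict
    order = list(pending)
    waiting = {unit: 0 for unit in order}
    schedule = []
    turn = 0
    while any(pending.values()):
        # Find the next unit, starting at `turn`, that still has work.
        for offset in range(len(order)):
            unit = order[(turn + offset) % len(order)]
            if pending[unit] > 0:
                break
        schedule.append(unit)
        pending[unit] -= 1
        # Every other unit with pending transfers is idle for this cycle.
        for other in order:
            if other != unit and pending[other] > 0:
                waiting[other] += 1
        turn = (order.index(unit) + 1) % len(order)
    return schedule, waiting


if __name__ == "__main__":
    schedule, waiting = simulate_common_bus({"CPU1": 2, "CPU2": 1, "CPU3": 1})
    print(schedule)  # ['CPU1', 'CPU2', 'CPU3', 'CPU1']
    print(waiting)   # {'CPU1': 2, 'CPU2': 1, 'CPU3': 2}
```

The total number of cycles always equals the total number of transfers, no matter how many processors are attached, which is the single-path bottleneck described above.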
1) Time-Shared Common Bus (cont.)
System performance can be increased if two or more independent buses are used to transfer information. However, this increases the cost and complexity of the bus system.
o Example: A more economical alternative is the dual-bus structure shown in Fig. 4.2.
o Part of the local memory may be designed as a cache memory attached to the CPU.
- As shown in Fig. 4.2, each system bus controller links its local bus to the common system bus.
- A common shared memory is connected to the common system bus. This memory is shared by all CPUs, but only one CPU can communicate with it at any given time.

Fig. 4.2: Time-shared common bus organization
2) Multiport Memory
A multiport memory system employs separate buses between each memory module and each CPU. This is shown in Fig. 4.3 for four CPUs and four memory modules.
- Each processor bus is connected to each memory module.
- A processor bus consists of the address, data, and control lines required to communicate with memory.

Fig. 4.3: Multiport memory organization
2) Multiport Memory (cont.)
- As shown in Fig. 4.3, each memory module has four ports, and each port accommodates one of the buses.
- Each memory module must have internal control logic to determine which port will have access to that memory at any given time.
- Memory access conflicts are resolved by assigning a fixed priority to each memory port. Thus, CPU 1 has priority over CPU 2, CPU 2 has priority over CPU 3, and CPU 4 has the lowest priority.
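The fixed-priority rule can be sketched as a tiny arbiter. This is an illustration of the priority scheme only, not the actual control logic of a memory module; the function name `grant_port` is hypothetical.

```python
def grant_port(requests):
    """Return the index of the port that wins access, or None if nobody asks.

    requests: list of booleans, where index 0 is CPU 1 (highest priority)
    and the last index is the lowest-priority CPU.
    """
    for port, requesting in enumerate(requests):
        if requesting:
            return port  # the first requester in priority order wins
    return None


if __name__ == "__main__":
    # CPU 2 and CPU 4 request the same module in the same cycle: CPU 2 wins.
    print(grant_port([False, True, False, True]))  # 1 (i.e. CPU 2)
```

A known consequence of fixed priorities is that a low-priority CPU can be kept waiting for as long as higher-priority CPUs continue to request the same module.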
Advantage
The advantage of multi-port memory organization is the high transfer rate (bandwidth) that
can be achieved because of the multiple paths between processors and memory.

Disadvantage
The disadvantage is that it requires expensive memory control logic and many cables and
connectors. Consequently, this interconnection structure is usually appropriate for systems
with a small number of processors.
3) Crossbar Switch (also called switching Network)
- Consists of a set of cross-points that are placed at intersections between processor buses and
memory module paths.
- A crossbar can be defined as a switching network with N inputs and M outputs, which
allows up to min{N, M} one-to-one interconnections without contention.
Types of Crossbar Switch:
a) Uni-directional crossbar
b) Bidirectional crossbar

Advantages: The major advantages of the crossbar switch are:

o It supports simultaneous transfers from all memory modules.
o High speed: a connection between source and destination can be made in one clock cycle.

Disadvantage:
o The hardware required to implement the switch can become quite large and complex.
Fig. 4.4 shows the functional design of a crossbar switch connected to one memory module.
3) Crossbar Switch (cont.)
- Each small square (cross-point) in Fig. 4.4 is a switch that determines the path from a processor to a memory module.
- The crossbar allows any processor in the system to connect to any other processor or memory unit, so that many processors can communicate simultaneously without contention.
- Most common applications are:
• Designing high-performance small-scale multiprocessors.
• Designing routers for direct networks.

Fig. 4.4: Crossbar switch interconnection
3) Crossbar Switch (cont.)
a) Uni-directional Crossbar Switch
- Each switch consists of a 2-input AND gate and a 2-input OR gate (direction: PE → Mem).
- To construct an (N×N) crossbar switch network between N processors and N memory modules, one must use control (enable) signals. The signal cij enables the switch in the ith row and jth column.
- cij is the control signal that determines which crosspoint gets "activated".

Fig. 4.5: Uni-directional crossbar switch (rows P1 ... Pn, columns M1 ... Mn, crosspoints C11 ... Cnn)

3) Crossbar Switch (cont.)
a) Uni-directional Crossbar Switch
Notes: For an (n×m) crossbar switch network:
- The last row is free of OR gates.
- Total No. of AND gates = n*m
- Total No. of OR gates = (n-1)*m

H.W: Design a (4×3) crossbar network. Estimate the total number of AND and OR gates needed.
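The AND/OR structure of the uni-directional crossbar can be modelled in software. The sketch below assumes 1-bit processor outputs and that each memory column is the OR of all enabled crosspoints in that column; the function name `crossbar_outputs` is hypothetical.

```python
def crossbar_outputs(p, c):
    """Compute the bit delivered to each memory module M_j.

    p: list of n processor output bits (0 or 1), one per row.
    c: n x m matrix of control bits; c[i][j] == 1 activates crosspoint (i, j).
    Each crosspoint ANDs the processor bit with its control bit, and the
    results in a column are ORed together on the way to the memory module.
    """
    n, m = len(p), len(c[0])
    outputs = []
    for j in range(m):
        bit = 0
        for i in range(n):
            # In hardware this is a chain of n AND gates and (n - 1) OR gates;
            # the software accumulation below is logically equivalent.
            bit |= p[i] & c[i][j]
        outputs.append(bit)
    return outputs


if __name__ == "__main__":
    p = [1, 0, 1]                  # outputs of P1, P2, P3
    c = [[0, 1], [1, 0], [0, 0]]   # P1 -> M2 and P2 -> M1 are enabled
    print(crossbar_outputs(p, c))  # [0, 1]
```

Here M1 receives the bit from P2 (0) and M2 receives the bit from P1 (1); P3 is not connected to any module.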

b) Bi-directional Crossbar Switch
- Each switch consists of two AND gates and two OR gates (PE → Mem and/or Mem → PE).
- To construct an (N×M) crossbar switch network between N processors and M memory modules, use the control signal cij to enable (activate) the switch in the ith row and jth column.

Notes:
- Total No. of AND gates = 2(n*m)
- Total No. of OR gates = 2(n*m)-(m+n)

Fig. 4.6: Bi-directional crossbar switch

H.W: Design a (4×4) bidirectional crossbar network. Estimate the total number of AND and OR gates needed.
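The gate-count formulas from the notes can be wrapped in two small helpers. This is a direct transcription of the formulas for an (n×m) network; the function names are hypothetical.

```python
def unidirectional_gates(n: int, m: int) -> dict:
    """Gate counts for an (n x m) uni-directional crossbar."""
    return {"AND": n * m, "OR": (n - 1) * m}


def bidirectional_gates(n: int, m: int) -> dict:
    """Gate counts for an (n x m) bi-directional crossbar."""
    return {"AND": 2 * n * m, "OR": 2 * n * m - (m + n)}


if __name__ == "__main__":
    # Example with 3 processors and 2 memory modules.
    print(unidirectional_gates(3, 2))  # {'AND': 6, 'OR': 4}
    print(bidirectional_gates(3, 2))   # {'AND': 12, 'OR': 7}
```

The bi-directional OR count is consistent with the uni-directional one: the PE → Mem direction needs (n-1)*m OR gates and the Mem → PE direction needs (m-1)*n, which sum to 2(n*m)-(m+n).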
Chapter 5
SIMD Machines:
Array , Systolic Array, and Wavefront Systems
1) Array Processor
• An array processor is a type of SIMD system.
• An array processor is a single dedicated computer containing a set of identical processing elements (called Pi's) that operate in parallel under the control of a master controller (MC) in a synchronous way.
• Each Pi has its own local memory Mi and includes an ALU and registers.

• All processors Pi execute the same instruction simultaneously (for vector processing), thus providing a single instruction stream with multiple data streams (SIMD operation).

Master controller (MC):

-The master controller (MC) controls all the operations of the computer system and the
processing elements Pi(s), as well.
- It also decodes the instructions and determines how each instruction is to be executed.
The MC consists of two parts:
a) MCU (Master Control Unit): is the CPU of the master controller. It includes an ALU and a
set of registers.

b) MCM (Master Control Memory): holds the instructions and common data.
Array Processor (cont.)

Fig. 5.1: Array processor internal organization
Array Processor Operation
- Each instruction in the program is executed sequentially under the supervision of the MCU.
- The MCU fetches the next instruction. Its execution takes place in one of the following ways:
a) Regular instructions: If the fetched instruction is a scalar or a branch instruction, it is executed by the MC itself.
b) Array or vector instructions: If the fetched instruction is a vector instruction, such as vector add or vector multiply, the MCU broadcasts the same instruction to each Pi of the processor array, allowing all Pi's to execute this instruction simultaneously (assuming that the required data is already within each Pi's private memory).
Array Processor Operation (cont.)
- The data used in the execution of an array instruction is routed into the local memories before the instruction is executed, in one of two ways:
a) All the data values can be transferred to the local memories from an external source via the system data bus.
or
b) The MCU can broadcast the data values to the local memories via the control bus.
Array Processor (cont.)
Notes:
- In an array processor, it may be necessary to disable some processing elements during a vector operation. This can be achieved by using a mask register M inside the MCU, which has one bit mi for each processor Pi.

M register: mn mn-1 ... m1 m0

- If mi = 1, Pi will respond; otherwise, Pi is disabled.

Example: M = 0 1 1 0 ... 0 1 1 0 (only the four processors whose mask bits are 1 are working)

- Data is exchanged between the scratchpad registers and the local memories of the Pi's. This exchange takes place through paths provided by the Inter-Processor Communication Network (IPCN).
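The effect of the mask register on a broadcast vector instruction can be sketched as follows. The sketch assumes each Pi holds one element of each operand in its local memory and that a disabled Pi simply keeps its old value; the function name `masked_vector_add` is hypothetical.

```python
def masked_vector_add(local_a, local_b, mask):
    """Broadcast a vector add to all PEs; only PEs with mask bit 1 respond.

    local_a, local_b: one operand per processing element (its local memory).
    mask: one bit per processing element, as in the M register.
    Returns the new contents of local_a after the instruction.
    """
    return [
        a + b if m == 1 else a   # a disabled PE leaves its data unchanged
        for a, b, m in zip(local_a, local_b, mask)
    ]


if __name__ == "__main__":
    print(masked_vector_add([1, 2, 3, 4], [10, 20, 30, 40], [0, 1, 1, 0]))
    # [1, 22, 33, 4]
```

The list comprehension stands in for the PEs working in lockstep: every element is processed by the same instruction, and the mask alone decides which results are written.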
Example: Consider the following recurrence equation:
$z_i = z_{i-1} + a_i$ for $i = 0, \dots, 3$, with $z_{-1} = 0$
Using an array processor, calculate the result of the equation and draw the array processor graph that indicates how the recurrence equation is calculated. Determine the number of steps needed to evaluate the equation.

Solution
First, expand the recurrence equation $z_i = z_{i-1} + a_i$ for $i = 0, \dots, 3$, with $z_{-1} = 0$:
$z_0 = z_{-1} + a_0 = 0 + a_0 = a_0$ (i = 0)
$z_1 = z_0 + a_1 = a_0 + a_1$ (i = 1)
$z_2 = z_1 + a_2 = a_0 + a_1 + a_2$ (i = 2)
$z_3 = z_2 + a_3 = a_0 + a_1 + a_2 + a_3$ (i = 3)

- To evaluate the recurrence equation on an array processing system, we need four processing elements (4 PEs).
- We assume that each PE (Pi) is initialized with the data $a_i$. The following graph shows how the values of $z_i$ are calculated.

                 P0        P1        P2          P3
Initialization   a0        a1        a2          a3
Step 1           Disable   Enable    Enable      Enable
                 a0        a0+a1     a1+a2       a2+a3
Step 2           Disable   Disable   Enable      Enable
                 a0        a0+a1     a0+a1+a2    a0+a1+a2+a3

- $z_0$ is available after initialization, $z_1$ after Step 1, and $z_2$ and $z_3$ after Step 2, so the computation completes in 2 steps.
Notes:
- In general, for an array processor system with N processing elements (where N is a power of 2), it is possible to evaluate the N values $z_0, z_1, \dots, z_{N-1}$ using $\log_2 N$ steps.
- Also, we need to disable $2^{k-1}$ processing elements during step k.
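The log-step evaluation can be sketched in Python. The sketch assumes that in step k each enabled Pi adds the value held by the PE located 2^(k-1) positions to its left, which reproduces the graph of the worked example; the function name `array_prefix_sum` is hypothetical.

```python
def array_prefix_sum(a):
    """Evaluate z_i = z_(i-1) + a_i on N simulated PEs in log2(N) steps.

    a: list of N values (N must be a power of 2); PE i starts with a[i].
    Returns (values, steps), where values[i] == z_i.
    """
    n = len(a)
    if n == 0 or n & (n - 1) != 0:
        raise ValueError("the number of processing elements must be a power of 2")
    values = list(a)
    steps = n.bit_length() - 1            # log2(n) for a power of 2
    for k in range(1, steps + 1):
        shift = 2 ** (k - 1)
        # The first 2^(k-1) PEs are disabled during step k.
        mask = [0 if i < shift else 1 for i in range(n)]
        previous = list(values)           # all PEs read before any PE writes
        values = [
            previous[i] + previous[i - shift] if mask[i] else previous[i]
            for i in range(n)
        ]
    return values, steps


if __name__ == "__main__":
    print(array_prefix_sum([1, 2, 3, 4]))  # ([1, 3, 6, 10], 2)
```

Copying `values` into `previous` before each step models the lockstep behaviour: every PE uses its neighbour's value from the end of the previous step, not a value updated during the current one.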
Usage of Array Processors
• Array processors increase the overall speed of instruction processing.
• Most array processors are designed to optimize performance for repetitive arithmetic operations, making them faster at vector arithmetic than the host CPU.
• Since most array processors run asynchronously from the host CPU, the system's overall capacity is improved.
• Array processors have their own local memory, providing extra memory to the system. This is an essential consideration for systems with limited physical memory or address space.
Applications:
Array processing is used in various fields, including:
- Radar systems
- Sonar systems
- Astronomy
- Seismic exploration
- Medical applications
- Speech enhancement
2) Systolic Arrays
- Systolic arrays are another kind of SIMD system.
- A systolic array consists of a set of simple processing elements (PEs) with regular, local connections. It takes external inputs and processes them in a predetermined manner in a pipelined fashion.
- It is a synchronous network.

What are the functions of each cell in a Systolic System?

- A systolic array system consists of an array of PEs called cells. Each cell is connected to a small number of nearest-neighbor PEs.
- Generally, the operations are the same in each cell.
- Each cell performs an operation, or a small number of operations, on a data item and then passes it to its neighbor.
Regular Interconnections of Systolic Arrays
What are typical structures of a Systolic Architecture?

1) Bidirectional two-dimensional network
2) Planar array: This configuration allows I/O only through its boundary cells.
3) Focal plane: This configuration allows I/O to each systolic cell.
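Example: Consider the following systolic array cell. Provide a step-by-step block diagram of a (2×2) matrix multiplication Z = X*Y.

[Figure: systolic cell before and after one clock. Before: the cell stores C; X1, X2, ... wait at the top input and Y1, Y2, ... wait at the left input. After: X1 has passed through to the bottom output, and the right output carries Y1 + X1·C.]

Solution: Matrix multiplication Z = X*Y

$$\begin{pmatrix} Z_{11} & Z_{12} \\ Z_{21} & Z_{22} \end{pmatrix} = \begin{pmatrix} X_{11} & X_{12} \\ X_{21} & X_{22} \end{pmatrix} * \begin{pmatrix} Y_{11} & Y_{12} \\ Y_{21} & Y_{22} \end{pmatrix}$$

$Z_{11} = X_{11} Y_{11} + X_{12} Y_{21}$
$Z_{12} = X_{11} Y_{12} + X_{12} Y_{22}$
$Z_{21} = X_{21} Y_{11} + X_{22} Y_{21}$
$Z_{22} = X_{21} Y_{12} + X_{22} Y_{22}$

Initialization: the cell in row i, column j stores Xij. Zeros enter each row from the left as initial partial sums. The elements of Y enter from the top, skewed by one clock per column: column 1 receives Y11, Y12, and column 2 receives 0, Y21, Y22. Each value entering from the top passes down to the next row one clock later.

Clock   Cell X11       Cell X12                  Cell X21       Cell X22
1       0 + X11·Y11    0 + X12·0                 0              0
2       0 + X11·Y12    X11·Y11 + X12·Y21 = Z11   0 + X21·Y11    0
3       0 + X11·0      X11·Y12 + X12·Y22 = Z12   0 + X21·Y12    X21·Y11 + X22·Y21 = Z21
4       0              0                         0              X21·Y12 + X22·Y22 = Z22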
Homework: Using focal plane systolic array architecture, provide a step- by- step block diagram
approach for a (3*3) matrix multiplication.
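The clock-by-clock behaviour of the (2×2) example can be reproduced with a short simulation. The sketch assumes the arrangement of that example (X stored in the cells, Y streamed in from the top with a one-clock skew per column, partial sums moving left to right); it does not model the focal-plane configuration, and the function name `systolic_matmul` is hypothetical.

```python
def systolic_matmul(X, Y):
    """Simulate an n x n systolic array computing Z = X * Y.

    Cell (i, k) stores X[i][k]. On every clock each cell takes a Y value
    from above and a partial sum from its left neighbour, then outputs
    sum_in + X[i][k] * y_in to the right and passes y_in downward.
    Returns (Z, clocks).
    """
    n = len(X)
    y_reg = [[0] * n for _ in range(n)]   # Y value latched by each cell
    s_reg = [[0] * n for _ in range(n)]   # partial sum output by each cell
    Z = [[0] * n for _ in range(n)]
    clocks = 3 * n - 2
    for t in range(clocks):
        new_y = [[0] * n for _ in range(n)]
        new_s = [[0] * n for _ in range(n)]
        for i in range(n):
            for k in range(n):
                if i == 0:
                    j = t - k             # column k is skewed by k clocks
                    y_in = Y[k][j] if 0 <= j < n else 0
                else:
                    y_in = y_reg[i - 1][k]   # value from the cell above
                s_in = s_reg[i][k - 1] if k > 0 else 0
                new_y[i][k] = y_in
                new_s[i][k] = s_in + X[i][k] * y_in
        y_reg, s_reg = new_y, new_s
        # Finished results leave the rightmost cell of each row.
        for i in range(n):
            j = t - (n - 1) - i
            if 0 <= j < n:
                Z[i][j] = s_reg[i][n - 1]
    return Z, clocks


if __name__ == "__main__":
    print(systolic_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
    # ([[19, 22], [43, 50]], 4)
```

For n = 2 the results leave the rightmost column in the same order as in the worked example: Z11 at clock 2, Z12 and Z21 at clock 3, and Z22 at clock 4.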
3) Wavefront Array Processor
- Wavefront arrays are another kind of SIMD system.
- A wavefront array is very similar to a systolic array: it consists of a set of simple processing elements (PEs) with regular, local connections, and it takes external inputs and processes them in a predetermined manner in a pipelined fashion.
- However, it is an asynchronous network.

Example: Consider the following wavefront array cell. Provide a step-by-step block diagram of a (2×2) matrix multiplication Z = X*Y.

[Figure: wavefront array cell, labeled A, B, and data]

Solution: Matrix multiplication Z = X*Y

$$\begin{pmatrix} Z_{11} & Z_{12} \\ Z_{21} & Z_{22} \end{pmatrix} = \begin{pmatrix} X_{11} & X_{12} \\ X_{21} & X_{22} \end{pmatrix} * \begin{pmatrix} Y_{11} & Y_{12} \\ Y_{21} & Y_{22} \end{pmatrix}$$

$Z_{11} = X_{11} Y_{11} + X_{12} Y_{21}$
$Z_{12} = X_{11} Y_{12} + X_{12} Y_{22}$
$Z_{21} = X_{21} Y_{11} + X_{22} Y_{21}$
$Z_{22} = X_{21} Y_{12} + X_{22} Y_{22}$

Initialization: every cell holds 0. The rows of X enter from the left (row 1: X11, X12; row 2: 0, X21, X22) and the columns of Y enter from the top (column 1: Y11, Y21; column 2: 0, Y12, Y22). Each cell accumulates its own element of Z and passes its operands on to its right and lower neighbors.

Step   Cell (1,1)                Cell (1,2)                Cell (2,1)                Cell (2,2)
1      0 + X11·Y11               0                         0                         0
2      X11·Y11 + X12·Y21 = Z11   0 + X11·Y12               0 + X21·Y11               0
3      Z11                       X11·Y12 + X12·Y22 = Z12   X21·Y11 + X22·Y21 = Z21   0 + X21·Y12
4      Z11                       Z12                       Z21                       X21·Y12 + X22·Y22 = Z22
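Exercise: Consider the following wavefront array cell. Provide a step-by-step block diagram of a (3×3) matrix multiplication Z = X*Y.

[Figure: array cell before and after one step. Before: the cell stores C, with X1, X2, ... and Y1, Y2, ... waiting at its inputs. After: the output carries Y1 + X1·C.]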
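The data-driven (asynchronous) behaviour of a wavefront array can be sketched with input queues: a cell fires as soon as both of its operands are present, with no global clock. The sketch below assumes each cell accumulates its own Z element and forwards its operands to the right and downward, as in the (2×2) example; because firing is data-driven, the zero padding used to skew the inputs is not needed. The function name `wavefront_matmul` is hypothetical.

```python
from collections import deque


def wavefront_matmul(X, Y):
    """Simulate an n x n wavefront array computing Z = X * Y.

    Each cell (i, j) has a left queue (X operands) and a top queue
    (Y operands). A cell fires whenever both queues are non-empty.
    Returns (Z, waves), where waves counts the rounds of firing.
    """
    n = len(X)
    left = [[deque() for _ in range(n)] for _ in range(n)]
    top = [[deque() for _ in range(n)] for _ in range(n)]
    for i in range(n):
        left[i][0].extend(X[i])                       # row i of X enters from the left
    for j in range(n):
        top[0][j].extend(Y[k][j] for k in range(n))   # column j of Y enters from the top
    Z = [[0] * n for _ in range(n)]
    waves = 0
    while True:
        ready = [(i, j) for i in range(n) for j in range(n)
                 if left[i][j] and top[i][j]]
        if not ready:
            break
        waves += 1
        # Consume operands first, so data forwarded in this wave is only
        # used by the neighbours in the next wave.
        fired = [(i, j, left[i][j].popleft(), top[i][j].popleft())
                 for i, j in ready]
        for i, j, a, b in fired:
            Z[i][j] += a * b
            if j + 1 < n:
                left[i][j + 1].append(a)   # pass the X operand to the right
            if i + 1 < n:
                top[i + 1][j].append(b)    # pass the Y operand downward
    return Z, waves


if __name__ == "__main__":
    print(wavefront_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
    # ([[19, 22], [43, 50]], 4)
```

For the (2×2) case the simulation needs four waves, matching the four steps of the worked example: the computation spreads diagonally from cell (1,1) to cell (2,2).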
