Ca 1

Uploaded by

ashikapramodpm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views37 pages

Ca 1

Uploaded by

ashikapramodpm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

Module III - STRUCTURES AND

ALGORITHMS FOR ARRAY

PROCESSORS
SIMD ARRAY PROCESSORS

 A synchronous array of parallel processors is called

an array processor
 It consists of multiple Processing Elements (PEs)
under the supervision of one Control Unit (CU)
 An array processor can handle Single Instruction and
Multiple Data (SIMD) streams
 Array processors are also known as SIMD computers
 SIMD computers appear in two basic architectural
organizations:
1. Array Processors – using random-access
memory
2. Associative Processors – using content
addressable memory
SIMD Computer Organizations

 An array processor may assume one of two slightly

different configurations
 This configuration is structured with N synchronized
PEs, all of which are under the control of one CU.
 Each PEi is an arithmetic logic unit (ALU) with
attached working registers and local memory
 The CU has its own main memory for the storage of
programs
 The system and user programs are executed under
the control of CU
 The user programs are loaded into the CU memory
from the external source
 The function of the CU is to decode all the
instructions and determine where the decoded
instructions should be executed
 Scalar or control-type instructions are directly
executed inside the CU.
 Vector instructions are broadcast to the PEs for
distributed execution to achieve spatial parallelism
through duplicate arithmetic units (PEs).
 All the PEs perform the same function
synchronously in a lock-step fashion under the
command of the CU.
 Vector operands are distributed to the PEMs before
parallel execution in the array of PEs.
 The distributed data can be loaded into the PEMs
from an external source via the system data bus, or
via the CU in a broadcast mode using the control bus
 Masking schemes are used to control the status of
each PE during the execution of a vector instruction.
 Each PE may be either active or disabled during an
instruction cycle.
 A masking vector is used to control the status of all
PEs.
 Only enabled PEs perform computation
 Data exchanges among the PEs are done via an inter-
PE communication network, which performs all
necessary data-routing and manipulation functions.
 This interconnection network is under the control of
the control unit.
 An array processor is normally interfaced to a host
computer through the control unit.
 The host computer is a general-purpose machine
which serves as the "operating manager" of the
entire system, consisting of the host and the
processor array.
 The functions of the host computer include resource
management and peripheral and I/O supervisions.
 The control unit of the processor array directly
supervises the execution of programs, whereas the
host machine performs the executive and I/O
functions with the outside world.
Configuration II
 This configuration II differs from the configuration I
in two aspects.
1. The local memories attached to the PEs are now
replaced by parallel memory modules shared by all
the PEs through an alignment network.
2.The inter-PE permutation network is replaced by
the inter alignment network, which is again
controlled by the CU.
 A good example of a configuration II SIMD machine
is the Burroughs Scientific Processor (BSP).
 There are N PEs and P memory modules in
configuration II.
 The two numbers are not necessarily equal.
 They have been chosen to be relatively prime.
 The alignment network is a path-switching network
between the PEs and the parallel memories.
 Such an alignment network is desired to allow
conflict accesses of the shared memories by as many
PEs as possible.
SIMD computer

 Formally, an SIMD computer C is characterized by

the following set of parameters:
C = < N, F, I, M >
 Where
 N = the number of PEs in the system. For example,
the Illiac-IV has N= 64, the BSP has N =16, and the
MPP has N = 16,384.
 F = a set of data-routing functions provided by the
interconnection network (in configuration I) or by
the alignment network (in configuration II).
 I = the set of machine instructions for scalar-vector,
data-routing, and network-manipulation operations.
 M = the set of masking schemes, where each mask
partitions the set of PEs into the two disjoint subsets
of enabled PEs and disabled PEs.
Masking and Data Routing Mechanisms

 Each PEi is
o a processor with its own memory PEMi ,
o a set of working registers and flags, namely Ai ,
Bi , Ci and Si,
o an arithmetic logic unit,
o a local index register Ii ,
o an address register Di and
o a data-routing register Ri .
 The Ri of each PEi is connected to the Ri of other PEs
via the interconnection network.
 When data transfer among PEs occurs, it is the
contents of the Ri registers that are being
transferred.
 We denote the N PEs as PE for i= 0, 1, 2, , N-1, where
the index i is the address of PEi .
 We assume N = 2m or m = log2 N binary digits are
needed to encode the address of a PE.
 The address register Di is used to hold the m bit
address of the PEi.
 This PE structure is essentially based on the design
in Illiac IV
 Some array processor may use 2 routing register, one
for input and the other for output.
 Each PEi is either active or in the inactive mode
during each instruction cycle.
 If a PEi is active, it executes the instruction broadcast
to it by the CU.
 If a PEi is inactive, it will not execute the broadcast
instruction.
 The masking schemes are used to specify the status
flag Si of PEi.
 The conventions Si = 1 is chosen for an active PEi and
Si = 0 for an inactive PEi .
 In the CU, there is a global index register I and a
Masking register M.
 The M register has N bits.
 The physical length of a vector is determined by the
number of PEs.
 The CU performs the segmentation of a long vector
into vector loops, the setting of a global address, and
the offset increment.
 In an array processor, vector operands can be
specified by the registers to be used or by the
memory addresses to be referenced.
 For memory-reference instructions, each PEi
accesses the local PEMi offset by its own index
register Ii.
 The register Ii modifies the global memory address
broadcast from the CU.
 Thus, different locations in different PEMs can be
accessed simultaneously with the same global
address specified by the CU.
 Array processors are special purpose computers for
limited scientific applications.
 The array of PEs are passive arithmetic units waiting
to be called for parallel computation duties.
 The permutation network among PEs is under
control from the CU
Inter-PE Communications

 Network design decisions for inter-PE

communications are :
1. Operation modes
2. Control strategies
3. Switching methodologies
4. Network topologies
 These are fundamental decisions in determining the
appropriate architecture of an interconnection
network for an SIMD machine.
 The decisions are made between operation modes,
control strategies, switching methodologies, and
network topologies.
Operation modes

 Two types of communication can be identified :

1. Synchronous : Synchronous communication is
needed for establishing communication paths
synchronously for either a data manipulating
function or for a data instruction broadcast.
2. Asynchronous : Asynchronous communication
is needed for multiprocessing in which
connection requests are issued dynamically
 A system may also be designed to facilitate both
synchronous and asynchronous processing.
 The typical operation modes of interconnection
networks can be classified into three categories:
synchronous, asynchronous, and combined.
 All existing SIMD machines choose the synchronous
operation mode, in which lock-step operations
among all PEs are enforced.
Control strategies

 An interconnection network consists of a number of

switching elements and interconnecting links,
interconnection functions are realized by properly setting
control of the switching elements.
 The control-setting function can be managed by a
centralized controller or by the individual switching
element.
 The latter strategy is called distributed control and the
first strategy corresponds to centralized control.
 Most existing SIMD interconnection networks choose the
centralized control on all switch elements by the control
unit.
Switching methodologies

 The two major switching methodologies are :

 Circuit switching : A physical path is actually
established between a source and a destination.
 Packet switching : Data is put in a packet and
routed through the interconnection network
without establishing a physical connection path.
 Circuit switching is much more suitable for bulk data
transmission, and packet switching is more efficient
for many short data messages.
 Another option, integrated switching, includes the
capabilities of both circuit switching and packet
switching.
 Most SIMD interconnection networks are handwired
to assume circuit switching operations.
 Packet switched networks have been suggested
mainly for MIMD machines.
Network topologies

 A network can be depicted by a graph in which nodes

represent switching points and edges represent
communication links.
 The topologies tend to be regular and can be grouped
into two categories:
 Static : Links between two processors are passive
and dedicated buses cannot be reconfigured for
direct connections to other processors.
 Dynamic: Links in the dynamic category can be
reconfigured by setting the network’s active
switching elements.
 The space of the interconnection networks can be
represented by the cartesian product of the above
four sets of design features:
{operation mode} x {control strategy} x {switching
methodology} x {network topology}
 The choice of a particular interconnection network
depends on the application demands, technology
supports, and cost-effectiveness.

Computer Organization and Architecture Module 1 (Kerala University) Notes
100% (12)
Computer Organization and Architecture Module 1 (Kerala University) Notes
30 pages
Minbooklist 136254
No ratings yet
Minbooklist 136254
156 pages
2 Computer System Architecture
No ratings yet
2 Computer System Architecture
12 pages
Chapter1 - Basic Structure of Computers
0% (1)
Chapter1 - Basic Structure of Computers
119 pages
Chapter1 - Basic Structure of Computers
100% (1)
Chapter1 - Basic Structure of Computers
119 pages
Computer Organization and Architecture Module 1
100% (1)
Computer Organization and Architecture Module 1
46 pages
B.tech CS S8 High Performance Computing Module Notes Module 3
100% (1)
B.tech CS S8 High Performance Computing Module Notes Module 3
28 pages
Version 2 EE IIT, Kharagpur 1
100% (1)
Version 2 EE IIT, Kharagpur 1
15 pages
Types of DSP Architectures
100% (3)
Types of DSP Architectures
45 pages
Computer System Organizations: Ms - Chit Su Mon
No ratings yet
Computer System Organizations: Ms - Chit Su Mon
74 pages
6th Edition - Chapter 1 - Basic Structure of Computers-26!02!2021
100% (1)
6th Edition - Chapter 1 - Basic Structure of Computers-26!02!2021
58 pages
Processing Device
No ratings yet
Processing Device
4 pages
Chapter1 Basic Structure of Computers
100% (2)
Chapter1 Basic Structure of Computers
7 pages
CAA Micro Project
100% (1)
CAA Micro Project
19 pages
Computer Organization, Unit 1 & 2
No ratings yet
Computer Organization, Unit 1 & 2
198 pages
The Hardware Side - Part 1: An Introduction
No ratings yet
The Hardware Side - Part 1: An Introduction
68 pages
SIMD Computer Organizations
0% (1)
SIMD Computer Organizations
20 pages
Organization CH 2
No ratings yet
Organization CH 2
102 pages
Unit 1
No ratings yet
Unit 1
52 pages
Chapter1 - Basic Structure of Computers
No ratings yet
Chapter1 - Basic Structure of Computers
119 pages
How To Implement Modbus TCP Protocol Using VBA With Excel - Acc Automation
No ratings yet
How To Implement Modbus TCP Protocol Using VBA With Excel - Acc Automation
18 pages
Chapter1 - Basic Structure of Computers
No ratings yet
Chapter1 - Basic Structure of Computers
123 pages
Unit 1
No ratings yet
Unit 1
74 pages
Refer This Notes
No ratings yet
Refer This Notes
30 pages
Unit-1 Co
No ratings yet
Unit-1 Co
70 pages
LD and CO Module 3
No ratings yet
LD and CO Module 3
74 pages
Mod 1
No ratings yet
Mod 1
12 pages
12 Volt Relay Wiring Diagram
100% (2)
12 Volt Relay Wiring Diagram
1 page
CO - OS Unit-1 (Part1)
No ratings yet
CO - OS Unit-1 (Part1)
40 pages
The Stored Program Concept
No ratings yet
The Stored Program Concept
11 pages
Chapter 4
No ratings yet
Chapter 4
8 pages
Follow-Up Email Templates
100% (1)
Follow-Up Email Templates
33 pages
Munish Vashishath Block Diagram
No ratings yet
Munish Vashishath Block Diagram
39 pages
Unit-1 Co
No ratings yet
Unit-1 Co
71 pages
CA I - Chapter 3 RISC V Processor
No ratings yet
CA I - Chapter 3 RISC V Processor
107 pages
Computer Organization AND Architecture
No ratings yet
Computer Organization AND Architecture
64 pages
Computer Organization and Architecture: William Stallings
No ratings yet
Computer Organization and Architecture: William Stallings
78 pages
Module-4. Structure of Computers, Instruction Set Architecture and Memory Unit
No ratings yet
Module-4. Structure of Computers, Instruction Set Architecture and Memory Unit
59 pages
UNIT-V-Pipeline and Array Processing and Multi Processors
No ratings yet
UNIT-V-Pipeline and Array Processing and Multi Processors
51 pages
Aca UNIT-5
No ratings yet
Aca UNIT-5
10 pages
Array Processor
No ratings yet
Array Processor
6 pages
Introduction To Computer Architecture
No ratings yet
Introduction To Computer Architecture
81 pages
MPMC Units (1&2) - III Ece (r19) (2) - 1
No ratings yet
MPMC Units (1&2) - III Ece (r19) (2) - 1
62 pages
Coa - Lecture Notes
No ratings yet
Coa - Lecture Notes
137 pages
Chapter1 Basic Structure of Computers
No ratings yet
Chapter1 Basic Structure of Computers
119 pages
26-27 SIMD Architecture
No ratings yet
26-27 SIMD Architecture
33 pages
Microprocessor: Mbeya University of Science and Technology Department of Electronics and Telecommunication Engineering
No ratings yet
Microprocessor: Mbeya University of Science and Technology Department of Electronics and Telecommunication Engineering
30 pages
BNYS Prospectus 2020 21
No ratings yet
BNYS Prospectus 2020 21
33 pages
Processor Fundamental
No ratings yet
Processor Fundamental
36 pages
User Manual Hexprog Ii Tuner: Index
No ratings yet
User Manual Hexprog Ii Tuner: Index
20 pages
Chapter 1 - Basic Structure of Computers
No ratings yet
Chapter 1 - Basic Structure of Computers
33 pages
Unit 1 Computer - Organization
No ratings yet
Unit 1 Computer - Organization
96 pages
COA
No ratings yet
COA
137 pages
Qetero Service Booking Platform
No ratings yet
Qetero Service Booking Platform
23 pages
19011874 - A00《HD90S Series High Voltage Inverter User Manual - Project E》-English
No ratings yet
19011874 - A00《HD90S Series High Voltage Inverter User Manual - Project E》-English
415 pages
VAE-Driven Multimodal Fusion For Early Cardiac Disease Detection
No ratings yet
VAE-Driven Multimodal Fusion For Early Cardiac Disease Detection
17 pages
Introduction To Cellular Mobile Radio Systems
No ratings yet
Introduction To Cellular Mobile Radio Systems
83 pages
2nd Year NEP Syllabus
No ratings yet
2nd Year NEP Syllabus
30 pages
III Term Paper EM
No ratings yet
III Term Paper EM
5 pages
Here Are Some Cases of ERP Implementation:: ERP Case Find Out About #1: Cadbury - Success
No ratings yet
Here Are Some Cases of ERP Implementation:: ERP Case Find Out About #1: Cadbury - Success
5 pages
Comp 2 Revision
No ratings yet
Comp 2 Revision
7 pages
Conditional Formatting
No ratings yet
Conditional Formatting
32 pages
AICyber-Chain Combining AI and Blockchain For Improved Cybersecurity
No ratings yet
AICyber-Chain Combining AI and Blockchain For Improved Cybersecurity
22 pages
An Ensemble Deep Learning Model For Vehicular Engine Health Prediction
No ratings yet
An Ensemble Deep Learning Model For Vehicular Engine Health Prediction
19 pages
IOT Module 4
No ratings yet
IOT Module 4
14 pages
Manual Siwarex Wp521 Wp522 en
No ratings yet
Manual Siwarex Wp521 Wp522 en
176 pages
Datasheet FSR
No ratings yet
Datasheet FSR
10 pages
IP Security Architecture
No ratings yet
IP Security Architecture
11 pages
Seminar Index
No ratings yet
Seminar Index
5 pages
High Accuracy Lane Line Detection System Using Enhanced Yolo V3
No ratings yet
High Accuracy Lane Line Detection System Using Enhanced Yolo V3
6 pages
Predicting Market Performance Using Machine and Deep Learning Techniques
No ratings yet
Predicting Market Performance Using Machine and Deep Learning Techniques
8 pages
Excel 2016: Basics 1: Navigating and Formatting
No ratings yet
Excel 2016: Basics 1: Navigating and Formatting
27 pages
APT1608EC
No ratings yet
APT1608EC
4 pages
Estimating Today March April 2023
No ratings yet
Estimating Today March April 2023
36 pages
Advances in Neural Rendering
No ratings yet
Advances in Neural Rendering
33 pages
Uss 2023-24 CONSOLIDATed 8
No ratings yet
Uss 2023-24 CONSOLIDATed 8
1 page
Uss 2023-24 MARK Test 6
No ratings yet
Uss 2023-24 MARK Test 6
1 page
Pneumonia Detection and Classification Using Deep Learning Abstract
No ratings yet
Pneumonia Detection and Classification Using Deep Learning Abstract
1 page
AI-Powered Freshness Detection and Shelf Life Prediction System For Food Items Using Image Processing
No ratings yet
AI-Powered Freshness Detection and Shelf Life Prediction System For Food Items Using Image Processing
1 page
Filter Validation Guidelines
No ratings yet
Filter Validation Guidelines
9 pages
Journal of Energy Storage
No ratings yet
Journal of Energy Storage
14 pages
AI For EV
No ratings yet
AI For EV
22 pages
SAFETY106714
No ratings yet
SAFETY106714
14 pages
M1 User Level Security User Guide
No ratings yet
M1 User Level Security User Guide
8 pages
Канада
No ratings yet
Канада
9 pages
Database Programming With SQL 16-1: Working With Sequences Practice Activities
No ratings yet
Database Programming With SQL 16-1: Working With Sequences Practice Activities
3 pages
Svsdvsdvsdvsdvs
No ratings yet
Svsdvsdvsdvsdvs
6 pages
Clipsal 4CC AND 4FCC INSTALLATION ISTRUCTION
No ratings yet
Clipsal 4CC AND 4FCC INSTALLATION ISTRUCTION
2 pages
Sridevi SR - Accounts Executive With 4 Years of Exp
No ratings yet
Sridevi SR - Accounts Executive With 4 Years of Exp
3 pages
Efficient Memory Optimization for IoT Intrusion Detection
From Everand
Efficient Memory Optimization for IoT Intrusion Detection
Ethan Evelyn
No ratings yet
Routing in Wireless Mesh Networks
From Everand
Routing in Wireless Mesh Networks
Raghav Kumar
No ratings yet
Digital Engineering: Complex System Design
From Everand
Digital Engineering: Complex System Design
S Mathioudakis
No ratings yet
The complete guide to Hardware Technician Terminology: A simplified guide
From Everand
The complete guide to Hardware Technician Terminology: A simplified guide
Sumitra Kumari
No ratings yet
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
From Everand
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
Digital Equipment Corporation
No ratings yet
Study Guide Cisco 300-535 SPAUTO Automating and Programming Cisco Service Provider Solutions
From Everand
Study Guide Cisco 300-535 SPAUTO Automating and Programming Cisco Service Provider Solutions
Anand Vemula
No ratings yet
Study Guide Designing Cisco Data Centre Infrastructure (300-610) Exam
From Everand
Study Guide Designing Cisco Data Centre Infrastructure (300-610) Exam
Anand Vemula
No ratings yet
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
MARIO FRANCO
No ratings yet
BGP and the Internet
From Everand
BGP and the Internet
Dimitrios Voutsinas
No ratings yet
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
From Everand
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
Jonathan Rigdon
No ratings yet
Correct Maintenance - Cognex DataMan 8500
From Everand
Correct Maintenance - Cognex DataMan 8500
Unique Content
No ratings yet
Computer Science II Essentials
From Everand
Computer Science II Essentials
Randall Raus
No ratings yet
Networked Control System: Fundamentals and Applications
From Everand
Networked Control System: Fundamentals and Applications
Fouad Sabry
No ratings yet

Ca 1

Uploaded by

Ca 1

Uploaded by

Module III - STRUCTURES AND

ALGORITHMS FOR ARRAY

 A synchronous array of parallel processors is called

 An array processor may assume one of two slightly

 Formally, an SIMD computer C is characterized by

 Network design decisions for inter-PE

 Two types of communication can be identified :

 An interconnection network consists of a number of

 The two major switching methodologies are :

 A network can be depicted by a graph in which nodes

You might also like