0% found this document useful (0 votes)

308 views25 pages

SIMD Array Processor

The document describes two SIMD array processors - ILLIAC-IV and Burroughs Scientific Processor (BSP). ILLIAC-IV consists of multiple processing elements under a single control unit. Each processing element contains an ALU, registers and local memory. Vector instructions are sent to processing elements for distributed execution to achieve spatial parallelism. A masking scheme is used to control the status of processing elements during instruction execution. BSP has fewer processing units than ILLIAC-IV but with all processors having equal access to a common logical address space divided into separate memory modules. Each processing element is an arithmetic unit with input/output registers. The document also discusses various interconnection network topologies used

Uploaded by

Tejodeep Bose

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

308 views25 pages

SIMD Array Processor

Uploaded by

Tejodeep Bose

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

SIMD ARRAY

PROCESSORS
Chapter 4

NOTE: Refer two author book Kai Hwang and Briggs page no:325
SIMD ARRAY PROCESSORS

• ILLIAC-IV

• Burroughs Scientific Processor(BSP)

ILLIAC-IV
ILLIAC IV
• Constitutes of multiple synchronized processing
elements under one control unit

• Each processing elements contain an ALU, registers

and local memory

• User programs are loaded into the control unit

memory from an external source

• The control unit then decodes and decides where the

instruction should be executed.
ILLIAC IV
• Branching instructions are executed directly on the
control unit

• Vector instructions are sent to the processing

elements for distributive execution to achieve spatial
parallelism via duplicate arithmetic units

• The data can be loaded into the processing element’s

memory from external source via system bus and
broadcast mode of control unit
ILLIAC
•
IV
Masking scheme is used to control the status of
processing element during execution

• A PE may either be activated or deactivated during an

instruction cycle

• Enabled PE only perform execution

• Data exchange between the PE is done via

interconnection network that performs data routing
and manipulation function
ILLIAC IV
• Interconnection network is too under the supervision
of control unit
• A host computer is interfaced with array processor
through control unit
• A host computer is a general purpose machine which
serves as a overall manager of the entire system.
• Host : front-end machine; manages resources, i/o activity
• Array Processor can be considered as back-end attached
computer
• Note: each node has local memory
Burroughs Scientific Processor(BSP)
• Effectively a successor to the ILLIAC IV machine

• But with an architecture modified to reflect the fact that the BSP was
intended to be a commercial product

• It has fewer processing units than ILLIAC IV

• All processors enjoyed equal access to a common logical address space

which was divided into a number of physically separate memory
modules

• Each processing element is nothing more than an arithmetic unit with

input and output registers, and these units are homogeneous and non-
pipelined
• Note: All nodes share the same memory howsoever the memory are divided
into modules
SIMD Interconnection Network

1D : Linear; 2D: Ring, Mesh,Star, Tree; 3D: Hypercube

SIMD Interconnection Network
• Data exchange among the PE’s are done via interconnection
network
• They perform all data routing and manipulation functions
• Architecture of an interconnection network is based on
topology
• Static: pattern is fixed and cannot be reconfigured
• Dynamic : pattern inside the network are not fixed
• Depending on the number of stages it is divided into two
types
• Single stage
• Multi stage

NOTE: refer 334 page no of two author kai Hwang and Briggs
Single Stage
• Only one stage is used
• Depending on the interstage connection used a single stage is also
known as recirculating network
• Data may recirculate single stage many times before reaching their
destination
• E.g : Crossbar Network

NOTE: refer 334 page no of two author kai Hwang and Briggs
Multi Stage
• Consist of many stages interconnected switch
• Characterized by switch box and network connectivity
• The connectivity is controlled by choice of interstage
connection pattern
• A switch box may have any of the four patterns
mentioned in the next slide.

NOTE: refer 337 page no of two author kai Hwang and Briggs
1: Straight
2:Exchange
3: Lower Broadcast
4:Upper Broadcast

NOTE: refer 337 page no of two author kai Hwang and Briggs
Mesh Connected Illilac Network

No of Nodes = N=16
No of interconnection per node (r) =√N=4
Max no of hops≤ √N-1 Example:
Routing function for connecting ith node: Node 3 will be connected with:
R+i(1)= (i+1) mod N 0≤ i ≤N-I i+1= 4th node
R-i(1)= (i-1) mod N i-1=3rd node
R+r(i)= (r+i) mod N i+4=7th node
R-r(i)= (r-i) mod N i-4=(-1)=15
Shuffle exchange and omega Networks:
• The class of shuffle- exchange network is based on two routing function
shuffle(S) and exchange(G).
• Two types: perfect shuffle and Inverse shuffle
• For perfect shuffle
• Let A=an-1, an-2,…….a1,a0 be a PE address.
• S(an-1, an-2,…….a1,a0 )=an-2,…….a1,a0 , an-1 ..
• N= number of PE’s.
• n= log n
• The cyclic shifting of bits in A to the left for one bit position is performed by
the S.
NOTE: 350 page no of two author kai Hwang and Briggs
PERFECT SHUFFLE

NOTE: 351 page no of two author kai Hwang and Briggs

Shuffle exchange and omega Networks:
• Inverse shuffle
• Let A=an-1, an-2,…….a1,a0 be a PE address.
• S(an-1, an-2,…….a1,a0 )=a0, an-1, an-2,…….a1
• N= number of PE’s.
• n= log n
• The cyclic shifting of bits in A to the right for one bit position is
performed by the S.

NOTE: 350 page no of two author kai Hwang and Briggs

Inverse Shuffle

NOTE: 351 page no of two author kai Hwang and Briggs

OMEGA NETWORK (by using perfect
shuffle)

NOTE: refer Video

BLOCKING STATE
• If the i/o ports are on the same side then the network
is called one-sided networks, also known as full
switches
• Two side multistage has an input and output side
• This can be classified further as blocking or non blocking

• If the simultaneous connections of some multiple i/o pairs result in

conflicts in switches or links, then the multistage network is known
as blocking network. Eg: OMEGA NETWORK

• If the network can perform all possible connections sources (input)

and destination(output) by rearranging its connection then its
known as non-blocking network. Eg: Benes Network
Cube Interconnection Network
• Cube network can be implemented as either a re-circular network or
as multistage network for SIMD.
• In cube network by single cube we can able to connect 8 PEs.
• To connect 16 node we need two cube and so on.
• For representing 8 PEs 3bits are required.
• Interconnection made by using following rule:
• vertical lines connect vertices(PEs) differ in the most significant bit.
• Horizontal line differs in the least significant bit.
• Vertices at both ends of diagonal lines differs in the middle bit position.

(refer page no:343

Cube Interconnection Network
Assignment:
• Design a 4-cube network for an array processor consisting of 16
Processing Elements (PEs). Trace the path to route a packet of data
from the node 0110 to 1101.
• Barrel Shifter Network (refer page no:345)
• With the aid of an example explain the ‘Masking’ and ‘Data Routing’
mechanism in an SIMD Array processor.
Thank You

SIMD Architecture Explained
100% (1)
SIMD Architecture Explained
45 pages
Introduction To Algorithms: Design and Analysis of Algorithms 214
No ratings yet
Introduction To Algorithms: Design and Analysis of Algorithms 214
42 pages
Bubble and Shuttle Sort Algorithms Explained
0% (1)
Bubble and Shuttle Sort Algorithms Explained
3 pages
Assembly Language Arithmetic Instructions
No ratings yet
Assembly Language Arithmetic Instructions
30 pages
Operating System Overview and Evolution
No ratings yet
Operating System Overview and Evolution
81 pages
Security in Distributed Systems
No ratings yet
Security in Distributed Systems
16 pages
Intro to Algorithms & Flowcharts
No ratings yet
Intro to Algorithms & Flowcharts
186 pages
OS Lecture-13 (Virtual Memory and Page Replacement Algorithms)
No ratings yet
OS Lecture-13 (Virtual Memory and Page Replacement Algorithms)
52 pages
Kernel I/O Subsystem in Operating System
No ratings yet
Kernel I/O Subsystem in Operating System
2 pages
FCFS Lab
No ratings yet
FCFS Lab
5 pages
2.7 Heaps
No ratings yet
2.7 Heaps
25 pages
4th Sem DBMS LAB Manual
No ratings yet
4th Sem DBMS LAB Manual
43 pages
CH - 5. Memory Management
No ratings yet
CH - 5. Memory Management
86 pages
Run-Time Environment in Compiler Design
No ratings yet
Run-Time Environment in Compiler Design
34 pages
Data Structures & Algorithms Guide
No ratings yet
Data Structures & Algorithms Guide
12 pages
Instruction Set Architecture Types
No ratings yet
Instruction Set Architecture Types
16 pages
Software Project Scheduling Guide
No ratings yet
Software Project Scheduling Guide
20 pages
COA Project
No ratings yet
COA Project
8 pages
Relational Database Design Pitfalls
No ratings yet
Relational Database Design Pitfalls
6 pages
Chapter 6 - Pipelining
0% (1)
Chapter 6 - Pipelining
61 pages
Chapter 10: Algorithms 10.1. Deterministic and Non-Deterministic Algorithm
No ratings yet
Chapter 10: Algorithms 10.1. Deterministic and Non-Deterministic Algorithm
5 pages
Differences Between Black Box Testing Vs White Box Testing
No ratings yet
Differences Between Black Box Testing Vs White Box Testing
9 pages
Understanding SIMD Architecture
No ratings yet
Understanding SIMD Architecture
28 pages
Evolution of Computer Architecture
0% (1)
Evolution of Computer Architecture
6 pages
Parameter Passing Techniques
No ratings yet
Parameter Passing Techniques
5 pages
OS Scheduling for Tech Enthusiasts
No ratings yet
OS Scheduling for Tech Enthusiasts
32 pages
Lecture 06 - Binary Search Tree (BST) - Design Analysis of Algorithm
No ratings yet
Lecture 06 - Binary Search Tree (BST) - Design Analysis of Algorithm
30 pages
Operating System Interfaces and Management
No ratings yet
Operating System Interfaces and Management
34 pages
Computer Organization and Architecture Micro-Operations
No ratings yet
Computer Organization and Architecture Micro-Operations
9 pages
Lecture 12 Structures
No ratings yet
Lecture 12 Structures
37 pages
OS Practical Exam Question Bank Practice Questions
No ratings yet
OS Practical Exam Question Bank Practice Questions
2 pages
Performance Measurement of The Algorithm
No ratings yet
Performance Measurement of The Algorithm
10 pages
Unit 2
100% (1)
Unit 2
58 pages
Fundamentals of Algorithmic Problem Solving: B.B. Karki, LSU 2.1 CSC 3102
No ratings yet
Fundamentals of Algorithmic Problem Solving: B.B. Karki, LSU 2.1 CSC 3102
4 pages
Basic Concepts of String and Automata
No ratings yet
Basic Concepts of String and Automata
8 pages
Rajamadam Engineering Lab Record
No ratings yet
Rajamadam Engineering Lab Record
71 pages
Computer Peripherals & Interfacing
No ratings yet
Computer Peripherals & Interfacing
128 pages
Pipelining Basic and Intermediate Concepts
No ratings yet
Pipelining Basic and Intermediate Concepts
127 pages
Lecture 2.0 - Issues in Design of Distributed System
100% (1)
Lecture 2.0 - Issues in Design of Distributed System
14 pages
Dixit Abhishek
No ratings yet
Dixit Abhishek
54 pages
Linux Scheduling
No ratings yet
Linux Scheduling
20 pages
Chapter 2 - Memory Management (Simple Systems)
No ratings yet
Chapter 2 - Memory Management (Simple Systems)
31 pages
Advantages of Database Systems
100% (13)
Advantages of Database Systems
61 pages
Software Design Essentials
No ratings yet
Software Design Essentials
11 pages
Graph Basic Terminology
No ratings yet
Graph Basic Terminology
21 pages
CD Assignment-2
No ratings yet
CD Assignment-2
16 pages
Design and Analysis Algorithm: Sorting Algorithms
No ratings yet
Design and Analysis Algorithm: Sorting Algorithms
17 pages
3.allocation of Frames
No ratings yet
3.allocation of Frames
8 pages
Operating System Chapter 3 Scheduling
No ratings yet
Operating System Chapter 3 Scheduling
10 pages
MODULE 2: Input / Output Organization: Courtesy: Text Book: Carl Hamacher 5 Edition
No ratings yet
MODULE 2: Input / Output Organization: Courtesy: Text Book: Carl Hamacher 5 Edition
95 pages
Intro to Database Systems
100% (1)
Intro to Database Systems
142 pages
Serial and Parallel First 3 Lecture
No ratings yet
Serial and Parallel First 3 Lecture
17 pages
CS2253 - Computer Organization and Architecture PDF
100% (1)
CS2253 - Computer Organization and Architecture PDF
2 pages
Interconnection Networks
No ratings yet
Interconnection Networks
7 pages
Interconnection Networks Overview
No ratings yet
Interconnection Networks Overview
48 pages
Module 4 Chapter 1
No ratings yet
Module 4 Chapter 1
28 pages
Module 3
No ratings yet
Module 3
25 pages
Interconnection Networks
No ratings yet
Interconnection Networks
40 pages
Lecture 4 Network Topologies For Parallel Architecture
No ratings yet
Lecture 4 Network Topologies For Parallel Architecture
34 pages
1multiprocessors and Multicomputers: A. Multiprocessor System Interconnects
No ratings yet
1multiprocessors and Multicomputers: A. Multiprocessor System Interconnects
16 pages
SAP SD Tutorial
No ratings yet
SAP SD Tutorial
174 pages
Microssoft Access 2006
No ratings yet
Microssoft Access 2006
6 pages
B.tech 3 Cse Syllabus
No ratings yet
B.tech 3 Cse Syllabus
65 pages
(IJCST-V10I3P17) :gawaher Soliman Hussein, Asmaa Hanafy Ali
No ratings yet
(IJCST-V10I3P17) :gawaher Soliman Hussein, Asmaa Hanafy Ali
6 pages
1715-AENTR Release Notes
No ratings yet
1715-AENTR Release Notes
9 pages
Intellidox Operating Manual
No ratings yet
Intellidox Operating Manual
94 pages
How People Approach Information
No ratings yet
How People Approach Information
56 pages
Apple Proprietary Schematics
No ratings yet
Apple Proprietary Schematics
86 pages
ATS R Up
No ratings yet
ATS R Up
1 page
Unit-2 Complete Python Program Flow Control Conditional Blocks 22nd Sept
No ratings yet
Unit-2 Complete Python Program Flow Control Conditional Blocks 22nd Sept
61 pages
Safenet Network HSM Formerly Luna Sa Product Brief
No ratings yet
Safenet Network HSM Formerly Luna Sa Product Brief
2 pages
The Complete Guide To Microsoft Windows Server 2025 Datacenter
No ratings yet
The Complete Guide To Microsoft Windows Server 2025 Datacenter
3 pages
Elevator Selection With Destination Control System: January 2006
No ratings yet
Elevator Selection With Destination Control System: January 2006
14 pages
Buy Old and New Gmail Accounts
No ratings yet
Buy Old and New Gmail Accounts
5 pages
Spirent Nomad-UX User Manual
No ratings yet
Spirent Nomad-UX User Manual
78 pages
Prequel 2
No ratings yet
Prequel 2
2 pages
Step 3
No ratings yet
Step 3
1 page
8.8.2 Packet Tracer - Compare CLI and SDN Controller Network Management - ILM
No ratings yet
8.8.2 Packet Tracer - Compare CLI and SDN Controller Network Management - ILM
7 pages
Arista Optical Modules & Cables Overview
No ratings yet
Arista Optical Modules & Cables Overview
26 pages
Backend Two
No ratings yet
Backend Two
19 pages
Interviewready - Io - Low Level System Design
No ratings yet
Interviewready - Io - Low Level System Design
2 pages
1 - Unit 2 - Assignment Brief 1
No ratings yet
1 - Unit 2 - Assignment Brief 1
3 pages
Installed Software
No ratings yet
Installed Software
3 pages
Century Computer Skills and Applications Lessons 10th Edition by Hoggatt Shank Smith ISBN Test Bank
100% (69)
Century Computer Skills and Applications Lessons 10th Edition by Hoggatt Shank Smith ISBN Test Bank
5 pages
Operations Research Simulation History
No ratings yet
Operations Research Simulation History
13 pages
CLAP Roles and Responsibilities Overview
No ratings yet
CLAP Roles and Responsibilities Overview
2 pages
Vandana Verma: Contact
No ratings yet
Vandana Verma: Contact
1 page
9295-1732532901075-Unit 35 - NEW System Analysis and Design - 2024-2025 (2) (AutoRecovered)
No ratings yet
9295-1732532901075-Unit 35 - NEW System Analysis and Design - 2024-2025 (2) (AutoRecovered)
106 pages
Indradrive MPX - 1x
No ratings yet
Indradrive MPX - 1x
90 pages
802.X Standards
No ratings yet
802.X Standards
2 pages

SIMD Array Processor

Uploaded by

SIMD Array Processor

Uploaded by

SIMD ARRAY

• Burroughs Scientific Processor(BSP)

• Each processing elements contain an ALU, registers

• User programs are loaded into the control unit

• The control unit then decodes and decides where the

• Vector instructions are sent to the processing

• The data can be loaded into the processing element’s

• A PE may either be activated or deactivated during an

• Enabled PE only perform execution

• Data exchange between the PE is done via

• It has fewer processing units than ILLIAC IV

• All processors enjoyed equal access to a common logical address space

• Each processing element is nothing more than an arithmetic unit with

1D : Linear; 2D: Ring, Mesh,Star, Tree; 3D: Hypercube

NOTE: 351 page no of two author kai Hwang and Briggs

NOTE: 350 page no of two author kai Hwang and Briggs

NOTE: 351 page no of two author kai Hwang and Briggs

NOTE: refer Video

• If the simultaneous connections of some multiple i/o pairs result in

• If the network can perform all possible connections sources (input)

(refer page no:343

You might also like