Lecture006. Introduction Systolic Array
Lecture006. Introduction Systolic Array
Computing
Fundamentals
What are
Systolic
Arrays?
Simple cells
Each cell performs one operation
(usually)
Definition 2.
H.T.Kung
Most designs are simple and regular in order to keep the VLSI
implementation costs low
programs with simple data and control flow are best
Control
Unit
Control Bus
Processing
Units
Processing
Units
..
Processing
Units
Data Bus
Interconnection Network(Local)
Systolic Array.
Control
Unit
Processing
Units
Control
Unit
Processing
Units
..
Control
Unit
Processing
Units
Interconnection Network(Local)
SIMD array usually loads data into its local memories before starting the
computation.
Systolic arrays usually pipe data from an outside host and also pipe the
results back to the host.
Memory
PE
5 MOPS
100ns
Memory
PE PE ----- PE
30 MOPS
1. Connection Topology
2D Meshes
hypercubes
(FPGAs) offer the possibility that re-programmable, reconfigurable arrays can be constructed to efficiently compute
certain problems.
Early systolic arrays are linear arrays and one dimensional(1D) or two
dimensional I/O(2D).
Most recently, systolic arrays are implemented as planar array with perimeter
I/O to feed data through the boundary.
1D Linear Array
2D Linear Array
My experience story:
Hypercubes in Intel
3-d Array
4-d Array (mapped to 3-D)
3-D Hex Array
Regular
3-D Trees and Lattices
Interconnect in
3D
This is a research area of our group - look to Perkowskis
and Anas Al Rabadis papers
Characteristics
of Systolic
Architectures
Synchrony,
Modularity,
Regularity,
Spatial locality,
Temporal locality,
Pipelinability,
Parallel computing.
Synchrony means that the data is rhythmically computed (Timed by a global clock)
and passed through the network.
Modularity means that the array(Finite/Infinite) consists of modular processing units.
Regularity means that the modular processing units are interconnected with
homogeneously.
cost effective,
array is modular (i.e) adjustable to various performance goals ,
large number of processors work together,
local communication in systolic array is advantageous for communication
to be faster.
Exploit Concurrency
Scalable
Routing costs
dominate: power,
area, and time!
Exploiting Concurrency
Large number of simple PEs
Manage without instruction store
Methods:
Pipelining
SIMD/MIMD
Vector
Sources
1. Seth Copen Goldstein, CMU
2. David E. Culler, UC. Berkeley,
3. [email protected]
4. Syeda Mohsina Afroze
and other students of Advanced Logic Synthesis,
ECE 572, 1999 and 2000.