CA Classes-211-215
CA Classes-211-215
The kernels were planned to verify vectorisation ability and are able to be
vectorised by hand.
Activity 2:
Visit your local computer vendor and get an expert opinion about vector
processors and their working.
9.6 Summary
There are several representative application areas where vector processing
is of the utmost importance. Depending upon the way the operands are
fetched, vector processors can be segregated into two groups.
Operands are straight away streamed from the memory to the functional
units and outcomes are written back to memory at the time the vector
operation advances in this architecture.
Operands are read into vector registers wherein they are fed to the
functional units and outcomes of operations are written to vector
registers in this architecture.
Vector register architectures have several advantages over vector
memory-memory architectures.
There are several major components of the vector unit of a register-
register vector machine
The various types of vector instructions for a register-register vector
processor are:
Vector-scalar Instructions
Vector-vector Instructions
Vector-memory Instructions
Gather and Scatter Instructions
Masking Instructions
Vector Reduction Instructions
CRAY-1 is one of the oldest processors that implemented vector
processing.
Two issues that arise in real programs: (i) the vector length in a program is
not exactly 64. (ii) Non adjacent elements in vectors that reside in memory.
The structure of the program & capability of the compiler are two factors
that affect the success with which a program can be run in vector mode.
9.7 Glossary
ASC: Advanced Scientific Computer
Data hazards: the conflicts in register accesses
ETA-10: A later shared-memory multiprocessor version of the CDC
Cyber 205.
Functional hazards: the conflicts in functional units.
Gather: an operation that fetches the non-zero elements of a sparse
vector from memory.
Masking instructions: These instructions use a mask vector to expand
or compress a vector
Scatter: It stores a vector in a sparse vector into memory.
SECDED: single-error correction, double-error detection.
Small scale integration: it can pack 10 to 20 transistors in a single
chip.
Strip mining: the vector is partitioned into strips of 64 elements.
Vector reduction instructions: These instructions accept one or two
vectors as input and produce a scalar as output.
9.9 Answers
Self Assessment Questions
1. Vector processors
2. Data parallelism
3. ETA-10
4. True
5. Crossbars
6. Vector-memory instructions
7. False
8. Strip mining
9. Sequential words
10. Structure of the program & capability of the compiler
11. False
Terminal Questions
1. There are various application areas of vector processors which are of
considerable importance. Refer Section 9.2.
2. Depending upon the way the operands are fetched, vector processors
can be segregated into two groups: Memory-memory vector architecture
and Vector-register architecture. Refer Section 9.3.
3. Due to the capability to overlap memory accesses as well as the
probable use of vector processors again, vector-register vector
processors are normally more efficient as compared to memory-memory
vector processors. Refer Section 9.3.
4. a. The CDC Cyber 205 is based on the concepts initiated for the CDC
Star 100; the first commercial model was produced in 1981. Refer
Section 9.4.
b. CRAY-1 is one of the oldest processors that implemented vector
processing. Refer Section 9.5.
c. The vector size may be less than the vector register size, and the
vector size may be larger than the vector register size. Refer
Section 9.6.
d. As vectors are one-dimensional series, saving a vector in memory is
direct: vector elements are stored as sequential words in memory.
Refer Section 9.6.
References:
Hwang, K. (1993). Advanced Computer Architecture. McGraw-Hill.
Godse, D. A. & Godse, A. P. (2010). Computer Organisation. Technical
Publications.
Hennessy, John L., Patterson, David A. & Goldberg David (2011).
Computer Architecture: A Quantitative Approach, Morgan Kaufmann;
5th edition.
Sima, Dezsö, Fountain, Terry J. &Kacsuk, Péter (1997). Advanced
computer architectures - a design space approach. Addison-Wesley-
Longman.
E-references:
https://csel.cs.colorado.edu/~csci4576/VectorArch/VectorArch.html
http://www.cs.clemson.edu/~mark/464/appG.pdf
nasa_fig.gif