Lecture 3
Computing
Agenda
O Introduction
O Parallel Computing Platforms: logical & physical
O Logical Organization
   1- Control
   2- Communication
O Flynn Taxonomy
O 1- Control
1- Introduction
❖ A computing platform includes a hardware architecture and a software framework (including application frameworks), where the combination allows software to run.
Parallel Computing Platform
Logical Organization
Fine-grain Parallelism:
❖ Relatively small amounts of computational work are done between communication events.
❖ Low computation to communication ratio.
❖ Facilitates load balancing.
❖ Implies high communication overhead and less opportunity for performance enhancement.
❖ If granularity is too fine, it is possible that the overhead required for communications and synchronization between tasks takes longer than the computation.
Coarse-grain Parallelism:
❖ Relatively large amounts of computational work are done between communication/synchronization events.
❖ High computation to communication ratio.
❖ Implies more opportunity for performance increase.
❖ Harder to load balance efficiently.
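The granularity trade-off can be illustrated with a toy cost model. This is only a sketch under stated assumptions, not a measurement: the function names `parallel_time` and `comp_comm_ratio`, the fixed cost per communication event, and the perfectly even division of work are all hypothetical choices made for illustration.

```c
#include <assert.h>

/* Hypothetical toy model (an assumption, not part of the lecture):
 *   total_work - computation time if everything ran on one processor
 *   num_procs  - processors sharing the work evenly
 *   num_events - communication/synchronization events per processor
 *   comm_cost  - fixed time cost of a single event
 * Returns the estimated parallel run time. */
double parallel_time(double total_work, int num_procs,
                     int num_events, double comm_cost)
{
    assert(num_procs > 0 && num_events >= 0);
    double compute = total_work / num_procs;      /* useful work */
    double communicate = num_events * comm_cost;  /* overhead */
    return compute + communicate;
}

/* Computation-to-communication ratio seen by one processor. */
double comp_comm_ratio(double total_work, int num_procs,
                       int num_events, double comm_cost)
{
    assert(num_procs > 0 && num_events > 0 && comm_cost > 0.0);
    return (total_work / num_procs) / (num_events * comm_cost);
}
```

With 1000 units of work on 4 processors and a cost of 1 unit per event, a coarse-grain split with 10 events gives a ratio of 25 and an estimated time of 260. A fine-grain split with 1000 events gives a ratio of 0.25 and an estimated time of 1250, which is slower than running the 1000 units serially.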
Which is Best?
❖ The most efficient granularity is dependent on the algorithm and the
hardware environment in which it runs.
Models:
Flynn's Classical Taxonomy
❖ Flynn's classification scheme is based on the notion of a stream of information: a computer processes an instruction stream and a data stream.
❖ Each of these two dimensions (Instruction and Data) can have only one of two possible states: Single or Multiple.
❖ It is one of the most widely used classifications of parallel computers, in use since 1966.
The matrix below defines the 4 possible classifications according to Flynn:
                         Single Data    Multiple Data
Single Instruction       SISD           SIMD
Multiple Instruction     MISD           MIMD
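The four classes follow mechanically from the two dimensions, which a small C sketch can make explicit. The names `flynn_class`, `classify`, and `flynn_name` are hypothetical, and treating "Multiple" as "more than one stream" is the assumption used here.

```c
#include <assert.h>

/* The four classes of Flynn's taxonomy. */
typedef enum { SISD, SIMD, MISD, MIMD } flynn_class;

/* Classify a machine by its number of concurrent instruction streams
 * and data streams. Each count must be at least 1; "Single" means
 * exactly one stream and "Multiple" means more than one. */
flynn_class classify(int instruction_streams, int data_streams)
{
    assert(instruction_streams >= 1 && data_streams >= 1);
    int multi_instr = instruction_streams > 1;
    int multi_data  = data_streams > 1;
    if (!multi_instr)
        return multi_data ? SIMD : SISD;
    return multi_data ? MIMD : MISD;
}

/* Human-readable name for a class. */
const char *flynn_name(flynn_class c)
{
    static const char *names[] = { "SISD", "SIMD", "MISD", "MIMD" };
    return names[c];
}
```

For instance, `classify(1, 8)` yields `SIMD` (one instruction stream over eight data streams), while `classify(4, 4)` yields `MIMD`.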
Flynn's Taxonomy
SISD (Single Instruction, Single Data):
❖ Single data: only one data stream is being used as input during any one clock cycle.
❖ This is the oldest and, even today, the most common type of computer.
SIMD (Single Instruction, Multiple Data):
❖ Processor Arrays: ILLIAC IV, DAP, Connection Machine CM-2, MasPar MP-1.
❖ Vector Pipelines: IBM 9000, Cray X-MP, Y-MP & C90, Fujitsu VP, NEC SX-2, Hitachi S820, ETA10.
❖ Example: a loop that applies the same operation to every array element:
for (i = 0; i < 1000; i++)
    c[i] = a[i] + b[i];
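To make the data-parallel nature of the loop above concrete, it can be rewritten so that each step applies one addition to a whole group of elements. This is a plain-C emulation of the idea, not real vector hardware or vendor intrinsics; `LANES`, `vec_add_lanes`, and `simd_style_add` are hypothetical names, and the lane width of 4 is an arbitrary assumption.

```c
#include <assert.h>
#include <stddef.h>

#define LANES 4  /* assumed lane width, chosen only for illustration */

/* Conceptually ONE instruction: the same add applied to LANES elements.
 * On SIMD hardware these additions would execute in lockstep; here the
 * inner loop merely emulates that behaviour. */
static void vec_add_lanes(const float *a, const float *b, float *c)
{
    for (int lane = 0; lane < LANES; lane++)
        c[lane] = a[lane] + b[lane];
}

/* Adds n elements of a and b into c.
 * Returns the number of conceptual vector steps that were used. */
size_t simd_style_add(const float *a, const float *b, float *c, size_t n)
{
    assert(a && b && c);
    size_t i = 0, steps = 0;
    for (; i + LANES <= n; i += LANES, steps++)
        vec_add_lanes(a + i, b + i, c + i);
    for (; i < n; i++)  /* scalar tail when n is not a multiple of LANES */
        c[i] = a[i] + b[i];
    return steps;
}
```

Under these assumptions, the 1000-element addition takes 250 conceptual vector steps instead of 1000 scalar iterations. Every lane performs the same operation, which is what the single instruction stream requires.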
Your Turn!
What do you think the drawbacks of SIMD are?