Lecture 4
Lecture 4
Computing
LECTURE 4
1
O Parallel Computing Platform
O Logical Organization
Agenda 1- Control
2- Communication
O Physical Organization: Interconnection networks:
1- Static Network
▪ Topology
▪ Evaluation of networks
2
Parallel Computing Platform
Logical Organization
3
Parallel Computing Platform
Logical Organization
Platforms that provide a shared data space are Platforms that support messaging are called
called shared-address-space machines or message passing platforms or multi-computers.
multiprocessors
Parallel Computing Platform
Logical Organization
❖Changes in a memory location effected by one processor are visible to all other
processors (global address space).
6
Parallel Computing Platform
1- Accessing Shared data (cont.) Logical Organization
❖Shared memory machines can divided into two main classes based upon
memory access times:
7
Parallel Computing Platform
1- Accessing Shared data (cont.) Logical Organization
❖Shared memory machines can divided into two main classes based upon
memory access times:
Platforms that provide a shared data space are Platforms that support messaging are called
called shared-address-space machines or message passing platforms or multi-computers.
multiprocessors
2- Exchanging messages (Cont.) Parallel Computing Platform
Logical Organization
Message passing systems are a class of multi-computers [clustered
workstations] in which each processor has access to its own local memory
▪ Each one operates independently.
▪ Changes it makes to its local memory have no effect on the
memory of other processors.
▪ Hence, the concept of cache coherency does not apply.
Memory Memory
Memory
2- Exchanging messages (Cont.) Parallel Computing Platform
Logical Organization
❖These platforms are programmed using (variants of) send and receive
primitives. {GetID, NumProcs}.
❖On each CPU runs a serial process that can communicate with other
processes on other CPUs by means of the network.
(MPI)—A distributed memory parallel programming language
❖Synchronizes well with Data Parallelism.
15
Your Turn Distributed Memory
Shared Memory
Advantages
Advantages
Global address space provides a user-friendly Memory is scalable with number of processor,
Increase the number of processors and the size of
programming prespective to memory. memory increases proportionally.
Data sharing between tasks is both fast and Each processor can rapidly access its own
uniform due to the proximity of memory to memory without interference and without
CPUs overhead incurred with trying to maintain cache
coherency.
Cost effectiveness
Disadvantages
Lack of scalability between memory and Disadvantages
CPUs. Adding more CPUs can increases traffic
on the shared memory and CPU path Programmer responsible for many details
associated with data communication between
Expensive processors.
Difficult to map existing data structures, based on
global memory, to this memory organization
42
Interconnection Networks
❖Provide another mechanisms for data transfer between processors and memory
modules
17
❖ Interconnection networks can be classified as static or dynamic.
❖ Static networks:
✓ Consist of point-to-point communication links among processing nodes.
❖ Dynamic networks:
✓ Are built using switches and communication links.
✓ Communication links are connected to one another dynamically by the
switches to establish paths among processing nodes and memory banks.
21
A- Static Interconnection Network
Evaluating Static Interconnection Networks
❖Diameter: The maximum distance between any two processing nodes in
the network. (number of hops through which a message in transferred on
its way from one point to another )
❖Bisection Width: The minimum number of wires you must cut to divide
the network into two equal parts.
❖Cost: The number of links or switches besides the length of wires, etc., are
factors in to the cost.
22
A- Static Interconnection Networks
1. Complete network (clique)
2. Star network
3. Linear array
4. Ring
5. Tree
6. 2D & 3D mesh/torus
7. Hypercube
8. Fat tree
23
A- Static Interconnection Networks
1- Completely Connected
❖ Each processor is connected to every other processor.
2-Star
❖ Every node is connected only to a common node at the center.
3- linear
❖ Each node has two neighbors, one to its left and one to its right.
4- Ring (1D)
❖ It is linear but the nodes at either end are connected.
21
A- Static Interconnection Networks
5- 2D & 3D mesh
❖ Has nodes with 4 neighbors, to the north, south, east, and west
t.
❖Good match for discrete simulation and matrix operations
❖ Easy to manufacture and extend
❖ Examples: Cray 3D (3d torus), Intel Paragon (2D mesh)
A- Static Interconnection Network
6.Hypercubes
❖ A special case of a d-dimensional mesh is a hypercube.
❖Bisection Width: The minimum number of wires you must cut to divide
the network into two equal parts.
❖Cost: The number of links or switches besides the length of wires, etc., are
factors in to the cost.
33
Calculate it A- Static Interconnection Network