Inter Connect Performance
Inter Connect Performance
CSL718 :: Multiprocessors
Multiprocessors
Interconnection Mechanisms
Performance Models
20th April, 2006
M M M M M M
Global Interconnection Network
M M M
Anshul Kumar, CSE IITD slide 2
Shared
Shared Bus
Bus
each processor sees this picture:
processing
bus access
bus transaction time
bus utilization
processing time bus transaction time
prob of a processor using the bus =
prob of a processor not using the bus = 1 –
prob of none of the n processors using the bus = (1 – )n
prob of at least one processor using the bus = 1 – (1 – )n
achieved BW on a relative scale = 1 – (1 – )n
required BW = n available BW = 1
Anshul Kumar, CSE IITD slide 3
Effect
Effect of
of re-submitted
re-submitted requests
requests
1- + PA (1-PA ) 1-PA
A W
prob = qA PA prob = qW
PA PA
qA qW 1 q A
PA 1 PA PA 1
actual request rate a q A q w
PA PA
1
PA 1 PA 1 PA 1
a
BW n a 1 1 a
n
also PA a
a
1
Anshul Kumar, CSE IITD slide 4
Shared Bus : BW per proc
0.600
0.500 n=2
0.400 n=3
0.300 n=4
BW achieved
0.200 n=2
0.100 n=3
0.000 n=4
-0.100 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
BW required (req probability)
Shared Bus : utilization
1.200
1.000 n=2
0.800 n=3
0.600 n=4
utilization
0.400 n=2
0.200 n=3
0.000 n=4
-0.200 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
req probability
Waiting
Waiting time
time
waiting time i Tbus
if request is rejected i times and accepted on (i 1 )th attempt
probability of this (1 PA ) i PA
Expected value of waiting time Tw i Tbus (1 PA ) i PA
i 1
1 PA
Tbus PA i (1 PA ) Tbus PA
i
i 1 1 (1 PA )
2
1 PA a
Tbus Tbus
PA a
SWITCH
link M
link link
link P link
S S node
M
node
M node
M
P P P link
node
M
DIRECT
INDIRECT
Anshul Kumar, CSE IITD slide 10
Static
Static and
and Dynamic
Dynamic Networks
Networks
• Static Networks
– fixed point to point connections
– usually direct
– each node pair may not have a direct connection
– routing through nodes
• Dynamic Networks
– connections established as per need
– usually indirect
– path can be established between any pair of nodes
– routing through switches
Anshul Kumar, CSE IITD slide 11
Static
Static Network
Network Topologies
Topologies
Non-uniform connectivity
Linear 2D-Mesh
Tree
Star
Ring
Torus
Fully Connected
0
0 1 2 8 1
3 4 5 7 2
6 7 8
6 3
neighbors of node r : 5 4
(r 1) mod 9 and
(r 3) mod 9 Chordal Ring
Anshul Kumar, CSE IITD slide 14
Fat
Fat Tree
Tree Network
Network
simplest 22
cross-bar switch
000 000
001 001
010 010
011 011
100 100
101 101
110 110
111 111
non-blocking
Anshul Kumar, CSE IITD slide 18
Switching
Switching Mechanism
Mechanism
• Circuit Switching (connection oriented
communication)
– A circuit is established between the source and
the destination
• Packet Switching (connectionless
communication)
– Information is divided into packets and each
packet is sent independently from node to node
header payload/data
store & forward
routing H l
BW BW
H l
latency n
BW BW
time
worm hole H l
latency n
routing BW BW
overhead Tx time=bytes/BW
sender
time of flight
Tx time=bytes/BW overhead
receiver
transport latency
total latency
time
Anshul Kumar, CSE IITD slide 23
Other
Other Parameters
Parameters
• Throughput Bandwidth (no credit for header)
• Bisection bandwidth = BW across a bisection
• Node degree
• Network Diameter
• Cost
• Fault Tolerance
n is number of dimensions
k d is av. no. of hops
along one dimension
kd
r is prob of message req in a cycle
k-ary n-cube
i
m 1 m
m m
Anshul Kumar, CSE IITD slide 29
Switch
Switch Performance
Performance –– contd.
contd.
k
Expected BW (on relative scale) E (i )q (i )
i 0
k m 1
i
k i
1 m C i r (1 r ) k i
i 0 m
k m k
1 k i
i
m Ci r (1 r )
k i k i
m Ci r (1 r )
k i
i 0 i 0 m
i
k
m 1
k
m k Ci r i (1 r ) k i m k Ci r (1 r ) k i
i 0 i 0 m
k k
m 1 r
m m r (1 r ) m m 1
m m
Anshul Kumar, CSE IITD slide 30
Switch
Switch Performance
Performance –– contd.
contd.
Requested bandwidth r k
Expected BW (if there were no output port conflicts) m
k
r
Expected BW (because of output port conflicts) m m 1
m
this is less than m as well as r k (assuming that r 1)
BW
prob of acceptance of requests PA
rk
We now consider effect of request re - submission due to conflicts.
We need to compute revised request rate due to
re - submission and also compute delays because of waiting.