Switches, Routers and Networks
Switches, Routers and Networks
Interconnection routing
Buffering -input and output
Local area networks (LANs)
Metropolitan area networks (MANs)
Wide area networks (WANs)
Trends
Introduction
the concentrator
0
0
11 – the 2x2 2-state point-to-point switch (switching cell) with a set of
interconnection lines such that
every node is an object with an array of inputs and an array of outputs
an interconnection line leads from an output of one node to an input of
another node
every I/O of a node is incident with at most one interconnection line
an I/O is called external if it is not incident with any interconnection
line
Construct an N by N switch using two N/2 by N/2 switches and a new stage of N/2
basic (2x2) modules
N by N switch has Log2(N) stages each with N/2 basic (2x2)
Complexity issues
• There are many different parameters that are used to consider the
complexity of an interconnection network
• Line complexity: number of interconnection lines
• Node (cell) complexity: number of small nodes (mxn where m < 3 and
n < 3)
• Depth: maximum number of nodes on a route (assuming an acyclic
interconnection network)
• Entropy of a switch: log of the number of connections states
• What relations exist between complexity and the capabilities of a
switch?
Complexity
�e � n
log(2J )
( )= n log n -1.44n + log(
� log n! ()
)+
�2 �2 so component
complexity is bounded from below by () -1.44n + Q(()
n logn logn )
16x16 60x60
4x4
4x4 6x6
10x10
2x2
2x2 2x2
2x2 2x2
3x3 5x5
2x2
Divide and conquer
• Basic blocks need not be 2x2, trees need not be balanced
• A three stage approach in which we use as the middle stage two
networks of size 2n-1 x 2n-1 to build a network of size 2n x 2n
2n-1
cells
2n-1
cells
• We denote by [nxm, rxp, mxq] the 3-stage network with r nxm input
nodes, m rxp middle nodes, p mxq output nodes such that – output y of
input node x is linked to input x of of middle node y – output u of middle
node y is linked to input y of output node u
• Rearrangeability theorem: the 3-stage network is rearrangeable iff
1 2
P(m + 1) = 1-(1 - P(m))
2
• We can now solve for P(m) recursively
• For an m stage network, throughput (per output link) is P(m), which is
the probability that there is a packet at the output
• Modular Architecture
• Switch buffers: None, at input, or at output of each module Switch
fabric consists of many 2x2 modules
• Solution: Buffering
Distributed buffer
Contention and buffering
Buffers increase delay
Tradeoff between delay and throughput
X=X=1
– packet duration equaling one slot
• The average number of packets at each output is given by (M/G/1
formula):
2
2 A -( A)
N=
Q
2(1 -A)
Note that the only delay is due to the queueing at the outputs and none
is due to the switch fabric
Advantages: No delay or blocking inside switch
Disadvantages:
– Bus speed must be N times line speed
• Imposes practical limit on size and capacity of switch
• Shared output buffers: output buffers are implemented in shared
memory using a linked list
i ii
( A )2
i
Q
=
2(1 -A)
• Notice however that the total number of packets addressed
N
to the outputs is N (number of HOL packets). Hence, i =>
( A )2
i
LQ = N Q =
=1
i =1
2(1 -A)
• We can now solve, using the quadratic equation to obtain:
A = utilization = 2 -
2 � 0.58
• The maximum throughput of an input queued switch, is limited by HOL
blocking to 58% ( for large N)
Example:
How does the scheduler decide which input to transfer to which output?
31 233 0
2 0 0
1 input 2 0 0 2
• Each entry in the backlog matrix represent the number of packets in input i’s queue
that are destined to output j
• During each slot the scheduler can transfer at most one packet from each input to
each output
The scheduler must choose one packet (at most) from each row, and column of the
backlog matrix
This can be done by solving a bi-partite graph matching algorithm
The bi-partite graph consists of N nodes representing the inputs and N nodes
representing the outputs
MIT
Bi-partite graph representation
4096 + 96 + 64
• SCSI drawbacks:
Two or more I/O controllers cannot easily share SCSI devices on the same I/O bus,
so a single server controls connections between users and their data
Address on an I/O bus: 8 or 16 addresses depending on implementation
Distance 25 m
bus, the requirement for more storage has driven extending the SCSI
interface to many devices and eventually replacing a single storage
device with a full network, the storage area network (SAN)
• Based on Fibre Channel protocol (FC) fiber channel:
Gigabit per second bandwidth (1063 Mbps) and theoretically up to 4
Gbps
Allows SCSI in serial form rather than the parallel form usually found
in SCSI (also supports HIPPI and IPI I/O protocols)
Distance of up to 10 km
24-bit address identifier – up to 16 million ports
FC
Port Level
NodeLevel
• Arbitrated loop topology:
up to 126 devices in a serial loop configuration
Each port discovers when it has been attached
No collisions
Fair access: every port wanting to initiate traffic gets to do so before
another port gets a
second shot
Differen
t types of FC SAN architectures
• Fabric topology
FC switch
This is not a shared bus!