0% found this document useful (0 votes)

58 views28 pages

Lectures 17 & 18 Fast Packet Switching: Eytan Modiano Massachusetts Institute of Technology

This document discusses packet switching and fast packet switching. It covers three generations of packet switches, including first generation switches that used CPUs, second generation switches that distributed processing to line cards, and third generation switches that replaced shared buses with switch fabrics. It also discusses multi-stage interconnection networks, including distributed buffer, output buffer, and input buffer architectures. Specific networks like Omega and Baseline networks are described. The document discusses concepts like self-routing, contention, and throughput analysis of interconnection networks.

Uploaded by

ablaoublas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views28 pages

Lectures 17 & 18 Fast Packet Switching: Eytan Modiano Massachusetts Institute of Technology

Uploaded by

ablaoublas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Lectures 17 & 18

Fast packet switching

Eytan Modiano
Massachusetts Institute of Technology

Eytan Modiano
Slide 1
Packet switches

Packet
Routing
engine Switch

Scheduler

Packet
Data Header Packet Tag

DestinationAddress Output port number

or VC number

• A packet switch consists of a routing engine (table look-up), a

switch scheduler, and a switch fabric.
• The routing engine looks-up the packet address in a routing table
and determines which output port to send the packet.
– Packet is tagged with port number
– The switch uses the tag to send the packet to the proper output port

Eytan Modiano
Slide 2
First Generation Switches

CPU

LC-1 LC-2 LC-3

output buffer Input buffer

• Computer with multiple line cards

– CPU polls the line cards
– CPU processes the packets
• Simple, but performance is limited by processor speeds and bus
speeds
• Examples: Ethernet bridges and low end routers

Eytan Modiano
Slide 3
Second Generation switches

Computer

Bus

LC LC LC LC

• Most of the processing is now done in the line cards

– Route table look-up, etc.
– Line cards buffer the packets
– Line card send packets to proper output port

• Advantages: CPU and main Memory are no longer the bottleneck

• Disadvantage: Performance limited by bus speeds

– Bus BW must be N times LC speed (N ports)
Eytan Modiano
• Example: CISCO 7500 series router
Slide 4
Third generation switches

Input LC Output LC
N by N
Input LC SWITCH Output LC
FABRIC
Input LC Output LC

Controller

• Replace shared bus with a switch fabric

• Performance depends on the switch fabric, but potentially can
alleviate the bus bottleneck

Eytan Modiano
Slide 5
Switch Architectures

• Distributed buffer

• Output buffer

• Input buffer

Eytan Modiano
Slide 6
Distributed buffer

• Modular Architecture

Basic module is a 2x2 switch, which can be either in the through

or crossed position

• Switch buffers: None, at input, or at output of each module

Switch fabric consists of many 2x2 modules

N N
inputs outputs

Eytan Modiano
Slide 7
Interconnection networks

• N input
• Log(N) stages with N/2 modules per stage

• Notice the order of inputs into a stage is a shuffle of the outputs

from the previous stage: (0,4,1,5,2,6,3,7)
• Easily extended to more stages
• Any output can be reached from any input by proper switch
settings
– Not all routes can be done simultaneously
– Exactly one route between each SD pair
Eytan Modiano
– Self-routing network
Slide 8
Self Routing

• Use a tag: n bit sequence with one bit per stage of the network
– E.g., Tag = b3b2b1

• Module at stage i looks at bit i of the tag (bi), and sends the packet
up if bi=0 and down if bi=1
• In omega network, for destination port with binary address abc the
tag is cba

– Example: output 100 => tag = 001

– Notice that regardless of input port, tag 001 will get you to output 100

Eytan Modiano
Slide 9
Baseline network

• Another Example of a multi-stage interconnection network

• Built using the basic 2x2 switch module
• Recursive construction
– Construct an N by N switch using two N/2 by N/2 switches and a new
stage of N/2 basic (2x2) modules
– N by N switch has Log2(N) stages each with N/2 basic (2x2) modules

2x2
N/2 x N/2
2x2
2x2 2x2
N inputs 4 x 4 switch
example
2x2 2x2 2x2
N/2 x N/2
2x2

N/2 basic mods 2 N/2 by N/2 switches

Eytan Modiano
Slide 10
Contention

• Two packets may want to use the same link at the same time
(same output port of a module)

• Hot spot effect

• Solution: Buffering

Eytan Modiano
Slide 11
Throughput analysis of interconnection

networks

• Assume no buffering at the switches

• If two packets want to use the same port one of them is dropped

• Suppose switch has m stages

• Packet transmit time = 1 slot (between stages)

• New packet arrival at the inputs, every slot

– Saturation analysis (for maximum throughput)
– Uniform destination distribution independent from packet to packet

Eytan Modiano
Slide 12
Interconnection Throughput, continued

• Let P(m) be the probability that a packet is transmitted on a stage

m link
P(m) A C P(m+1)
P(m) B

• P(0) = 1
• P(m+1) = 1 – P(no packet on stage m+1 link (link c) )

= 1 – P(neither inputs to stage m+1 chooses this output)

• Each input has a packet with probability P(m) and that packet will
choose the link with probability 1/2. Hence,
1 2
P(m + 1) = 1− (1 − P( m))
2
• We can now solve for P(m) recursively
• For an m stage network, throughput (per output link) is P(m),
which is the probability that there is a packet at the output
Eytan Modiano
Slide 13
Interconnection Throughput, continued

Throughput of interconnect network

1.2

0.8

0.6

0.4

0.2

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
stages

• Throughput can be significantly improved by adding buffers at the stages

– Buffers increase delay
– Tradeoff between delay and throughput
Eytan Modiano
Slide 14
Advantages/Disadvantages

of multi-stage architecture

• Advantages
– Modular
– Scalable
– Bus (links) only needs to be as fast as the line cards

• Disadvantages
– Delays for going through the stages
Cut-through possible when buffers empty
– Decreased throughput due to internal blocking

• Alternatives: Buffers that are external to the switch fabric

– Output buffers
– Input buffers

Eytan Modiano
Slide 15
Output buffer architecture

N
inputs Interconnect fabric
or
Bus

• As soon as a packet arrives, it is transferred to the appropriate

output buffer
• Assume slotted system (cell switch)
• During each slot the switch fabric transfers one packet from each
input (if available) to the appropriate output
– Must be able to transfer N packets per slot
– Bus speed must be N times the line rate
– No queueing at the inputs
Buffer at most one packet at the input for one slot

Eytan Modiano
Slide 16
Queueing Analysis

• If external arrivals to each input are Poisson (average rate A ),

each output queue behaves as an M/D/1 queue

– packet duration equaling one slot X = X 2 = 1

• The average number of packets at each output is given by (M/G/1

formula):
2
2A − (A )
NQ =
2(1 − A )

• Note that the only delay is due to the queueing at the outputs and
none is due to the switch fabric

Eytan Modiano
Slide 17
Advantages/Disadvantages of

Output buffer architecture

• Advantages: No delay or blocking inside switch

• Disadvantages:
– Bus speed must be N times line speed
Imposes practical limit on size and capacity of switch

• Shared output buffers: output buffers are implemented in shared

memory using a linked list
– Requires less memory (due to statistical multiplexing)
– Memory must be fast

Eytan Modiano
Slide 18
Input buffer architecture

• Packets buffered at input rather than output

– Switch fabric does not need to be as fast
Crossbar switch

1 X

2 X
Scheduler
3 X

4 X

X = connect

1 2 3 4

• During each slot, the scheduler established the crossbar

connections to transfer packets from the input to the outputs
– Maximum of one packet from each input
– Maximum of one packet to each output
• Head of line (HOL) blocking – when the packet at the head of two
or more input queues is destined to the same output, only one can
Eytan Modiano
Slide 19
be transferred and the other is blocked
Throughput analysis of input queued switches

• HOL blocking limits throughput because some inputs

(consequently outputs) are kept idle during a slot even when they
have other packet to send in their queue

• Consider an NxN switch and again assume that inputs are

saturated (always have a packet to send)

• Uniform traffic => each packet is destined to each output with

equal probability (1/N)

• Now, consider only those packets at the head of their queues

(there are N of them!)

Eytan Modiano
Slide 20
Throughput analysis, continued

• i
Let Qm be the number of HOL packets destined to node i at the
end of the mth slot
i i i
Qm = max(0,Qm −1 + Am − 1)

• Where
Ami = number of new HOL messages addressed to node i that arrive
to the HOL during slot m. Now,
 Cm −1 
P( A = l) = 
i
m
 (1/ N )l (1 − 1/ N )Cm− 1 − l
 l 
• Where
Cm −1 = number of HOL messages that departed during the m-1 slot =
number of new HOL arrivals

• As N approaches infinity, Ami becomes Poisson of rate C/N where C

is the average number of departures per slot
Eytan Modiano
Slide 21
Throughput analysis, continued

• In steady-state, Qi behaves as an M/D/1 of rate A and, as before,

2
2 A − (A )
i
Q =
2(1 − A)

• Notice however that the total number of packets addressed to the

outputs is N (number of HOL packets). Hence,
N
2 A − ( A )2
∑Q i
=N => Q = i

2(1 − A)
=1
i =1

We can now solve, using the quadratic equation to obtain:

A = utilization = 2 − 2 ≈ 0.58

Eytan Modiano
Slide 22
Summary of input queued switches

• The maximum throughput of an input queued switch, is limited by

HOL blocking to 58% ( for large N)

– Assuming uniform traffic and FCFS service

• Advantages of input queues:

– Simple
– Bus rate = line rate

• Disadvantages: Throughput limitation

Eytan Modiano
Slide 23
Overcoming HOL blocking

• If inputs are allowed to transfer packets that are not at the head of
their queues, throughput can be substantially improved (not
FCFS)

Example: input 1 1 2

input 2 3 2

input 3 4 3

input 4 4 2

• How does the scheduler decide which input to transfer to which

output?

Eytan Modiano
Slide 24
Backlog matrix

output

1 2 3

1 3 3 0

input 2 2 0 0

3 0 0 2

• Each entery in the backlog matrix represent the number of

packets in input i’s queue that are destined to output j
• During each slot the scheduler can transfer at most one packet
from each input to each output
– The scheduler must choose one packet (at most) from each row, and
column of the backlog matrix
– This can be done by solving a bi-partite graph matching algorithm
– The bi-partite graph consists of N nodes representing the inputs and
N nodes representing the outputs
Eytan Modiano
Slide 25
Bi-partite graph representation

• There is an edge in the graph from an input to an output if there is a

packet in the backlog matrix to be transferred from that input to that
output
– For previous backlog matrix, the bi-partite graph is:

1 1

2 2

3 3

• Definition: A matching is a set of edges, such that no two edges share

a node
– Finding a matching in the bi-partite graph is equivalent to finding a set of
packets such that no two packets share a row or column in the backlog
matrix

• Definition: A maximum matching is a matching with the maximum

possible number of edges
– Finding a maximum matching is equivalent to finding the largest set of
Eytan Modiano
Slide 26
packets that can be transferred simultaneously
Maximum Matchings

• Algorithms for finding maximum matching exist

• The best known algorithms takes O(N2.5) operations
– Too long for large N

• Alternatives
– Sub-optimal solutions
– Maximal matching: A matching that cannot be made any larger for a
given backlog matrix

– For previous example:

(1-1,3-3) is maximal

(2-1,1-2,3-3) is maximum

• Fact: The number of edges in a maximal matching ≥ 1/2 the

number of edges in a maximum matching

Eytan Modiano
Slide 27
Achieving 100% throughput

in an input queued switch

• Finding a maximum matching during each time slot does not

eliminate the effects of HOL blocking
– Must look beyond one slot at a time in making scheduling decisions

• Definition: A weighted bi-partite graph is a bi-partite graph with

costs associated with the edges

• Definition: A maximum weighted matching is a matching with the

maximum edge weights

• Theorem: A scheduler that chooses during each time slot the

maximum weighted matching where the weight of link (i,j) is equal to
the length of queue (i,j) achieves full utilization (100% throughput)

– Proof: see “Achieving 100% throughput in an input queued switch” by

N. McKeown, et. al., IEEE Transactions on Communications, Aug. 1999.

Eytan Modiano
Slide 28

Chapter 3
No ratings yet
Chapter 3
25 pages
ComputerNetwork C5-1 en
No ratings yet
ComputerNetwork C5-1 en
84 pages
Introduction
No ratings yet
Introduction
46 pages
4 - Interconnection Networks
No ratings yet
4 - Interconnection Networks
57 pages
P51a 05and06
No ratings yet
P51a 05and06
125 pages
16.36 Communication Systems Engineering: Mit Opencourseware
No ratings yet
16.36 Communication Systems Engineering: Mit Opencourseware
24 pages
Switch Performance Analysis and Design Improvements: IEG4020 Telecommunication Switching and Network Systems
No ratings yet
Switch Performance Analysis and Design Improvements: IEG4020 Telecommunication Switching and Network Systems
46 pages
MN RAN Networking
No ratings yet
MN RAN Networking
137 pages
MENDELSON Elliott - Introduction To Mathematical Logic
100% (1)
MENDELSON Elliott - Introduction To Mathematical Logic
225 pages
Chapter 4 v8.0
No ratings yet
Chapter 4 v8.0
44 pages
Inside A Router
No ratings yet
Inside A Router
10 pages
ComputerNetwork C5-1 en
No ratings yet
ComputerNetwork C5-1 en
84 pages
Chapter 3
No ratings yet
Chapter 3
32 pages
Lec04 Routers
No ratings yet
Lec04 Routers
36 pages
Switching in Modern Communication Networks
No ratings yet
Switching in Modern Communication Networks
35 pages
ComputerNetwork C5-1 en
No ratings yet
ComputerNetwork C5-1 en
84 pages
Switching
No ratings yet
Switching
63 pages
CN Chaptr 1,2,3
No ratings yet
CN Chaptr 1,2,3
239 pages
CRI Reti 07 Network Layer-Data Plane
No ratings yet
CRI Reti 07 Network Layer-Data Plane
40 pages
Review of Networking Concepts: Prof. Malathi Veeraraghavan University of Virginia
No ratings yet
Review of Networking Concepts: Prof. Malathi Veeraraghavan University of Virginia
57 pages
Structure of A Switch
No ratings yet
Structure of A Switch
9 pages
Chapter 4
No ratings yet
Chapter 4
83 pages
2 Networks
No ratings yet
2 Networks
85 pages
Day 13
No ratings yet
Day 13
18 pages
Module 4 Chapter 1
No ratings yet
Module 4 Chapter 1
28 pages
Network 34
No ratings yet
Network 34
76 pages
EEE 552 - Lecture 4 2024
No ratings yet
EEE 552 - Lecture 4 2024
68 pages
Module 1
No ratings yet
Module 1
99 pages
Packet Scheduling in Multiterrabit Networks
No ratings yet
Packet Scheduling in Multiterrabit Networks
3 pages
Chapter 2
No ratings yet
Chapter 2
28 pages
Unit Iv Hardware Accelerates & Networks
No ratings yet
Unit Iv Hardware Accelerates & Networks
59 pages
Section 4 Switching Methods
No ratings yet
Section 4 Switching Methods
41 pages
Packet Switch Architectures: - Introduction: - Packet Lookup and Classification: - Switching Fabrics
No ratings yet
Packet Switch Architectures: - Introduction: - Packet Lookup and Classification: - Switching Fabrics
20 pages
Switching Technologies-1
No ratings yet
Switching Technologies-1
28 pages
OSI Reference Model
No ratings yet
OSI Reference Model
6 pages
Chapter One: Router and Switch: What Is The Big Benefit of Using Switches To Connect Hosts?
No ratings yet
Chapter One: Router and Switch: What Is The Big Benefit of Using Switches To Connect Hosts?
48 pages
CNS M3 Network Layer
No ratings yet
CNS M3 Network Layer
21 pages
AWS Certified Security Specialty
No ratings yet
AWS Certified Security Specialty
13 pages
Appendix F: Authors: John Hennessy & David Patterson
No ratings yet
Appendix F: Authors: John Hennessy & David Patterson
33 pages
Lecture 18 - 19 - Switching Cont. - Message Switching
No ratings yet
Lecture 18 - 19 - Switching Cont. - Message Switching
28 pages
High Speed Switching
No ratings yet
High Speed Switching
46 pages
Dynamic Networks: CS 213, LECTURE 15 L.N. Bhuyan
No ratings yet
Dynamic Networks: CS 213, LECTURE 15 L.N. Bhuyan
25 pages
Volte SRVCC
100% (4)
Volte SRVCC
41 pages
Lecture 5 Circuit and Packet Switching
No ratings yet
Lecture 5 Circuit and Packet Switching
39 pages
Optical Interconnection Technology in Switches, Routers and Optical Cross Connects
No ratings yet
Optical Interconnection Technology in Switches, Routers and Optical Cross Connects
7 pages
Chapter 1b: Circuit Switching Networks
No ratings yet
Chapter 1b: Circuit Switching Networks
53 pages
Palo Alto High Availability
100% (1)
Palo Alto High Availability
53 pages
CSE 4255: Telecommunication Lecture 3: Switching
No ratings yet
CSE 4255: Telecommunication Lecture 3: Switching
28 pages
Digital Switching
No ratings yet
Digital Switching
147 pages
Presentation of Layer 2 Network
No ratings yet
Presentation of Layer 2 Network
49 pages
Network Layer: The Most Complex Layer
No ratings yet
Network Layer: The Most Complex Layer
75 pages
1multiprocessors and Multicomputers: A. Multiprocessor System Interconnects
No ratings yet
1multiprocessors and Multicomputers: A. Multiprocessor System Interconnects
16 pages
Inter Connection Networks and Cluster
No ratings yet
Inter Connection Networks and Cluster
49 pages
Lect Networking Primer
No ratings yet
Lect Networking Primer
51 pages
Lecture Note On Switch Architectures
No ratings yet
Lecture Note On Switch Architectures
63 pages
Switching
No ratings yet
Switching
30 pages
2.2-2 Network Topology
100% (1)
2.2-2 Network Topology
11 pages
Introduction and Layered Network Architecture
No ratings yet
Introduction and Layered Network Architecture
32 pages
1-Introduction To Computer Networks
No ratings yet
1-Introduction To Computer Networks
39 pages
Physical Layer-Switching
No ratings yet
Physical Layer-Switching
42 pages
A380 25 B2X1 PDF
No ratings yet
A380 25 B2X1 PDF
24 pages
WSND QB
No ratings yet
WSND QB
8 pages
Question Bank For DSS
No ratings yet
Question Bank For DSS
7 pages
Meshlium Technical Guide
No ratings yet
Meshlium Technical Guide
214 pages
Lectures 13 & 14 Packet Multiple Access: The Aloha Protocol
No ratings yet
Lectures 13 & 14 Packet Multiple Access: The Aloha Protocol
19 pages
SDVRP Test Instances: Note That Only Four of Six Possible Demand Scenarios Are Included For Eil76 and Eil101
No ratings yet
SDVRP Test Instances: Note That Only Four of Six Possible Demand Scenarios Are Included For Eil76 and Eil101
2 pages
EQ Coefficients PDF
No ratings yet
EQ Coefficients PDF
15 pages
Switch Fabrics 1: "Centralized" Vs Distributed Switches
No ratings yet
Switch Fabrics 1: "Centralized" Vs Distributed Switches
4 pages
IPABX2024 J
No ratings yet
IPABX2024 J
5 pages
Secure Shell
No ratings yet
Secure Shell
17 pages
Full HD TV: ÝDL-32CX520
No ratings yet
Full HD TV: ÝDL-32CX520
6 pages
Aruba-Certified Mobility Professional (ACMP) 6.1 Study Guide
No ratings yet
Aruba-Certified Mobility Professional (ACMP) 6.1 Study Guide
11 pages
Technology Training That Works Technology Training That Works
No ratings yet
Technology Training That Works Technology Training That Works
30 pages
What Is Adsl Technology?: E Commerce
No ratings yet
What Is Adsl Technology?: E Commerce
14 pages
A Library of Local Search Heuristics For The Vehicle Routing Problem
No ratings yet
A Library of Local Search Heuristics For The Vehicle Routing Problem
23 pages
Lectures 8 & 9 M/G/1 Queues: Eytan Modiano MIT
No ratings yet
Lectures 8 & 9 M/G/1 Queues: Eytan Modiano MIT
14 pages
Pexip Infinity Cisco VCS Deployment Guide V30.a
No ratings yet
Pexip Infinity Cisco VCS Deployment Guide V30.a
14 pages
BL0977 - SIAE Plan - 28jun2018
No ratings yet
BL0977 - SIAE Plan - 28jun2018
21 pages
A Novel Approach To Solve The Split Delivery Vehicle Routing Problem
No ratings yet
A Novel Approach To Solve The Split Delivery Vehicle Routing Problem
15 pages
IRON-MAN: An Approach To Perform Temporal Motionless Analysis of Video Using CNN in Mpsoc
No ratings yet
IRON-MAN: An Approach To Perform Temporal Motionless Analysis of Video Using CNN in Mpsoc
15 pages
Especifica
No ratings yet
Especifica
41 pages
Adaptive Graph Regularized Low-Rank Matrix Factorization With Noise and Outliers For Clustering
No ratings yet
Adaptive Graph Regularized Low-Rank Matrix Factorization With Noise and Outliers For Clustering
13 pages
IC-F5120D - F6120D - A4 Japan
No ratings yet
IC-F5120D - F6120D - A4 Japan
2 pages
Neural Network-Based Distributed Finite-Time Tracking Control of Uncertain Multi-Agent Systems With Full State Constraints
No ratings yet
Neural Network-Based Distributed Finite-Time Tracking Control of Uncertain Multi-Agent Systems With Full State Constraints
10 pages
Catalog, Wireless Motor Pump Remote Controller, FORBIX SEMICON Co.
No ratings yet
Catalog, Wireless Motor Pump Remote Controller, FORBIX SEMICON Co.
4 pages
Activity 2
No ratings yet
Activity 2
3 pages
Requirement Specification 3.1 Hardware Requirements: 3.3.1 NS2 Overview
No ratings yet
Requirement Specification 3.1 Hardware Requirements: 3.3.1 NS2 Overview
8 pages
Design of Stabilizing Switching Laws For Mixed Switched Affine Systems
No ratings yet
Design of Stabilizing Switching Laws For Mixed Switched Affine Systems
6 pages
Cisco 8841 IP Phone CP-8841-USER-GUIDE
No ratings yet
Cisco 8841 IP Phone CP-8841-USER-GUIDE
2 pages
Nk-Maxclique and MMCQ: Tow New Exact Branch and Bound Algorithms For The Maximum Clique Problem
No ratings yet
Nk-Maxclique and MMCQ: Tow New Exact Branch and Bound Algorithms For The Maximum Clique Problem
9 pages
Validation of Email Addresses Collected Offline
No ratings yet
Validation of Email Addresses Collected Offline
12 pages
Openran 5G NR: Whitebox - Flexible - Disaggregated - Small Cell Platform
No ratings yet
Openran 5G NR: Whitebox - Flexible - Disaggregated - Small Cell Platform
2 pages
EB200
No ratings yet
EB200
8 pages
Optical Communication
No ratings yet
Optical Communication
10 pages
Cisco Umbrella Package Comparison
No ratings yet
Cisco Umbrella Package Comparison
2 pages
DS-2CD1121-I 2 MP Fixed Dome Network Camera
No ratings yet
DS-2CD1121-I 2 MP Fixed Dome Network Camera
5 pages
Burglary FirstAlert
No ratings yet
Burglary FirstAlert
1 page
Siddaganga Institute of Technology, Tumkur - 572 103: Usn 1 S I CSPE17
No ratings yet
Siddaganga Institute of Technology, Tumkur - 572 103: Usn 1 S I CSPE17
2 pages
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
From Everand
Exploring BeagleBone: Tools and Techniques for Building with Embedded Linux
Derek Molloy
4/5 (2)
An Introduction To Digital Design
From Everand
An Introduction To Digital Design
Jason King
2/5 (1)

Lectures 17 & 18 Fast Packet Switching: Eytan Modiano Massachusetts Institute of Technology

Uploaded by

Lectures 17 & 18 Fast Packet Switching: Eytan Modiano Massachusetts Institute of Technology

Uploaded by

Lectures 17 & 18

Fast packet switching

DestinationAddress Output port number

• A packet switch consists of a routing engine (table look-up), a

LC-1 LC-2 LC-3

• Computer with multiple line cards

• Most of the processing is now done in the line cards

• Advantages: CPU and main Memory are no longer the bottleneck

• Disadvantage: Performance limited by bus speeds

• Replace shared bus with a switch fabric

Basic module is a 2x2 switch, which can be either in the through

• Switch buffers: None, at input, or at output of each module

Example: Omega (shuffle exchange network)

• Notice the order of inputs into a stage is a shuffle of the outputs

– Example: output 100 => tag = 001

• Another Example of a multi-stage interconnection network

N/2 basic mods 2 N/2 by N/2 switches

• Hot spot effect

• Assume no buffering at the switches

• Suppose switch has m stages

• Packet transmit time = 1 slot (between stages)

• New packet arrival at the inputs, every slot

• Let P(m) be the probability that a packet is transmitted on a stage

= 1 – P(neither inputs to stage m+1 chooses this output)

Throughput of interconnect network

• Throughput can be significantly improved by adding buffers at the stages

• Alternatives: Buffers that are external to the switch fabric

• As soon as a packet arrives, it is transferred to the appropriate

• If external arrivals to each input are Poisson (average rate A ),

– packet duration equaling one slot X = X 2 = 1

• The average number of packets at each output is given by (M/G/1

Output buffer architecture

• Advantages: No delay or blocking inside switch

• Shared output buffers: output buffers are implemented in shared

• Packets buffered at input rather than output

• During each slot, the scheduler established the crossbar

• HOL blocking limits throughput because some inputs

• Consider an NxN switch and again assume that inputs are

• Uniform traffic => each packet is destined to each output with

• Now, consider only those packets at the head of their queues

• As N approaches infinity, Ami becomes Poisson of rate C/N where C

• In steady-state, Qi behaves as an M/D/1 of rate A and, as before,

• Notice however that the total number of packets addressed to the

We can now solve, using the quadratic equation to obtain:

• The maximum throughput of an input queued switch, is limited by

– Assuming uniform traffic and FCFS service

• Advantages of input queues:

• Disadvantages: Throughput limitation

• How does the scheduler decide which input to transfer to which

• Each entery in the backlog matrix represent the number of

• There is an edge in the graph from an input to an output if there is a

• Definition: A matching is a set of edges, such that no two edges share

• Definition: A maximum matching is a matching with the maximum

• Algorithms for finding maximum matching exist

– For previous example:

• Fact: The number of edges in a maximal matching ≥ 1/2 the

in an input queued switch

• Finding a maximum matching during each time slot does not

• Definition: A weighted bi-partite graph is a bi-partite graph with

• Definition: A maximum weighted matching is a matching with the

• Theorem: A scheduler that chooses during each time slot the

– Proof: see “Achieving 100% throughput in an input queued switch” by

You might also like