0% found this document useful (0 votes)

60 views20 pages

Eitan Zahavi Isaac Keslassy Avinoam Kolodny

Learning Routing System

Uploaded by

Irwan Efendy Laudin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

60 views20 pages

Eitan Zahavi Isaac Keslassy Avinoam Kolodny

Learning Routing System

Uploaded by

Irwan Efendy Laudin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

DISTRIBUTED ADAPTIVE ROUTING

FOR BIG-DATA APPLICATIONS

RUNNING ON DATA CENTER NETWORKS

Eitan Zahavi*+
Isaac Keslassy+
Avinoam Kolodny+

ANCS 2012

Mellanox Technologies LTD,

Department
*

Technion - EE

Big Data Larger Flows

Data-set sizes keep rising

Web2 and Cloud Big-Data applications

Data Center Traffic changes to:

Longer, Higher BW and Fewer Flows

Google

Static Routing of Big-Data = Low BW

Static Routing cannot balance a small number of

flows
Congestion: when BW of link flows > link capacity
When longer and higher-BW flows contend:

On lossy network: packet drop BW drop

On lossless network: congestion spreading BW drop
Data flow

S
R

Traffic Aware Load Balancing Systems

Adaptive Routing adjusts routing to

network load
Centralized

Flows are routed according to

a global knowledge
Central
Routing
Control

Distributed

Each flow is routed by its input

switch with local knowledge

Self
Routing
Unit
S
R

S
R

Central vs. Distributed Adaptive Routing

Property

Central Adaptive
Routing

Distributed Adaptive
Routing

Scalability

Low

High

Knowledg
e

Global

Local (to keep

scalability)

NonYes
BlockingDistributed

Unknown

Adaptive Routing is
either scalable or have global knowledge
It is Reactive

Research Question
6

Can a Scalable Distributed Adaptive

Routing System perform like centralized
system and produce non-blocking
routing assignments in reasonable time?

Trial and Error

Is Fundamental to Distributed AR
7

Randomize output port Trial 1

Send the traffic
Contention 1
Un-route contending flow

Send the traffic

Contention 2
Un-route contending flow

Randomize new output port Trial 2

Randomize new output port Trial 3

Send the traffic

Convergence!

Routing Trials Cause BW Loss

Packet Simulation:
R1 is delivered followed by G1
R2 is stuck behind G1
Re-route
R3 arrives before R2

R1
R2
R3
SR

R1
SR

Out-of-Order Packets delivery!

Implications are significant drop in flow BW

TCP* sees out-of-order as packet-drop and throttle the senders

See Incast papers

* Or any other reliable transport

Research Plan
9

Given
events
Ne
w

1.
2.

Tr
Tr
ial
ial
Tr
1
2
aff
ic

Tr
ial
N

No
C

t
on
te

nti
on

Analyze Distributed Adaptive Routing systems

Find how many routing trials are required to
converge
Find conditions that make the system reach a nonblocking
assignment in a reasonable time

A Simple Policy for Selecting a Flow to ReRoute

At each time step

Each output switch
Request

re-route of a single worst

contending flow
1

1
1

At t=0 New traffic pattern is applied

n
Randomize output-ports
SR
and Send flows
At t=0.5 Request Re-Routes
SR
Repeat for t=t+1 until no contention

r
m

input
switch

output
switch

Evaluation
11

Measure average number of iterations I to

convergence
I is exponential with system size !

A Balls and Bins

Representation

Each output switch is a balls and bins system

Bins are the switch input links, balls are the link
flows
Assume 1 ball (=flow) is allowed on each bin (=link)

A good bin has 1 ball

Bins are either empty, good or bad
Middle Switch
S
R

S
R

empty

bad
good

System Dynamics
13

Two reasons of ball moves

Improvement or Induced-move
Induced

Output switch 1
Middle Switch:

2
2

Improve

Middle Switch:

SW2

Output switch 2

SW1

SW3

Balls are numbered by their input switch number

The Last Step Governs Convergence

Estimated Markov chain models

What is the probability of the required last
Improvement to not cause a bad Induced
move?
Each one of the r output-switches must do
that step
Absorbing 1
Absorbing
B
A
Therefore convergence time is exponential
1
0
D
Output switch 1
C
with r
B
A

Output switch 2

Bad

Good

Bad

Good

0
C

Bad

Output switch r

Good

0
C

Introducing p
15

Assume a symmetrical system: flows have

same BW
What if the Flow_BW < Link_BW?
The network load is Flow_BW/Link_BW
p = how many balls are allowed in one bin
p=2
p=1
p=1
p=2

S
R

p has Great Impact on Convergence

Measure average number of iterations I to

convergence
I shows very strong dependency on p

Implementable Distributed System

Replace congestion detection by flow-count with

QCN
Detected on middle switch output not output
switch input
Replace worst flow selection by congested flow
sampling
Implement as extension to detailed InfiniBand flit
level model

52% Load on 1152 nodes Fat-Tree

No change in number of adaptations

over time !
No convergence

48% Load on 1152 nodes Fat-Tree

Switch Routing Adaptations/ 10usec

t [sec]

Conclusions
20

Study: Distributed Adaptive Routing of Big-Data flows

Focus on: Time to convergence to non-blocking routing

Learning: The cause for the slow convergence

Corollary: Half link BW flows converge in few iterations

Evaluation: 1152 nodes fat-tree simulation reproduce
these results

Distributed Adaptive Routing of Half Link_BW

Flows
is both Non-Blocking and Scalable

C331 - C531 - Maintenance Manual Rev 2
100% (1)
C331 - C531 - Maintenance Manual Rev 2
219 pages
Accenture Disruptability POV PDF
No ratings yet
Accenture Disruptability POV PDF
13 pages
Adaptive Opportunistic Routing For Wireless Ad Hoc Networks
No ratings yet
Adaptive Opportunistic Routing For Wireless Ad Hoc Networks
44 pages
Network Routing DV
No ratings yet
Network Routing DV
53 pages
Network Analysis
0% (1)
Network Analysis
204 pages
Network Analysis Dec 06
No ratings yet
Network Analysis Dec 06
204 pages
Data Communication & Computer Network: Presented by
No ratings yet
Data Communication & Computer Network: Presented by
32 pages
Data and Computer Communications
No ratings yet
Data and Computer Communications
31 pages
Routing in Circuit Switched NT K Network
No ratings yet
Routing in Circuit Switched NT K Network
34 pages
Neely 2010
No ratings yet
Neely 2010
211 pages
A Project Review On
No ratings yet
A Project Review On
46 pages
Routing in Switched Networks: Stallings, Chapter 12
No ratings yet
Routing in Switched Networks: Stallings, Chapter 12
21 pages
Data and Computer Communications
No ratings yet
Data and Computer Communications
32 pages
Network Algorithms
No ratings yet
Network Algorithms
45 pages
Data and Computer Communications: - Routing in Switched Networks
No ratings yet
Data and Computer Communications: - Routing in Switched Networks
32 pages
Data and Computer Communications
No ratings yet
Data and Computer Communications
32 pages
Marasevic Columbia 0054D 13545
No ratings yet
Marasevic Columbia 0054D 13545
232 pages
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
No ratings yet
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
36 pages
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
No ratings yet
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
35 pages
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
No ratings yet
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
35 pages
Chapter 7 R
No ratings yet
Chapter 7 R
48 pages
Network Routing Algorithms
No ratings yet
Network Routing Algorithms
42 pages
Routing Concepts
No ratings yet
Routing Concepts
27 pages
Lecture6 FAS2024
No ratings yet
Lecture6 FAS2024
41 pages
Lecture 8 and 9
No ratings yet
Lecture 8 and 9
49 pages
Routing Algorithms Routing Algorithms - II: Reading Assignment Reading Assignment
No ratings yet
Routing Algorithms Routing Algorithms - II: Reading Assignment Reading Assignment
26 pages
13 Link State
No ratings yet
13 Link State
34 pages
Performance Modeling of Communication Networks With Markov Chains
No ratings yet
Performance Modeling of Communication Networks With Markov Chains
90 pages
Van Bemten2016NetworkCalculus AComprehensive
No ratings yet
Van Bemten2016NetworkCalculus AComprehensive
57 pages
Fairqueue
No ratings yet
Fairqueue
22 pages
Next-Hop Routing 2. Network-Specific Routing: A B C D R2 R4
100% (1)
Next-Hop Routing 2. Network-Specific Routing: A B C D R2 R4
9 pages
Shortest Path Routing: Reading: Sections 4.2 and 4.3.4
No ratings yet
Shortest Path Routing: Reading: Sections 4.2 and 4.3.4
35 pages
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
No ratings yet
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
35 pages
IOSRJEN (WWW - Iosrjen.org) IOSR Journal of Engineering
No ratings yet
IOSRJEN (WWW - Iosrjen.org) IOSR Journal of Engineering
5 pages
Routing Algorithms-I: © Sudhakar Yalamanchili, Georgia Institute of Technology (Except As Indicated)
No ratings yet
Routing Algorithms-I: © Sudhakar Yalamanchili, Georgia Institute of Technology (Except As Indicated)
52 pages
4 Basic Routing Theory II - Adaptive Routing: Log N) For Oblivious Routing On The
No ratings yet
4 Basic Routing Theory II - Adaptive Routing: Log N) For Oblivious Routing On The
8 pages
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
No ratings yet
Shortest-Path Routing: Reading: Sections 4.2 and 4.3.4
35 pages
CC CHP 12
No ratings yet
CC CHP 12
31 pages
Chapter8 QueuingTheory
No ratings yet
Chapter8 QueuingTheory
74 pages
Congestion Control & Routers
No ratings yet
Congestion Control & Routers
29 pages
Chapter 5 Part-2 (Network Layer - Types)
No ratings yet
Chapter 5 Part-2 (Network Layer - Types)
31 pages
BCA Routing
No ratings yet
BCA Routing
61 pages
2130 Slides5
No ratings yet
2130 Slides5
19 pages
Net. Fall 2024 Lec. 11
No ratings yet
Net. Fall 2024 Lec. 11
43 pages
Routing Algorithms
No ratings yet
Routing Algorithms
23 pages
Chapter III
No ratings yet
Chapter III
25 pages
Week11.2 Final
No ratings yet
Week11.2 Final
47 pages
Design Routing Protocols For Mobile Ad Hoc Networks
No ratings yet
Design Routing Protocols For Mobile Ad Hoc Networks
6 pages
RIPE Meeting 2019 - Internet Clouds Are Also Unpredictable
No ratings yet
RIPE Meeting 2019 - Internet Clouds Are Also Unpredictable
36 pages
Unit-3 PART-B Routing Algorithms
100% (1)
Unit-3 PART-B Routing Algorithms
24 pages
Unicast Routing - Mukesh
No ratings yet
Unicast Routing - Mukesh
29 pages
Routing (Cont.)
No ratings yet
Routing (Cont.)
32 pages
1998 Regularizability of Complex Switched Server Queueing Networks
No ratings yet
1998 Regularizability of Complex Switched Server Queueing Networks
9 pages
Unit 4 Network Layer
No ratings yet
Unit 4 Network Layer
43 pages
Session 18,19,20 NetworkAlgorithms
No ratings yet
Session 18,19,20 NetworkAlgorithms
45 pages
Routing Algorithms
No ratings yet
Routing Algorithms
12 pages
Routing Tables: Gennia Soor Ipsita Dutta Jayoti Das Gourav Lahiri Ishan Nagwani Gagandeep Singh
No ratings yet
Routing Tables: Gennia Soor Ipsita Dutta Jayoti Das Gourav Lahiri Ishan Nagwani Gagandeep Singh
14 pages
Revision Before Studying Routing in Manet
No ratings yet
Revision Before Studying Routing in Manet
20 pages
Mkt4218: New Product and Innovation
No ratings yet
Mkt4218: New Product and Innovation
36 pages
Math Lesson Plan The Vitruvian Man
No ratings yet
Math Lesson Plan The Vitruvian Man
9 pages
Introduction To Computer System
100% (1)
Introduction To Computer System
66 pages
Hints of Assignment5 - Fall 2024
No ratings yet
Hints of Assignment5 - Fall 2024
11 pages
Micro Focus and 'Dialects'
No ratings yet
Micro Focus and 'Dialects'
2 pages
Paytm Annual Report 2024
No ratings yet
Paytm Annual Report 2024
416 pages
RizzCraft Color Guide
100% (1)
RizzCraft Color Guide
17 pages
ECE 2006 Semester II
No ratings yet
ECE 2006 Semester II
4 pages
Grade 6 ICT Revised Text Book
No ratings yet
Grade 6 ICT Revised Text Book
65 pages
Alerting FAQ MFMon
No ratings yet
Alerting FAQ MFMon
18 pages
Waveform Analysis Software User Manual V1.1
No ratings yet
Waveform Analysis Software User Manual V1.1
25 pages
18 Pytest
No ratings yet
18 Pytest
9 pages
Unit 5 - Week 3: Assignment 3
No ratings yet
Unit 5 - Week 3: Assignment 3
5 pages
Implementation of Nepal National Building Code Through Automated Building Permit System
No ratings yet
Implementation of Nepal National Building Code Through Automated Building Permit System
8 pages
Discrete Mathematics Question Paper
0% (2)
Discrete Mathematics Question Paper
4 pages
Large Rhombicosidodecahedron PDF
No ratings yet
Large Rhombicosidodecahedron PDF
11 pages
How To Do ESD Protection During SMT Assembly Process
No ratings yet
How To Do ESD Protection During SMT Assembly Process
18 pages
Mobile Application SRS
100% (1)
Mobile Application SRS
9 pages
Sop - Vor
No ratings yet
Sop - Vor
3 pages
Change The VxRail Manager IP Address
No ratings yet
Change The VxRail Manager IP Address
2 pages
ARC Family Disaster Plan Template r083012 0
No ratings yet
ARC Family Disaster Plan Template r083012 0
3 pages
Poland
No ratings yet
Poland
2 pages
Application of Queueing Theory in Healthcare A Literature Review
No ratings yet
Application of Queueing Theory in Healthcare A Literature Review
5 pages
Strivers A2Z DSA Completion Plan
No ratings yet
Strivers A2Z DSA Completion Plan
2 pages
Mathlinks 9 Review Bundles CH 2
No ratings yet
Mathlinks 9 Review Bundles CH 2
4 pages
Oscalc Manual
No ratings yet
Oscalc Manual
90 pages
Common Cathode Fast Recovery Epitaxial Diode (FRED) : Dsek 60 I 2x 30 A V 600 V T 35 Ns
No ratings yet
Common Cathode Fast Recovery Epitaxial Diode (FRED) : Dsek 60 I 2x 30 A V 600 V T 35 Ns
2 pages
L1000A TM EN TOEP C710616 134G 6 0 Addendum
No ratings yet
L1000A TM EN TOEP C710616 134G 6 0 Addendum
106 pages

Eitan Zahavi Isaac Keslassy Avinoam Kolodny

Uploaded by

Eitan Zahavi Isaac Keslassy Avinoam Kolodny

Uploaded by

DISTRIBUTED ADAPTIVE ROUTING

FOR BIG-DATA APPLICATIONS

Mellanox Technologies LTD,

Big Data Larger Flows

Data-set sizes keep rising

Web2 and Cloud Big-Data applications

Data Center Traffic changes to:

Longer, Higher BW and Fewer Flows

Static Routing of Big-Data = Low BW

Static Routing cannot balance a small number of

On lossy network: packet drop BW drop

Traffic Aware Load Balancing Systems

Adaptive Routing adjusts routing to

Flows are routed according to

Each flow is routed by its input

Central vs. Distributed Adaptive Routing

Local (to keep

Can a Scalable Distributed Adaptive

Trial and Error

Randomize output port Trial 1

Send the traffic

Randomize new output port Trial 2

Randomize new output port Trial 3

Send the traffic

Routing Trials Cause BW Loss

Out-of-Order Packets delivery!

Implications are significant drop in flow BW

TCP* sees out-of-order as packet-drop and throttle the senders

* Or any other reliable transport

Analyze Distributed Adaptive Routing systems

A Simple Policy for Selecting a Flow to ReRoute

At each time step

re-route of a single worst

At t=0 New traffic pattern is applied

Measure average number of iterations I to

A Balls and Bins

Each output switch is a balls and bins system

A good bin has 1 ball

Two reasons of ball moves

Balls are numbered by their input switch number

The Last Step Governs Convergence

Estimated Markov chain models

Assume a symmetrical system: flows have

p has Great Impact on Convergence

Measure average number of iterations I to

Implementable Distributed System

Replace congestion detection by flow-count with

52% Load on 1152 nodes Fat-Tree

No change in number of adaptations

48% Load on 1152 nodes Fat-Tree

Switch Routing Adaptations/ 10usec

Study: Distributed Adaptive Routing of Big-Data flows

Focus on: Time to convergence to non-blocking routing

Learning: The cause for the slow convergence

Corollary: Half link BW flows converge in few iterations

Distributed Adaptive Routing of Half Link_BW

You might also like