Protection
Protection
Protection
Switching
Yaakov (J) Stein
Chief Scientist
RAD Data Communications
April 2009
Course Outline
• General protection switching principles
• SONET/SDH
• Ethernet linear protection
• Ethernet ring protection
• MPLS fast reroute
Definition
References
Traffic types
Network topologies
Triggers
Protection classes
Entities
Protection types
Signaling
APS includes :
detection of failures (signal fail or signal degrade) on a working channel
switching traffic transmission to a protection channel
selecting traffic reception from the protection channel
(optionally) reverting back to the working channel once failure is repaired
dense meshes
– for this topology multiple local bypasses can be preconfigured
– protection switching is similar to routing change, but faster
often called “fast reroute” (FRR)
Failures are Signal Fail (SF) or Signal Degrade (SD) (of various types)
and may be :
detected by physical layer
indicated by signaling (e.g. AIS)
detected by OAM mechanisms
working channel
protection channel
head-end tail-end
working channel
head-end tail-end
protection channel
(bridge) (selector)
signaling channel
unidirectional
unidirectional protection working channel
failure
protection channel in use
working channel
protection channel
bidirectional
bidirectional protection working channel
failure
protection channel in use
working channel
channel A
channel B
When not in use, protection channel can be used for extra traffic
working channel
extra traffic
protection channel
APS signaling
Y(J)S APS Slide 19
1:n protection
One protection channel is allocated for n working channels
Only can protect one working channel at a time
but improbable that more than 1 working channel will simultaneously fail
Only 1/(n+1) of total capacity is reserved for protection
working channels
protection channel
working channels
protection channels
Y(J)S APS Slide 21
n
(1:1)
(1:1) protection
This is like n times 1:1 but the n protection channels share bandwidth
Only 1 failed working channel can be protected
This is different from 1:n since
n protection channels are preconfigured
n working channels need not be of the same type
For example
in 1:n protection the protection channel may already be in use !
True failure conditions usually have higher priority than manual commands
unprotected
trail protected trail
working channel
protection channel
head-end NE tail-end NE
path
line line (MS section) line
section section section section
Transport
Overhead
TOH
TOH consists of
3 rows of section overhead - frame sync, trace, EOC, …
6 rows of line overhead - pointers, SSM, FEBE, and
Line APS signaling - bytes K1 and K2
POH is responsible for type, status, path performance monitoring, VCAT, trace
HO Path APS signaling is 4 MSBs of byte K3
V5
V1
VC OH is responsible for
J2
Timing, PM, REI, …
V2
LO Path APS signaling is N2
4 MSBs of byte K4
V3
K4
V4
VC OH
Y(J)S APS Slide 37
How does it work?
working channel
protection channel
head-end NE tail-end NE
Y(J)S APS Slide 39
Linear 1:1 protection
Head-end bridge usually sends data on working channel
When tail-end detects failure it signals (using K1) to head-end
Head-end then starts sending data over protection channel
May be at any layer (but only OC-n level protects against fiber cuts)
working channel
extra traffic
protection channel
In order to save BW
we allocate 1 protection channel for every N working channels
N limited to 14
4 bits in K1 byte from tail-end to head-end
– 0 protection channel
– 1-14 working channels
– 15 extra traffic channel
working channels
protection channel
A-B B A-B B
B-C
B-A
A A
C-B
B-A C
Two-fiber version
half of OC-N capacity devoted to protection
only half capacity available for traffic
Four-fiber version
full redundant OC-N devoted to protection
twice as many NEs as compared to two-fiber
Example
recovery from unidirectional fiber cut
Y(J)S APS Slide 46
Ethernet linear APS
STP
LAG
G.8031
END=0
(1B) – regular APS messages are sent 1 per 5 seconds
– after change 3 messages are sent at max rate (300 per sec)
where
req/state identifies the message (NR, SF, WTR, SD, forced switch, etc)
prot. type identifies the protection type (1+1, 1:1, uni/bidirectional, etc.)
requested and bridged signal identify incoming / outgoing traffic
since only 1+1 and 1:1 they are either null or traffic (all other values reserved)
Y(J)S APS Slide 51
G.8031 1:1 revertive operation
In the normal (NR) state :
head-end and tail-end exchange CCM (at 300 per second rate)
on both working and protection channels
head-end and tail-end exchange NR APS messages
on the protection channel (every 5 seconds)
When a failure appears in the working channel
tail-end stops receiving 3 CCM messages on working channel
tail-end enters SF state
tail-end sends 3 SF messages at 300 per second on the APS channel
tail-end switches selector (bi-d and bridge) to the protection channel
head-end (receiving SF) switches bridge (bi-d and selector) to protection channel
tail-end continues sending SF messages every 5 seconds
head-end sends NR messages but with bridged=normal
When the failure is cleared
tail-end leaves SF state and enters WTR state (typically 5 minutes, 5..12 min)
tail-end sends WTR message to head-end (in nonrevertive - DNR message)
tail-end sends WTR every 5 seconds
when WTR expires both sides enter NR state
G.8032
RPR
CLEER
developed by 802.17 WG
based on Cisco’s Spatial Reuse Protocol (RFC 2892)
ringlet selection
A
C
B
placed into 1 of 2 buffers
according to service class
sent according to fairness
PTQ
STQ
A0 RT reserved low No
A1 RT allocated, low No
reclaimable
Notes:
class A have minimal delay
class B have higher priority than STQ transit frames, so bounded delay/FDV
classes B and C share STQ, so once in ring have similar delay
steering info
wrap
Y(J)S APS Slide 63
NERT and CLEER
New Ethernet Ring Technology / Closed Loop Encapsulated Ethernet Ring
Similar to RPR but uses real Ethernet format
NERT and CLEER distinguish between
ring nodes
switches connected to ring nodes
switches
Y(J)S APS Slide 64
MPLS fast reroute
IP FRR
RFC 4090
One-to-one backup
each LSP protected separately
detour LSP created for each LSP at each potential PLR
no labels pushed PLR MP
Facility backup
backup tunnel for multiple LSPs
bypass tunnel created at each potential PLR
uses label stacking
PLR MP