ZTE UMTS RAN Equipment Redundancy Feature Guide
ZTE UMTS RAN Equipment Redundancy Feature Guide
ZTE UMTS RAN Equipment Redundancy Feature Guide
Feature Guide
Date
2010-10-15
Author
LouDeGang
Approved By
YuFeng
Remarks
Not open to the Third Party
TABLE OF CONTENTS
1
2
2.1
Overview .............................................................................................................. 1
RAN Equipment Redundancy............................................................................... 1
3
3.1
3.1.1
3.1.2
3.1.3
3.1.4
3.1.5
3.2
3.2.1
3.2.2
3.2.3
3.2.4
3.3
3.3.1
3.3.2
3.3.3
3.3.4
Technical Description......................................................................................... 1
Board Redundancy Protection of RAN ................................................................. 2
Brief Introduction to Board Redundancy Protection ............................................. 2
General Process of Board Switching.................................................................... 3
Board Fault Detection ........................................................................................... 7
Redundancy Protection Solution for RNC Boards ............................................... 8
Redundancy Protection Solution for Node B Boards ......................................... 28
Port Redundancy Protection............................................................................... 31
Backup of Ethernet Port...................................................................................... 31
Backup of E1/T1 Port .......................................................................................... 33
Backup of SDH Port ............................................................................................ 33
Backup of Level-1 Switch Optical Port ............................................................... 33
Redundancy Protection of Communication Link ................................................ 36
ATM Link Backup ................................................................................................ 36
IP Link Backup .................................................................................................... 36
SCTP Link Backup .............................................................................................. 36
Data Division Transmission Backup ................................................................... 36
4
4.1
4.1.1
4.1.2
4.1.3
4.2
4.2.1
4.2.2
4.2.3
5
5.1
5.2
Glossary ............................................................................................................. 42
II
FIGURES
Figure 3-1 Principle of Hybrid 1+1 Backup.............................................................................. 3
Figure 3-2 Active/Standby Switching Process......................................................................... 6
Figure 3-3 Active/Standby Interlock Circuit ............................................................................. 7
Figure 3-4 Connection of Major Boards in the RAN ................................................................ 9
Figure 3-5 Before Switching .................................................................................................. 10
Figure 3-6 After Switching of Optical Port ............................................................................. 11
Figure 3-7 After Switching Because of the Fault of Network Port of Internal Media Plane .. 11
Figure 3-8 After Switching Because the Active Board is Offline ........................................... 12
Figure 3-9 DTA Before Switching .......................................................................................... 12
Figure 3-10 DTA After Switching ........................................................................................... 13
Figure 3-11 DTA Hybrid 1+1 Backup Signal Flow................................................................. 13
Figure 3-12 DTI Before Switching ......................................................................................... 15
Figure 3-13 DTI After Switching............................................................................................. 16
Figure 3-14 DTA Hybrid 1+1 Backup Signal Flow................................................................. 16
Figure 3-15 Before GIPI4 Switching ...................................................................................... 17
Figure 3-16 Before EIPI Switching ........................................................................................ 18
Figure 3-17 SDTA2(1:1backup)before switching .................................................................. 23
Figure 3-18 SDTA2(1:1backup)After Switching of Optical Port Abnormal ........................... 23
Figure 3-19 SDTA2(1:1backup)After switchover caused by internal media plane Ethernet
port fault .................................................................................................................................... 23
Figure 3-20 SDTA2(1+1 Backup) Before switching of Internal Media Plane Ethernet Port
Failure ....................................................................................................................................... 24
Figure 3-21 SDTA2(1+1backup)After switching of Internal Media Plane Ethernet Port
Failure ....................................................................................................................................... 24
Figure 3-22 SDTA2 After switching of active board off-line .................................................. 24
Figure 3-23 SDTI before switching of optical port ................................................................. 25
Figure 3-24 SDTI After switching of optical port.................................................................... 26
Figure 3-25 SDTI(Hybrid 1+1backup)Before switching of internal media plane Ethernet Port
failure ........................................................................................................................................ 26
Figure 3-26 SDTI(Hybrid 1+1backup)After switching of Internal Media Plane Ethernet Port
Failure ....................................................................................................................................... 26
Figure 3-27 After switching of active board is offline............................................................. 27
Figure 3-28 Typical Configuration of B8200 .......................................................................... 28
III
TABLES
Table 3-1 Backup Modes of RNC Boards ............................................................................... 8
Table 3-2 describes the backup mode of each board in the B8200. .................................... 28
Table 3-3 Backup Modes of B8200 Boards........................................................................... 28
Table 3-4 FE Routing Ports and Trunk Ports Supported by Various Boards ....................... 31
Table 3-5 GE Ports Supported by the GUIM/GLI .................................................................. 34
IV
Functional Attribute
System version: [RNC V3.09, Node B V4.09, OMMR V3.09, OMMB V4. 09]
Attribute: [Optional]
Involved NEs:
UE
-
Node B
RNC
MSCS
-
MGW
-
SGSN
-
GGSN
-
HLR
-
Note:
*-: The NE is not involved.
*: The NE is involved.
Dependency: [None]
Mutual-exclusion function: [None]
Remarks: [None].
Overview
2.1
Technical Description
There are three types of redundancy protections in the RA N, that is, board redundancy
protection, port redundancy prot ection, and communication link redundancy protection.
3.1
3.1.1
The comparison bet ween the 1+1 mode and 1:1 mode is as follows:
Common feature
The two mutual-backup boards share one logical address (compared wit h physical
address, a logical address indicates a logical node for service. When function is
implemented, the active/standby board configuration is not concerned) and one
suite of bearer resources (for example, the circuit resource an d internal media
stream resource)
Difference
1+1 backup: Only one board is running in active status at a time.
1:1 backup: Res ourc es and interfaces are divided into groups, and protection
switching is implemented in each group. The two mutual-backup boards work
cooperatively. The groups in a board can be in different working states. However,
make sure that if one group in a board is active, it must be standby in the other
board For example, in one board, Group A is active and Group B is standby. Then,
in the other board, Group A is standby and Group B is active. As a result, the two
boards may both have active resources at the same time and can process the
access service. For the SDTA 2, SDTI and APBE boards, the corresponding optical
ports in the active board and standby board compos e an APS group. The
active/standby state of a board is independent from the active/standby state of the
external optical port of the board.
Figure 3-1
Standby Board
Data stream
A
Cross
board
Channe
l
Access
Data stream B
Active Board
3.1.2
takes over its work, thus preventing the service functions from being affected for a long
time.
In addition, the system also provides some means to adjust the active board manually.
To verify whet her the current standby board can work normally, manual switching can be
initiated to switch the standby board to the active state.
The RAN system provides the basic common switching reasons for the boards in 1+1
backup mode and in 1:1 backup mode. For different boards, the RAN provides different
switching reasons for different devices and functions on the boards. The following
section describes the general switching reasons and switching process.
3.1.2.1
Man-machine Command
The system allows user to initiate a switching to the specified board from the OMC
by command.
Key-press switching
Each board supports the switching that is initiated through manual intervention from
a panel key-press. Press E XCH key in the panel of the active board, and then user
can initiate the active/standby switching.
Removal of board
Each board is configured with an unplugging switch. Before a b oard is removed,
the system initiates the active/standby switching through an interrupt notice. The
standby board takes over the work first before the active board is out of position.
Meanwhile, the system reports the switch-on alarm and lights up. During the
switching, the E NUM indicator flashes quickly at a frequency of 5 Hz. Upon
completion of the switching, the ENUM indicator flashes at a frequency of 1 Hz.
Before the ENUM indicat or flashes at a frequency of 1 Hz, do not remove the board.
Otherwise, the system triggers the abnormal switching process due to the reset of
the active board, as described in 3).
Besides the common reasons described above, other reasons, such as operation faults
of board, may trigger the active/standby switching.
3.1.2.2
3.1.2.3
Switching Flow
When both the active and standby boards work normally, every functional module in the
active board synchronizes the key information to the standby board at real time.
A normal switching process comprises three phases including preliminary switching,
standby-to-active (SA) switching, and active-to-standby AS) switching, thus switching the
board process and communication link, as shown in Figure 3-2.
Figure 3-2
Active board
Starting switch
Standby board
Pre
Sync data
Begin
communication
link switch
SA
AS
End
Hardware
communication
active/standby
link switch
switch
back
After receiving a switching request, a board first makes necessary judgment, for
example, whether the current board is an active board, and whether the standby board
is in position. If the s witching conditions are met, the active board sends a preliminary
switching command. The standby board compares the level of the switching barring
reason with the level of the switching reason. If the level of the switching barring reason
is higher, the system bars the active/standby switching. Ot herwise, the system
implements the preliminary switching operation, that is, instruct the active board and
standby board of the registered process to perform necessary active/standby data
synchronization. In this phas e, the active board is always in active state and can
implement normal service functions. After the preliminary switching is complete, the
system implements the active/standby switching bet ween the active process and
standby process. The active process and standby proc ess are in different boards, so the
ongoing services are affected transiently. To minimize the impact, the system
implements link switching upon completion of the preliminary switching, that is, instruct
another service board to cache messages and send the cached messages to the new
active board upon completion of the active/standby switching.
In the SA(Slave to Active) phase and AS(Active to Slave) phase, the system instructs
the registered processes to implement the active/standby switching. The detailed
switching is implemented in each process respectively .
After the SA switching and AS switching are complete, the system switches to the
original link and sends the messages cached by another servic e board to the new active
board.
All normal active/standby switching processes follows the above flow. For the a bnormal
switching due t o reset of the active board, the system cannot follow the above switching
flow becaus e the active board is not in position. After receiving the interrupt message
because the active board is reset or out of position, the original stan dby board
immediat ely implements the SA switching and link switching and then takes over the
services of the original active board.
3.1.3
3.1.3.1
Figure 3-3
A A
Board
BB
Board
MASCT
MASCT
O_MS-VIE O_MS-VIE
I_MS-VIE
I_MS-VIE
MASCT (reverses the phase of output signals or MAS CT inside the logic) is the
active/standby competition signal controlled by software, and is used for the
active/standby competition bet ween boards. I_MS -VIE and O_MS -VIE are the interlock
signals between boards. If the actual level of MASCT is a high level, it indic ates that the
board is competing for the active state. If the actual level of MAS CT is a low level, it
indicates that the board does not compete for the active state. If the actual level of
O_MS-V IE (I_MS-V IE signal of the peer board) is a low level, it indicates that the board
is in active state. Otherwise, it indicates that the board is in standby state.
If the active board detects that the standby board is in position, the active board
synchronizes its data to the standby board through the primary channel. If the standby
board detects that the active board is offline, the standby board is switched to an active
board.
3.1.3.2
3.1.3.3
3.1.4
Table 3-1
Functional
Board
Backup Mode
Remarks
APBE
1:1 backup
When the active board is
switched to a standby
board, the original active
board is not reset.
DTA
None
Functional
Board
Backup Mode
Remarks
backup
DTI
None
SDTA2
None
SDTI
None
GIP I4
None
GLI
Load sharing
None
GUIM
1+1 backup
None
ICM
1+1 backup
None
THUB
1+1 backup
None
PSN
Load sharing
None
PWRD
No backup
None
RCB
1+1 backup
None
ROMB
1+1 backup
None
RUB
Load sharing
None
SBCX
1+1 backup
None
UIMC
1+1 backup
None
Figure 3-4 shows the connections of the main boards in the table.
Figure 3-4
PSN
UIMC
GPS
Antenna
UIMC
ROMB
RCB
GLI
GLI
CHUB
RCB
Resource Shelf BGSN
GUIM
GUIM
Control Plane
User Plane
APBE
STM-1
GIPI
IMAB
DTB
E1
GE
Iu/Iur/Iub Port
CircuitHW
EIPI
RUB
GIPI
FE
OMC-B
FE
OMC-R
Interface Unit
Switch Unit
Processing Unit
3.1.4.1
Figure 3-5 shows the switching triggered by the fault of the active optical port. The
colored lines indicate the connection that data transits currently. The colored blocks
indicate the processing units that data transits currently. Before switching:
Figure 3-5
Before S witching
Active Board
Control
Plane
Processing
Unit
Workig Optical
Port
Media
Plane
Processing
Unit
Media
Plane
Processing
Unit
Protection Optical
Port
Through the APS mechanism for optical ports, the system initiates the switching after
detecting that the frame of the active optical port is lost. Meanwhile, the media plane
processing units of the faulty APBE are switched to the APBE where the protected
optical port is located. After switching:
10
Figure 3-6
Control
Plane
Processing
Unit
Working
Optical Port
Media
Plane
Processing
Unit
Standby Board
Control
Plane
Processing
Unit
Media
Plane
Processing
Unit
Protection Optical
Port
After the active board detects that the communication to the internal media plane is
abnormal (for details, see Section 3.1.3.2Communication Check for the Internal Media
Plane), the active board is switched over. After switching:
Figure 3-7
Plane
After S witching Because of the Fault of Network Port of Int ernal Media
Active Board
Control
Plane
Processing
Unit
Workin Optical
Port
Media
Plane
Processing
Unit
Standby Board
Control
Plane
Processing
Unit
Media
Plane
Processing
Unit
Protection Optical
Port
When the active board is reset or is removed, the standby board detects that the active
board is offline through the interlock circuit (for details, see Section 3.1.3.1Part ner
Board In-Position Det ection). Then, the standby board is switched to the active board,
and completely functions as the active board. After switching:
11
Figure 3-8
Active Board
Control
Plane
Processing
Unit
Working
Optical Port
3.1.4.2
Media
Plane
Processing
Unit
Standby Board
Control
Plane
Processing
Unit
Media
Plane
Processing
Unit
Protection Optical
Port
Figure 3-9
Active Board
Control
Plane
Processi
ng Unit
External
DT
access
Media
Plane
Processi
ng Unit
Internal Control
Plane Ethernet
Port
Internal Control
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Cross Board
HW
DT
Access
Standby Board
Control
Plane
Processin
g Unit
Media
Plane
Processin
g Unit
DT
Access
12
Extern
al DT
access
Figure 3-10
Active Board
Control
Plane
Processi
ng Unit
External
DT
access
Media
Plane
Processi
ng Unit
Internal Control
Plane Ethernet
Port
Internal Control
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Standby Board
Cross Board
HW
DT
Access
Control
Plane
Processin
g Unit
Media
Plane
Processin
g Unit
DT
Access
Extern
al DT
access
When active board detecting abnormal communication of internal media plane, its will
initiate switching process. Detection mechanism refer to 3.1. 3.2.
When det ecting active board offline, standby board will switch over to active board and
take over all of active boards function.
Principle of DTA hybrid 1+1 backup is shown below:
E1/T1 signal input through DTA interfac e is processed into internal HW and converged
into active board through TDM switching chip. TDM signal is send to IMA processing
chip which supports IMA group. EMA link messages from multiple E 1 belong to one
logical path, IMA chip will make load sharing on active IMA links. At the back plane,
message of IMA group link connects to NP through SPI.
Figure 3-11
32 E1/T1
access
8*8MHW
Access processing
m*n MHW
TDM Swith
Link
processing
Load sharing
processing
SPI
Message
processing
To
Internal
Switch
Network
232MHW
32 E1/T1
access
8*8MHW
Access processing
m*n MHW
TDM Switch
Link
processing
Load sharing
processing
SPI
Message
Processing
To
Internal
Switch
Network
Totally there are 64 E 1/T1 in the unit, the first 32 E1/ T1 connect to the left board in order,
the second 32 E1/ T1 connect to the right board in order. When configuring link
group(IMA link group, MP), equably distribute the PPP/IMA links to the right and left
board in order to get the best protection.
13
Physically the 2 *32 M HW is multiplied on one pair of LVDS (Low Voltage Differentiate
Signal). This LVDS only exists in BGSN frame, so this backup scheme can only be
applied to BGSN.
The connectivity logic of left and right board is same, but the connectivity result is
different due to different mapping relationships. For example: Suppose link group C is
composed of link group A and B, A is configured at No.1 E1 (first port of left board), B is
configured at No. 33 E1 (first port of right board). When left board is active, it connects
No.1 E1 to A, and the first E1 on right board will connect to B through HW. When right
board is active, it connects its No.1 E1 to B, and left board will connect its No.1 E1 to A
through HW.
3.1.4.3
14
3.1.4.4
Figure 3-12
Active Board
Control
Plane
Processi
ng Unit
External
DT
access
Media
Plane
Processi
ng Unit
DT
Access
Internal Control
Plane Ethernet
Port
Internal Control
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Cross Board
HW
Standby Board
Control
Plane
Processin
g Unit
Media
Plane
Processin
g Unit
DT
Access
Extern
al DT
access
15
Figure 3-13
Active Board
Control
Plane
Processi
ng Unit
External
DT
access
Media
Plane
Processi
ng Unit
Internal Control
Plane Ethernet
Port
Internal Control
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Standby Board
Control
Plane
Processin
g Unit
Media
Plane
Processin
g Unit
Cross Board
HW
DT
Access
Extern
al DT
access
DT
Access
When active board detecting abnormal communication of internal media plane, its will
initiate switching process. Detection mechanism refer to 3.1. 3.2.
When det ecting active board offline, standby board will switch over to active board and
take over all of active boards function.
Principle of DTI hybrid 1+ 1 backup is shown below:
E1/T1 signal input through DTI interfac e is processed into internal HW and c onverged
into active board through TDM s witching chip. TDM signal is send to NP (Network
Processing) after framing processing of HDLC. According to configuration, one MP (Multi
PPP) link may be formed by multiple PPP links. Under hybrid 1+1 backup, those P PP
will be distributed at active and standby board. NP only needs to process the MP links
and doesnt need to care about the PPP itself which form the MP and where is the
bottom signal of the PPP.
Figure 3-14
32 E1/T1
access
8*8MHW
Access processing
m*n MHW
TDM Swith
Link
processing
Load sharing
processing
SPI
Message
processing
To
Internal
Switch
Network
232MHW
32 E1/T1
access
8*8MHW
Access processing
m*n MHW
TDM Switch
Link
processing
Load sharing
processing
SPI
Message
Processing
To
Internal
Switch
Network
Totally there are 64 E 1/T1 in the unit, the first 32 E1/ T1 connect to the left board in order,
the second 32 E1/ T1 connect to the right board in order. When configuring link
16
group(IMA link group, MP), equably distribute the PPP/IMA links to the right and left
board in order to get the best protection.
Physically the 2 *32 M HW is multiplied on one pair of LVDS (Low Voltage Differentiate
Signal). This LVDS only exists in BGSN frame, so this backup scheme can only be
applied to BGSN.
The connectivity logic of left and right board is same, but the connectivity result is
different due to different mapping relationships. For example: Suppose link group C is
composed of link group A and B, A is configured at No.1 E1 (first port of left board), B is
configured at No. 33 E1 (first port of right board). When left board is active, it connects
No.1 E1 to A, and the first E1 on right board will connect to B through HW. When right
board is active, it connects its No.1 E1 to B, and left board will connect its No.1 E1 to A
through HW.
3.1.4.5
Within 2 seconds, the external Ethernet port receives more than 128 bad frames.
Figure 3-15 shows the GIP I4 switching process. The colored lines indicate the
connection that data transits currently. The colored blocks indicate the processing units
that data transits currently. Before switching:
Figure 3-15
Active Board
Control
Plane
Processing
Unit
Outer Ethernet
Port
Media
Plane
Processing
Unit
Media
Plane
Processing
Unit
Outer Ethernet
Port
Figure 3-16 shows the result after the switching triggered by any reason:
17
Figure 3-16
Control
Plane
Processing
Unit
Outer Ethernet
Port
Media
Plane
Processing
Unit
Media
Plane
Processing
Unit
Outer Ethernet
Port
The external Ethernet faults are judged according t o standard Ethernet circuit signals. In
case frames are lost, the system determines that the Ethernet is faulty and thus triggers
the switching process.
Bad frame detection is implemented according to the check result of the Ethernet frames
received by the bottom-layer chip. If more than 128 bad Ethernet frames are received
within 2 seconds, the system determines that the Ethernet works abnormally and thus
triggers the switching process.
After the active board detects that the communication of the internal media plane is
abnormal, the system initiates the s witching. For details on t he check mechanism, see
Section 3.1.3.2Communication Check for the Internal Media Plane .
When the active board is reset or is removed, the standby board detects that the active
board is offline through the interlock circuit (for details, see Section 3.1.3.1Part ner
Board In-Position Det ection"). Then, the standby board is switched to the active board,
and completely functions as the active board.
3.1.4.6
18
In addition, the GLI is interconnected to the PSN through the HSSL. The switching chip
in the FLI can identify the status of the HSSL. If the HSSL is interrupted, no packets are
sent from the HSSL to the PS N for switching.
3.1.4.7
3.1.4.8
Redundancy protection for the control plane link, including board protection and port
protection. For details, see Section 3.1.2General Process of Board Switching and
Section 3.2.1B ackup of Ethernet Port.
Media plane link protection: The media plane of the GUIM forwards the
active/standby state of the GE ports together with the active/standby state of boards.
The system checks a link by receiving and transmitting the link check packet. If
detecting that the link of the optical port in the active board is faulty, the system
switches over the optical module or the board. For details, see Section
3.2.4Backup of Level-1 Switch Optical Port.
For the net work interface of the internal control plane, the system periodically
queries the receiving/transmitting statistics of the network interface of the control
plane. If detecting that the network interface of the control plane is abnormal, the
system switches over the board and thus the original standby board takes over the
work.
For the network interface of the internal media plane, the board in the shelf wit h the
media plane communication capability checks whether the link is normal through
packet interaction. If a faulty board cannot heal itself, the board wit h the media
plane communication capability sends a message to instruct the GUIM to switch
over the faulty board, thus removing the fault of the net work interface in the active
GUIM.
The GUIM periodically checks the status of the HW communication links used in the
shelf. If all HW links are faulty, the system switches over the GUIM.
19
for the recovery time. Therefore, the ICM is automatically switched over in hardware,
and software switching follows hardware switching. After software receives the interrupt
caused by hardware s witching, the switching flow is the same as described in Section
3.1.2General Process of Board Switching. The ICM mainly provides the following
protection mechanisms:
The ICM checks the GPS module periodically. In cas e the active ICM is abnormal, the
system initiates the switching process. In case the standby ICM is abnormal, the system
sets switching barring.
The hardware in the ICM c hecks the status of the active and standby boards and
compares hardware signals. If the active board is abnormal but the standby board is
normal, the hardware automatically initiates hardware switching. After the hardware
switching is complete, the interrupt informs the software that the hardware switching is
complete. A fter receiving the interrupt, the software implements software switching, thus
keeping the active/standby consistency between the soft ware and hardware.
The ICM supports the input of multiple clock benchmarks, thus providing redundancy
protection bet ween clock benchmarks. The ICM checks the validity of each access clock
benchmark. If det ecting that the clock benchmark is invalid, the ICM gives an alarm and
switches over the clock benchmark as configured.
3.1.4.9
3.1.4.10
20
the two RCB boards compete for the working state. Hardware interlocking is configured
between t he active RCB and standby RCB (for details, see Section 3.1.3.1Part ner
Board In-P osition Detection"), thus avoiding the dual -active or dual -standby inclination.
After the active/standby competition is complete, the active and standby RCB boards are
up in different capacities respectively. The active RCB needs to read t he configuration
data and receives the requests of other boards. The standby RCB actively interacts with
the active RCB, so as to ensure the consistency of software versions and static data. If
the data of the active RCB is changed during the normal operation, the system
synchronizes the changing data to the standby RCB through regular synchronization or
real-time synchronization. During the normal operation, the standby RCB receives the
synchronized data of the active RCB and keeps the consistency of static data between
the active RCB and active RCB, thus taking over the work of t he active RCB smoot hly in
real time.
In case the active RCB fails (for details on the reason, see S ection 3.1.2.1Reason for
Board Switching), the standby RCB is automatically switched int o an active RCB, its
logical address becomes the logical address of the active RCB, and the bottom -layer
communication link is also switched, thus ensuring normal communication bet ween the
new active RCB and other boards. The original active RCB is automatically reset. After
the rest, the original active RCB becomes a standby RCB.
3.1.4.11
3.1.4.12
21
board is available, the RUB board is selected for service set up. Ot herwise, a RUB board
is selected from another alt ernative shelf.
After the RCP selects a RUB board, a DSP will be selected in the RUB according to a
certain strategy. After the DSP is selected, the RUB board notifies the DSP number to
the RCP.
When a DSP in a RUB board is faulty, the status of the DSP is Unavailable. After
receiving the Unavailable notice, the RCP releases the DSP-related service, thus
avoiding resource deadlock. When a subsequent service is set up, the faulty DSP in the
RUB board is not selected for service setup until the DSP becomes normal again. If all
DSPs in the RUB board are faulty or the CP U of the RUB board is faulty, the RCP does
not select the RUB board for setting up a new service.
3.1.4.13
22
Figure 3-17
Active board
Control
Plane
Processing
unit
Working Port
Media
Plane
Processing
unit
Standby Board
Internal Control
Plane Ethernet Port
Internal Control
Plane Ethernet Port
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Protection Port
For 1: 1 board backup mode, when det ecting active optical port abnormal, active board
switching over will be triggered. Status after switching over is shown below:
Figure 3-18
Active board
Control
Plane
Processin
g unit
Working Port
Media
Plane
Processin
g unit
Internal Control
Plane Ethernet
Port
Internal Control
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Standby Board
Main Data
Path
Control
Plane
Processin
g unit
Media
Plane
Processin
g unit
Protection
Port
In this case, when S DTA det ecting abnormal communication to internal media plane, it
will trigger switchover. Diagram aft er switchover is shown below:
Figure 3-19 SDTA 2(1: 1backup)A fter switchover caused by internal media plane
Ethernet port fault
Active Board
Control
Plane
Processing
unit
Working port
Media
Plane
Processing
Unit
Internal control
plane Ethernet port
Internal control
Plane Ethernet port
Standby Board
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Protection Port
23
Figure 3-20 SDTA 2(1+ 1 Backup) Before switching of Internal Media Plane Ethernet
Port Failure
Active board
Standby Board
Control
Plane
Processing
unit
Working Port
Media
Plane
Processing
unit
Internal Control
Plane Ethernet Port
Internal Control
Plane Ethernet Port
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Working Port
For 1+ 1 hybrid backup, when active board det ecting internal media plane
communication abnormal (refer to 3.1.3. 2), switching will be triggered. After switching
status is shown below:
Figure 3-21
Failure
SDTA 2(1+ 1backup)A fter switching of Int ernal Media Plane Ethernet Port
Active board
Standby Board
Control
Plane
Processing
unit
Working Port
Media
Plane
Processing
unit
Internal Control
Plane Ethernet Port
Internal Control
Plane Ethernet Port
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Working Port
When active board resets or it is removed, standby board will detect active board offline(refer to 3.1.3.1) through inter-lock circuit and switch to active board to take over all
active boards function.
Figure 3-22
Working port
Internal control
plane Ethernet port
Internal control
Plane Ethernet port
Standby Board
Media
Plane
Processing
Unit
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Working Port
24
3.1.4.14
Figure 3-23
Active board
Control
Plane
Processing
unit
Working Port
Media
Plane
Processing
unit
Standby Board
Internal Control
Plane Ethernet Port
Internal Control
Plane Ethernet Port
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Protection Port
When detecting frame loss on optical port through APS protection mechanism, it will
initiate optical port switching and all media plane processing units will switch to standby
board. After switching:
25
Figure 3-24
Active board
Control
Plane
Processin
g unit
Working Port
Media
Plane
Processin
g unit
Internal Control
Plane Ethernet
Port
Internal Control
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Internal Media
Plane Ethernet
Port
Standby Board
Control
Plane
Processin
g unit
Media
Plane
Processin
g unit
Main Data
Path
Protection
Port
Active board
Control
Plane
Processing
unit
Working Port
Media
Plane
Processing
unit
Standby Board
Internal Control
Plane Ethernet Port
Internal Control
Plane Ethernet Port
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Working Port
For 1+ 1 hybrid backup, when active board det ecting internal media plane
communication abnormal (refer to 3.1.3. 2), switching will be triggered. After switching
status is shown below:
Active board
Control
Plane
Processing
unit
Working Port
Media
Plane
Processing
unit
Standby Board
Internal Control
Plane Ethernet Port
Internal Control
Plane Ethernet Port
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Working Port
26
When active board resets or it is removed, standby board will detect active board offline(refer to 3.1.3.1) through inter-lock circuit and switch to active board to take over all
active boards function. Status after switching is showing below:
Figure 3-27
Active board
Standby Board
Control
Plane
Processing
unit
Working Port
3.1.4.15
Media
Plane
Processing
unit
Internal Control
Plane Ethernet Port
Internal Control
Plane Ethernet Port
Control
Plane
Processing
unit
Media
Plane
Processing
unit
Working Port
Redundancy protection for the control plane link between shelves, including board
protection and port protection.
For the net work interface of the internal control plane, the system periodically
queries the receiving/transmitting statistics of the network interface of the control
plane. If detecting that the network interface of the control plane is abnormal, the
system switches over the board and thus the original standby board takes over the
work.
For details, see Section 3.1.2General Process of B oard S witching and Section
3.2.1Backup of Ethernet Port.
3.1.4.16
27
3.1.5
PM
FS
FAM
SA
CC
Table 3-2 describes the backup mode of each board in the B8200.
Table 3-2
Functional Board
Backup Mode
CC
1+1 backup
FS
None
PM
1+1 backup
SA
None
BPC
Resource pool
Remarks
28
3.1.5.1
CC Troubleshooting Process
Primary control
The active and standby CC boards work in the 1+1 backup mode. The active and
standby CC boards have completely the same hardware, run the same software,
and implement NBAP processing and the processin g of the IP/ATM protoc ol stack
between the control plane and user plane. At a time, only one CC board is active.
The active CC board carries services, including access and processing. The
standby CC board does not carry any service, but receives the synchr onized data
of the active CC board and keeps the same running conditions as the active CC
board.
There exists an independent communication channel between the active CC board
and standby CC board. The active and standby CC boards run the same software,
but are in different modes. During the initial power-on operation, the two CC boards
compete for the active/standby state. Upon completion of the competition, the
standby CC board actively interacts with the active CC board, so as to ensure the
consistency of soft ware versions and static data. During the normal operation, the
active CC board synchronizes the changing dat a (for example, database dat a,
control information of the control plane and user plane of the current access,
software parameters, and software versions) to t he standby CC board through
regular synchronization or real-time synchronization. During the normal operation,
the standby CC board rec eives the synchroniz ed data of the active CC board and
keeps the consistency of static data between the active CC board and active CC
board, thus taking over the work of the active CC board smoothly in real time.
The system can initiate the active/standby switching in the OMC. Alternatively, the
active/standby switching is triggered by a severe fault of the active CC board. In
this case, the current active CC board switches the current hardware and software
to the standby state, and only processes the synchronized data of t he new active
CC board without processing any service. The original standby CC board swi tches
its software and hardware to the active state and takes over the work of the original
active CC board by using the synchronized static data and real-time operation dat a,
thus minimizing the impact of the fault of the original CC board on the entire s ystem.
Clock
In cas e the active CC board is faulty, the active CC board is automatically to the
standby CC board.
Clock extraction redundancy design for multiple E1/ T1 links: In case an E1/ T1 link
is faulty, the system can automatically switch the faulty E 1/T1 link to another
normal one for clock extraction.
Configuration of multiple clock reference sources: In case the current clock
reference source is faulty, the current clock referenc e source is automatically
switched to the standby clock referenc e source.
29
Clock holdover: When all clock reference sources are faulty, the system can keep
working normally for at least 90 days. (the BB8200 can be held over for at least 180
days)
3.1.5.2
PM Troubleshooting Process
The PM troubleshooting process is divided int o two parts:
1
3.1.5.3
When the active PM board detects that a new board is inserted to a specific
slot, the active PM board immediately sends a message to the CC board. If
the CC board decides to supply power for the newly inserted board, the CC
board sends a power supply command to both the active and standby PM
boards, thus ensuring that both PM boards enable the out put of the
corresponding channel.
30
In case a baseband board is faulty, Node B assigns the cell setup and radio link setup
initiated by the RNC to anot her available baseband board in the baseband pool.
For the existing UEs in the faulty baseband board, Node B sends a reset request to the
RNC, requesting the RNC to res et the UEs in the faulty baseband board. The public
resources (for example, cells) in the faulty baseband board are deleted, and Node B
notifies the deletion result to the RNC through an audit response. The RNC initiates cell
setup and radio link setup again, and Node B assigns them to another available
baseband board in the baseband pool.
3.2
3.2.1
Port backup is also accompanied by board backup. The board backup mode is 1+1
backup. At a time, the Ethernet port in the active board receives and transmits data. In
case the port of the active board is faulty, the active/standby switching is triggered. The
Ethernet port of the new active board is active for transmitting and receiving data.
Table 3-3lists the FE routing ports and trunk ports supported by various boards.
Table 3-3
Board
UIMC
GUIM
THUB
In a minor environment, the FE routing interc onnection bet ween UIM (including UIMC
and GUIM) boards can be used for networking. In a major environment, a THUB board
needs to be interconnected to other shelves for networking. In 0, a pair of routing ports
or a group of trunk ports can be inte rconnected between t wo shelves.
31
3.2.1.1
Routing Interconnection
Routing interconnection is also called FE interc onnection. The UIM (including UIMC and
GUIM) board and UIM/ THUB board can each provide one pair of FE ports for
interconnection. The boards work in the 1+1 backup mode. The Ethernet between the
active board and standby board communicates with the external world through high
resistance. 0 shows the connection mode. The blue channels indic ate the availabl e
communication links, and the read channels indicate the high-resistance channels. The
two available communication links include an active link and a standby link. All control
plane communication streams are in the active link. When the active link is faulty, the
system switches the faulty link to the standby link. In this communication mode, the
traffic is only 100M because only one channel is really used.
Figure 3-29
UIM
3.2.1.2
UIM
Active
Active
Standby
Standby
TRUNK Interconnection
The routing scheme only supports the t raffic of 100M, and cannot satisfy the needs of
high traffic. Therefore, the inter-shelf cascaded boards are interconnected through a
trunk, so as to increase the inter-shelf cascading traffic (theoretically, the maximum
traffic is as high as 400M). Trunk interconnection complies with the link aggregation
protocol. The physical links with the same transmission media type and transmission
rate are bundled together. Logically, these bundled physical links appear to be one link.
Link aggregation is also c alled t runking, which allows the peer physical link between
switches or between a switch and a server to multiply its bandwidth. Therefore, trunking
is an important technology whereby to inc reas e the link bandwidth and ensure the
flexibility and redundancy of link transmission.
0 shows the connection mode. The blue channels indicate the available communication
links, and the red channels indicate the high -resistance channels. The four available
communication links are aggregated into a group of trunk ports. In case one link is faulty,
communication is implemented through another link.
32
Figure 3-30
3.2.2
UIM/THUB
UIM/THUB
Active
Active
Standby
Standby
3.2.3
3.2.4
33
HSSL. The PSN boards work in the load sharing mode, that is, two PSN boards switch
packets at the same time. The port backup mode is applied bet ween GLI boards. The
GLI boards are cross-connected to the GUIM boards in another shelf respectively. At
each end, every GE port has two optical modules for backup. Through active/standby
port backup and optical module backup, the backup proportion bet ween physical links
and logical links is 4:1, thus providing a reliable link assuranc e for the access of IP
packets to the switching center.
The GUIM has two GE ports. Active/standby GE port backup is accompanied by board
backup. E very GE port has two optical modules. The GLI has four GE ports, and every
GE port has two optical modules. Th e two GE ports wit h the same GLI sequence
number have an active/standby relation. The active/standby relation between t he GE
ports with different sequence numbers is independent from each other. 0 lists the GE
ports supported by the GUIM/GLI.
Table 3-4
Board
Number of GE Ports
GUIM
GLI
E very GE port has two optical modules, thus enabling the double-t ransmitting & onereceiving mechanism through hardware. Both optical modules transmit data to the
external world. At a time, only the active optical port receives data. The hardware also
provides the switching function for the packet-receiving optical module.
The following section describes the inter -shelf fiber interconnection mode in det ail.
3.2.4.1
GUIM-GLI Interconnection
In the GUIM, there are t wo optical ports interconnected to the GLI. The active/standby
relation bet ween the two optical ports is also accompanied by board backup. In normal
cases, the two optical ports both forward the media streams. Traffic balance is attained
between t he two GE ports in the GUIM through a c ertain algorithm. The system
compares the number of the available GE ports in the boards. The system prefers the
GUIM with a larger number of available optical ports as the active end. In case the GE
port of the active GUIM is faulty, board switching is initiated. 0 shows the detailed
connection mode.
34
Figure 3-31
GUIM-GLI Connection
GUIM
#1
G
L
I
#
1
4
1
GUIM
#2
2
1
G
L
I
#
2
3.2.4.2
Figure 3-32
GUIM#1
GUIM#3
GUIM#2
GUIM#4
35
3.3
3.3.1
3.3.2
IP Link Backup
IP link backup is configured through route redundancy. According to the link status, the
upper-layer application selects an available route for backup. The link status can be
obtained by monitoring the on/off state of the physical layer or through the IP layer
detection prot ocol (for example, BFD). In case an external interface of the system is
faulty, the route related to the faulty interface also fails. When sending data, an
application selects another route. If the BFD mechanism det ects that the corresponding
route is not available, the application does not select the corresponding route, either.
Instead, the application selects anot her normal route.
For details about BFD, refer to ZTE UMTS IP UTRA N Feat ure Guide.
3.3.3
3.3.4
Parameter Description
4.1
Parameter List
4.1.1
Abbreviated name
Parameter name
36
4.1.2
BackUpMode
Backup Mode
RcpModule
RCP Module
SubSystem
SubSystem
4.1.3
4.2
Parameter Configuration
4.2.1
4.2.1.1
Backup Mode
OMC Path
Path: View->Configuration Management ->RNC NE ->RNC Ground Resource
Management-> RA CK-> Create->Board->A vailable Board
Parameter Configuration
Relat ed description
None
Interface parameter description:
The parameter indicates the backup mode of boards. Its value is as follows:
0
No backup
37
N+1 backup
1:1 backup
Note: The backup mode is specified when the corresponding boards are configured.
Recommendation: Configure t he backup mode according to the field networking
mode
4.2.1.2
Rcp Module
OMC Path
Path: View->Configuration Management ->RNC NE ->RNC Ground Resource
Management-> RA CK-> Create->Board-> RCP -> Module 1 / Module 2
Parameter Configuration
Relat ed description
None
Interface parameter description:
The parameter indicates the module number of the RCP. Its value is 3 to 127.
Recommendation: none
4.2.1.3
Sub System
OMC Path
Path: View->Configuration Management ->RNC NE ->RNC Ground Resource
Management-> RA CK-> Create->Board-> RCP -> Module 1 / Module 2
Parameter Configuration
Relat ed description
None
Parameter description: none
The parameter indicates the subsystem number of the resource shelf of the RCP.
Its value is 0 to 19. The invalid value is 0xFF. When the Iub int erface is carried in
the ATM, the available SubSystem [0] of the RCP must be the resource shelf that is
38
accessed by Node B under the jurisdiction of the RCP, and the number of available
shelves is 1.
4.2.2
4.2.3
5
5.1
5.2
Alarm List
Alarm Code
198005376
Description
Severity Level
Level 2
Caus e
1. The number of int ernal ports configured for this board and
the neighboring slots exceeds the number of physical slots.
2. Pin Healthy of this slot on the backplane is damaged.
Alarm Code
198066003
Description
Severity Level
Level 2
Caus e
Alarm Code
198005378
Description
Board HW Error
39
Severity Level
Level 2
Caus e
Alarm Code
198005379
Description
Severity Level
Level 2
Caus e
Alarm Code
198066004
Description
Severity Level
Level 2
Caus e
Alarm Code
198005381
Description
Severity Level
Level 1
Caus e
Alarm Code
198005395
Description
Severity Level
Level 2
Caus e
40
Alarm Code
198005399
Description
Severity Level
Level 2
Caus e
1. DSP does not succeed in loading the version and can not
work normally.
2. DSP has HPI interface exception.
3. The media plane loop back fails.
4. Lan91 link is broken.
5. The link of 5328 port is broken.
6. DSP running halts.
7. The system initialization of the Tone resource DSP fails.
Different formats of tone resources or even no tone resource
exists on the board.
Alarm Code
198026127
Description
Severity Level
Level 3
Caus e
Alarm Code
198026128
Description
Severity Level
Level 2
Caus e
Alarm Code
198026129
Description
Severity Level
Level 1
Caus e
Alarm Code
198005122
Description
Severity Level
Level 3
Caus e
41
Alarm Code
198005189
Description
Severity Level
Notification Level 4
Caus e
Alarm Code
198005660
Description
Severity Level
Level 4
Caus e
Alarm Code
198066058
Description
Severity Level
Level 3
Caus e
Glossary
AC
Access Control
APBI
APBE
APS
ATM
BHCA
BME
BM-SC
BRS
Bearer Subsystem
CBC
THUB
Trunk HUB
CN
Core Network
DASF
42
DBS
DDN
43