11 Appendix Troubleshooting FCoE
11 Appendix Troubleshooting FCoE
Appendix
A. Call Flow Diagram
B. Troubleshooting Fibre Channel problems
C. Additional Spanning Tree Slides
D. Troubleshooting NEX-5020 Hardware
E. Troubleshooting NEX-5020 Upgrades
F. NEX-5020 Trace Tools
Appendix A
This call flow diagram comes from the Global Services Product Support Bulletin NEX-5020
Fibre Channel over Ethernet Switch.
Customer Service Rules of Engagement…
In all customer support cases, EMC will take the first call and transfer the call to EMC Level 1
support. EMC will maintain primary control of the customer support relationship. EMC and
Cisco will work together to ensure customer issues are resolved as quickly as possible. It is
agreed that at times that will mean resolving the call with EMC resources, and at other times
that will result in rapid escalation to Cisco. The decision to rapidly escalate to Cisco will be
based on criteria such as customer requests, Service Level Agreements, the nature of the error
condition and call severity.
In all customer support cases, if EMC determines that the problem can not be resolved using
EMC resources, or if, at any time, the customer requests to be transferred to Cisco for support,
EMC will engage Cisco and work jointly through product resolution.
Appendix B
E Port - In expansion port (E port) mode, an interface functions as a fabric expansion port. This
port may be connected to another E port to create an Inter-Switch Link (ISL) between two
switches.
TE Port - In trunking E port (TE port) mode, an interface functions as a trunking expansion
port. It may be connected to another TE port to create an Enhanced ISL (EISL) between two
switches. TE ports are specific to Cisco MDS 9000 and Nexus 5020 Family switches
F Port - In fabric port (F port) mode, an interface functions as a fabric port. This port may be
connected to a peripheral device (host or disk) operating as an N port. An F port can be attached
to only one N port. F ports support class 2 and class 3 service.
Auto Mode - Interfaces configured in auto mode can operate in one of the following modes: F
port, E port, or TE port. The port mode is determined during interface initialization.
SD Port- In SPAN destination port (SD port) mode, an interface functions as a switched port
analyzer (SPAN).
NP Port - An NP port is a port on a device that is in NPV mode and connected to the core NPV
switch through an F port. NP ports operate like N ports that function as proxies for multiple
physical N ports.
Invalid calibration
64 bit counters:
0. rxHCTotalPkts = 152894790
switch# show interface eth1/21
1. txHCTotalPks = 330867280
2. rxHCUnicastPkts = 0
Ethernet1/21 is up 3. txHCUnicastPkts = 0
Hardware is 10000 Ethernet, address is 000d.ec6d.84dc 4. rxHCMulticastPkts = 152894790
(bia 000d.ec6d.84dc) 5. txHCMulticastPkts = 330867231
MTU 1500 bytes, BW 10000000 Kbit, DLY 10 usec, 6. rxHCBroadcastPkts = 0
7. txHCBroadcastPkts = 49
reliability 255/255, txload 1/255, rxload 1/255
8. rxHCOctets = 9785619336
Encapsulation ARPA 9. txHCOctets = 21176709656
Port mode is access 10. rxTxHCPkts64Octets = 483725278
full-duplex, 10000 Mb/s 11. rxTxHCpkts65to127Octets = 29422
12. rxTxHCpkts128to255Octets = 7358
Input flow-control is off, output flow-control is off 13. rxTxHCpkts256to511Octets = 12
5 minute input rate 22203 bytes/sec, 346 packets/sec 14. rxTxHCpkts512to1023Octets = 0
5 minute output rate 640597 bytes/sec, 10000 15. rxTxHCpkts1024to1518Octets = 0
packets/sec 16. rxTxHCpkts1519to1548Octets = 0
Rx 17. rxHCTrunkFrames = 0
18. txHCTrunkFrames = 0
16501327 Input Packets 9 Unicast Packets 16500923 19. rxHCDropEvents = 0
Multicast Packets
395 Broadcast Packets 0 Jumbo Packets 0 Storm All Port Counters:
Suppression Packets 0. InPackets = 152894790
1056159080 Bytes 1. InOctets = 9785619336
0 No buffer 0 runt 0 crc 0 ecc 2. InUcastPkts = 0
3. InMcastPkts = 152894790
0 Overrun 0 Underrun 0 Ignored 0 Bad etype drop
4. InBcastPkts = 0
0 Bad proto drop 0 If down drop 0 Collision 5. InJumboPkts = 0
0 Late collision 0 Lost carrier 0 No carrier 6. StormSuppressPkts = 0
0 Babble 7. OutPackets = 330867280
8. OutOctets = 21176709656
Tx 9. OutUcastPkts = 0
433943286 Output Packets 26171 Multicast Packets 10. OutMcastPkts = 330867231
0 Broadcast Packets 0 Jumbo Packets 11. OutBcastPkts = 49
12. OutJumboPkts = 0
27772499094 Bytes 13. rxHCPkts64Octets = 152880091
0 Ouput errors 14. rxHCPkts65to127Octets = 14699
16499333 Rx pause 0 Tx pause 0 Reset 15. rxHCPkts128to255Octets = 0
16. rxHCPkts256to511Octets = 0
[snip]
Virtual interface is a concept in Forwarding (LU and AC). MACs and Packet Editing only
understand physical interface.
Troubleshooting FC MAC
Troubleshooting FC MAC
00: 2:6B
00: 6:2B
00: 9:6B
00:11:25
00:14:5E
00:50:2E
00:50:8B
00:60:B0
00:A0:B8
00:D0:60
00:D0:B2
00:E0:69
00:E0:8B
Every VSAN starts with the area ID of 00 and we increment the Port ID's, based on order of
login, until we run out at which time we increase the Area ID from 00 to 01 and start the process
over.
We also maintain a database mapping of HBA's and FCID so we try to reassign the same FCID
to an HBA even if someone plugs the HBA into another port on the same switch in the same
VSAN.
This database is maintained until some many HBA's have logged in the system needs to reclaim
the FCID or the switch is rebooted. If you assign a persistent FCID then the HBA will get the
same FCID no matter what.
FLOGI Accept
FLOGI Service
Parameters
[snip] [snip]
FC_CT FC_CT
Revision: 1 Revision: 1
IN_ID: 00.00.00 IN_ID: 00.00.00
GS Type: Directory Service (0xfc) GS Type: Directory Service (0xfc)
GS Subtype: 0x02 GS Subtype: 0x02
Server: dNS (0x01) Server: dNS (0x01)
Options: 0x00 Options: 0x00
dNS dNS
Opcode: RFT_ID (0x0217) Opcode: MSG_ACC (0x8002)
Maximum/Residual Size: 4 Maximum/Residual Size: 0
Port Identifier: 7e.00.00
FC-4 TYPEs Supported: FCP,
Request Response
N5K# 2008 Aug 12 17:41:50.741393 fcns: vsan 5: fc_ct_parse_frame() succeeded - request_id = 26, flags = 0
2008 Aug 12 17:41:50.741445 fcns: vsan 5: received sw query NS_CMD_GE_ID from switch 0xfffca4 to switch
0xfffcea
2008 Aug 12 17:41:50.741472 fcns: vsan 5: Got entry for port-id 0xea0000 matching the query
2008 Aug 12 17:41:50.741500 fcns: vsan 5: sending accept response to port-id 0xfffca4
2008 Aug 12 17:41:51.038089 fcns: vsan 5: fc_ct_parse_frame() succeeded - request_id = 27, flags = 0
2008 Aug 12 17:41:51.038139 fcns: vsan 5: received sw query NS_CMD_GE_ID from switch 0xfffca4 to switch
0xfffcea
2008 Aug 12 17:41:51.038168 fcns: vsan 5: Got entry for port-id 0xea0000 matching the query
2008 Aug 12 17:41:51.038196 fcns: vsan 5: sending accept response to port-id 0xfffca4
2008 Aug 12 17:41:52.720708 fcns: vsan 5: fc_ct_parse_frame() succeeded - request_id = 28, flags = 0
2008 Aug 12 17:41:52.721216 fcns: vsan 5: fc_ct_parse_frame() succeeded - request_id = 29, flags = 0
2008 Aug 12 17:41:52.721421 fcns: vsan 5: fc_ct_parse_frame() succeeded - request_id = 30, flags = 0
2008 Aug 12 17:41:52.721846 fcns: vsan 5: fc_ct_parse_frame() succeeded - request_id = 31, flags = 0
2008 Aug 12 17:41:52.722478 fcns: vsan 5: fc_ct_parse_frame() succeeded - request_id = 32, flags = 0
2008 Aug 12 17:41:52.722512 fcns: vsan 5: received query NS_CMD_GID_FT
2008 Aug 12 17:41:52.722547 fcns: vsan 5: Applying zoning constraints on this query; limiting search space
to the zone(s) containing port-id 0xea0000
2008 Aug 12 17:41:52.722723 fcns: vsan 5: Got entry matching the object in the query; entry belongs to
port-id 0xa401cb
2008 Aug 12 17:41:52.722752 fcns: vsan 5: port-id 0xa401cb is not in the same zone; discarding this entry
2008 Aug 12 17:41:52.722775 fcns: vsan 5: Got entry matching the object in the query; entry belongs to
port-id 0xa401cc
2008 Aug 12 17:41:52.722797 fcns: vsan 5: port-id 0xa401cc is not in the same zone; discarding this entry
Fabric Controller
Each Switch has a Fabric Controller
Assigned address ‘FF FF FD’
– Every Fabric Controller in the fabric has the same address
– Responsible for managing fabric addressing, initialization, routing...
– Node ports can register with Fabric Controller to receive port or
fabric state changes
– A required service
ELS - SCR
SCR – State Change Registration
– Used when ports want to receive notification when the login state of other ports
in the fabric changes.
– SCR sent to Fabric Controller
– Can register for fabric events, port events, or both (Full)
– Login state may change due to:
Loss of signal (NOS, LOS, FLOGI)
LIP has occurred
New port logins
Response
Request
Path Selection Switch has a Path selection (FSPF) is defined Switch is operational with
Domain_ID in the next section (HLO, LSU and routes established
LSA)
The ESC (Exchange Switch Capabilities) frame allows switches to exchange information about
the routing protocols supported and agree on a common routing protocol.
The vendor ID field is used by the NEX-5020 to determine that the switch on the other end is
another MDS or NEX-5020 switch. This is indication that Trunking E_Ports (TE) and VSANs
are supported.
Only the ELP winner, the port that receives the SW_ACC to it’s ELP, sends the ESC.
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xc27f 0xffff 0xff -> 0x0 0x71
ff.ff.fd -> ff.ff.fd FC 1 0xc27f 0xdd22 0x0 -> 0xf Link Ctl, ACK1
SYNC
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xc27f 0xdd22 0x0 -> 0xf SW_ACC (0x71)
ff.ff.fd -> ff.ff.fd FC 1 0xc27f 0xdd22 0xff -> 0x0 Link Ctl, ACK1
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd23 0xffff 0x0 -> 0xf 0x71
ff.ff.fd -> ff.ff.fd FC 1 0xdd23 0xc280 0xff -> 0x0 Link Ctl, ACK1
Commit
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd23 0xc280 0xff -> 0x0 SW_ACC (0x71)
ff.ff.fd -> ff.ff.fd FC 1 0xdd23 0xc280 0x0 -> 0xf Link Ctl, ACK1
Exchange Peer Parameters (EPP) uses command (0x71). This is in the standard as a vendor
unique command code.
We use this frame for trunk negotiation, VSAN list exchange, and port VSAN consistency
check.
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd24 0xffff 0x0 -> 0xf EFP
ff.ff.fd -> ff.ff.fd FC 1 0xdd24 0xc281 0xff -> 0x0 Link Ctl, ACK1
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xc282 0xffff 0xff -> 0x0 EFP
ff.ff.fd -> ff.ff.fd FC 1 0xc282 0xdd25 0x0 -> 0xf Link Ctl, ACK1
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd24 0xc281 0xff -> 0x0 SW_ACC (EFP)
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xc282 0xdd25 0x0 -> 0xf SW_ACC (EFP)
Domain ID Assignment
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd26 0xffff 0x0 -> 0xf DIA
ff.ff.fd -> ff.ff.fd FC 1 0xdd26 0xc283 0xff -> 0x0 Link Ctl, ACK1
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd26 0xc283 0xff -> 0x0 SW_ACC (DIA)
ff.ff.fd -> ff.ff.fd FC 1 0xdd26 0xc283 0x0 -> 0xf Link Ctl, ACK1
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xc284 0xffff 0xff -> 0x0 RDI
ff.ff.fd -> ff.ff.fd FC 1 0xc284 0xdd27 0x0 -> 0xf Link Ctl, ACK1
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xc284 0xdd27 0x0 -> 0xf SW_ACC (RDI)
Final EFP
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd28 0xffff 0x0 -> 0xf EFP
ff.ff.fd -> ff.ff.fd FC 1 0xdd28 0xc285 0xff -> 0x0 Link Ctl, ACK1
ff.ff.fd -> ff.ff.fd SW_ILS 1 0xdd28 0xc285 0xff -> 0x0 SW_ACC (EFP)
ff.ff.fd -> ff.ff.fd FC 1 0xdd28 0xc285 0x0 -> 0xf Link Ctl, ACK1
FSPF Offers:
- Fast Computation of new paths in case of Failure
- Routing loop free topology
- Load sharing over equivalent paths
- Guaranteed in order delivery during topology changes
Statistics counters :
Number of packets received : LSU 4 LSA 4 Hello 58 Error packets 0
Number of packets transmitted : LSU 4 LSA 4 Hello 58 Retransmitted LSU 0
Number of times inactivity timer expired for the interface = 0
[snip]
Timers
NEX-5020 FC Timers
N5K(config-if)# sh fctimer
N5K(config-if)# fctimer ?
No matches in current mode, matching in (config) mode
D_S_TOV D_S_TOV in milliseconds(5000-100000)
E_D_TOV E_D_TOV in milliseconds(1000-100000)
R_A_TOV R_A_TOV in milliseconds(5000-100000)
abort abort the fctimer configuration commands
commit commit the fctimer configuration commands
distribute Enable distribution of fctimer configuration using CFS
This database contains only the currently active zone and zoneset
information. FC aliases are not included in this database.
Full zoneset
This database contains the currently active zone and zoneset information
as well as the other configured zones and zonesets that are not
currently active. FC aliases are included in this database.
Device-alias is used to map 1 PWWN to a user friendly naming convention. It can not be sent
to, or used on, non Cisco switches. It can be used in place of the PWWN for application
configuration like zoning, QoS, IVR, fcping, fctrace.
FCalias is only used in conjunction with a single vsan zoning. It is not available for other
applications. It can contain multiple members of different types.
These are the possible ways to define members in a zone. You can mix member types within a
zone.
If the zone is part of an interop VSAN or fabric, you should only use PWWN members.
Management Station
0x76 0x65
Changes initiated from seed switch
Seed switch Subordinate switch
• ACA (Acquire Chance Authorization) locks the zone application so no other 0xEF
changes can be attempted
Subordinate switch
• SFC (Stage Fabric Configuration) sends the zone data in the frame payload
• UFC (Update Fabric Configuration) triggers all switches to make the new data
active
There is a timer that is started by the seed switch for each command.
If the seed switch does not receive and ACC (accept) from each subordinate switch for the
current command, it will abort the zoning activation.
Fibre Channel
R_CTL: 0x02
Dest Addr: ff.fc.65
CS_CTL: 0x00
Src Addr: ff.fc.76
Type: SW_ILS (0x22)
F_CTL: 0x290000 (Exchange Originator, Seq Initiator,
Exchg First, Seq Last, CS_CTL, Transfer Seq
Initiative, Last Data Frame - No Info, ABTS - Abort/MS, )
SEQ_ID: 0x0e
DF_CTL: 0x00
SEQ_CNT: 0
OX_ID: 0x015b
RX_ID: 0xffff
Parameter: 0x00000000
SW_ILS UFC
Cmd Code: UFC (0x26) command
code
Here we see the ACA, the SFC, the UFC and the RCA.
Notice the FCID’s used for the frames. The last byte is the domain ID of the switch sending the
frame or the ACC.
Also, notice that each exchange (OX_ID) is 4 frames. Class F frames require an ACK1.
The Zoning standard requires an ACC (Accept) to each command frame.
In this example, Domain 0x76 is the seed switch, while Domain 0x65 is the subordinate switch.
Disallow parallel configuration: GS4 enforces all configurations to be done within a session. A session begins with the
reception of session-begin command and ends on reception of session-end command. When a session-begin command is
received, the entry switch should lock the entire fabric before it accepts any more commands within the session. The locking
ensures that there exists a single management session which is effecting a configuration change in the fabric. This feature ensures
consistency in the fabric.
GS4 mandates the usage of session for any Name server interaction/registration by a device. Earlier, NS registration could be
done at any time. Normally, as soon as devices login, they would do multiple back-to-back registrations with few msecs apart.
Whenever registrations are done, RSCN needs to be generated. However, to reduce the no. of RSCNs, NS used to run a timer to
wait for some time hoping to consume some more registrations so that a single RSCN can be generated for multiple registrations.
The timeout values are fixed based on heuristics.
This problem has been addressed in GS4. The new standard states that the device should start a session, do multiple registrations
and complete it by a session-end command. The switch can generate the RSCN on reception of session-end command. This
mechanism exactly defines when the RSCN should be generated and clearly better than the earlier timer-based approach. It also
saves timer resources.
GS4/SW3 have defined a new zoning object called `Zone Reference Object'. Zone Reference object is nothing but pointers (in
Coding terminology). If an object (ex: Zone) is part of multiple other higher-level objects (ex: Zonesets), instead of creating an
instance of the object for each higher-level object, you can define the object once and higher-level objects can use references to
the object. This feature certainly reduces the payload size as Reference object consume lesser bytes than the actual object.This
advantage is more pronounced with bigger databases with lot of zonesets most of which reuse the same set of zones.
SW3/GS4 defines standard way of enforcing/exchanging default-zone setting throughout the fabric. Also, when fabrics are
merged, default-zone settings are exchanged to validate the same setting on either side of the fabric.
GS4 defines a new command (GAR) to retrieve the activation results from each of the remote switches. This would also include
the nature of the problem at each switch. This feature would help in easier trouble-shooting of any activation problems.
GS4/SW3 has defined new command codes to distribute the zoning database without any reactivations. This avoids any
hardware changes (TCAM programming) on the remote switches.
SW3/GS4 and has mandated the use of Vendor-ID along with Vendor-specific type value. This would uniquely identify a
member type.
MDS switch supports FWWN members as part of zones. FWWN based zone membership is an easier way of configuring
JBODs/FL devices.
ESS (Exchange Switch Support) defined in SW-3 also enhances interop. Currently, Name Server decides the object in its
response to GE-XX queries based on the interop mode. With ESS, vendors can specify what type of entry objects they
support(Large, small, ..). This will enhance interoperability between the switches of different vendors.
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 60
Appendix: Troubleshooting FCoE
Enhanced
Basic Zoning Enhanced Zoning
Advantages
Enhanced
Basic Zoning Enhanced Zoning
Advantages
To distribute the zoning database, you must reactivate the same zone set. The reactivation may
affect hardware changes for hard zoning on the local switch and on remote switches.
There are 2 types of SFC, basic and enhanced. Both are sent when making the change. If both
are accepted, then the change to enhanced mode is successful. If the enhanced SFC is rejected,
then the mode stays in basic.
Basic mode
Enhanced mode
Zoning policies
Merge control prevents a change from being introduced by accident if a change was made while
the fabrics were isolated.
If a fabric were isolated, and a change made on one side, without merge control restrict if there
were no conflicts, the change is propagated once the isolation issue is resolved.
The intent of the lock is prevent conflicting changes. In basic mode, the last user to activate a
zoneset will have their changes moved to the fabric. With enhanced zoning, the intent is to lock
the fabric once a change is started. Additional attempts to start a zoning change will be blocked,
and the user will not be able to make a zoning change.
Example configuration
ACA sent
Sample configuration to lock
fabric
In enhanced mode, as soon a zone change is started on any switch in the fabric (VSAN), the
ACA is sent to the other switches to lock the fabric.
The SFC, UFC, and RCA are sent on the commit, not the activate command.
No commit will sent RCA to discard changes. Like if you got distracted during the zone
change.
When the lock is present, the fabric is locked, and the user attempting the change will see an
error message.
To determine which switch/domain has the lock, issue show zone internal command.
Look for the ACA by field.
View all domains with the show fcdomain domain-list command.
On the switch that issued the ACA and acquired the lock, the show zone internal will look a bit
different.
On the switch with the lock, the show zone status will show which user has the lock.
Once the lock is cleared, the show zone status display shows none for the session.
[snip]
The show zone internal command on both switches look the same for state and ACA by now
that there is no lock in effect any more.
Lock is gone!
Bits of
Interest
N5K# sh zone ess vsan 5
ESS info on VSAN 5 :
Domain : 239, SWWN : 20:05:00:0d:ec:6b:cd:41, Cap1 : 0xfb, Cap2 : 0x0
Domain : 238, SWWN : 20:05:00:0d:ec:3f:a5:c1, Cap1 : 0xfb, Cap2 : 0x0
0xfb = Enhanced mode supported, enhanced mode enabled, and default zone set to
deny
The syslog should contain messages indicating why the zoneset activate failed. Check the seed
switch first, and then the remote switches.
Check that the ISL is operational.
Collect zone displays or debugs to assist with problem determination.
[snip]
Here we see the 4 frames of a normal activation. This a state machine trace.
The current state, the triggering event, and the resulting stat are shown.
Here we see that the ACA, SFC, UFC, and RCA are all received from domain ID 210.
Timer expiration event will be displayed here.
This trace has the most recent events at the bottom. Here we see there are 50 events. Event 50
is the most recent.
Do not deactivate the zoneset on either end, because that will cause local traffic contained
within that fabric or switch…in the affected VSAN to be impacted.
Manually correct one or the other side
Or use the import/export command to push/pull the desired zoneset.
This command only applies to an E or TE that is in the isolated state.
The user has to decide which end of the isolate link contains the zoneset that is desired to be
active on both ends.
You can export (force a push of the current active zoneset) or import (force a pull of the remote
side’s current active zoneset)
This example is of an import..so we can track the change on the same switch. There will be no
changes to the remote side zoneset.
The only difference on the remote side, will be that the isolated vsan is no longer isolated.
The only zoneset change occurs on the switch that did the import, or was opposite the export.
The user has to decide which end of the isolate link contains the zoneset that is desired to be
active on both ends.
You can export (force a push of the current active zoneset) or import (force a pull of the remote
side’s current active zoneset)
This example is of an import..so we can track the change on the same switch. There will be no
changes to the remote side zoneset.
The only difference on the remote side, will be that the isolated vsan is no longer isolated.
The only zoneset change occurs on the switch that did the import, or was opposite the export.
Appendix C
Unbounded failure
domains
Large broadcast
domains
Large amount of
unknown MAC unicast
traffic
Unbounded multicast
traffic
Management and
support challenges
Possible security
vulnerabilities
VLAN Overview
Segmentation
Flexibility
Security
VLAN design must take into consideration the implementation of a hierarchical network
addressing scheme. The benefits of hierarchical addressing are:
Ease of management and troubleshooting
Minimization of errors
Reduced number of routing table entries
Traffic type
must be
considered.
VLAN Operation
802.1Q Trunking
802.1Q Frame
Interconnection Technologies
Technology Use
Fast Ethernet Connects end-user devices
to the access layer switch
Advantages of EtherChannel
Redundant Topology
Spanning-Tree Operation
One root bridge per broadcast domain.
One root port per nonroot bridge.
One designated port per segment.
Nondesignated ports are unused.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 100
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 100
Appendix: Troubleshooting FCoE
Describing PortFast
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 101
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 101
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 102
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 102
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 103
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 103
Appendix: Troubleshooting FCoE
Spanning-Tree Recalculation
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 104
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 104
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 105
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 105
Appendix: Troubleshooting FCoE
Extended Bridge ID
with System ID
System ID = VLAN
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 106
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 106
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 107
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 107
Appendix: Troubleshooting FCoE
NEX-5020 Example
Ethernet Port-channel 3
2/2
1/2
VLAN 213
1 /4
1/3
Server3
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 108
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 108
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 109
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 109
Appendix: Troubleshooting FCoE
Spanning-tree outputs
POD2#
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 110
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 110
Appendix: Troubleshooting FCoE
VLAN0212
Spanning tree enabled protocol rstp
Root ID Priority 32768
Address 0011.bc52.20d4
Cost 1
Port 4097 (port-channel2)
Hello Time 2 sec Max Age 20 sec Forward Delay 15 sec
Access
members
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 111
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 111
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 112
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 112
Appendix: Troubleshooting FCoE
--------------------------------------------------------------------------------
Port Send FlowControl Receive FlowControl RxPause TxPause
admin oper admin oper
--------------------------------------------------------------------------------
Eth1/5 off off off off 3127212 0
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 113
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 113
Appendix: Troubleshooting FCoE
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 114
Appendix: Troubleshooting FCoE
switch# show plat fwm in sg 0.0.0.0 0.0.0.0 1 router group entry on VLAN 1 has S, G as
(0.0.0.0, 0.0.0.0)s
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0: vlan 1.1 pss_flags 2
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0: oifl idx 1
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0: iod list -
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1: vdc 1 gpinif 0, iods
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1: max_iod 4, ref count 58 num_oifs 0
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1: if_indexs -
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1 pd: mcast Idx 1 ifmap idx 1 (ref 1)
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 115
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 115
Appendix: Troubleshooting FCoE
show platform fcfib <options>: various options to dump software structures and fib database for FC
forwarding.
‘show platform software <options>’ and ‘show system internal <options>’: software
internal information of various software processes (help find out internal finite state machine details, event
history, memory allocation statistics, structures dump, global and other debug information)
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 116
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 116
Appendix: Troubleshooting FCoE
Appendix D
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 117
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 117
Appendix: Troubleshooting FCoE
Hardware Architecture
Gatos ASIC
is the “Unified
Port Controller”
Altos ASIC
is the “Unified
Crossbar Fabric”
Eth 3 & 4
Inband lo-hi
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 118
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 118
Appendix: Troubleshooting FCoE
Other method:
switch# show hardware internal gatos port ethernet 1/7 xcvr regs | begin "SFP"
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 119
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 119
Appendix: Troubleshooting FCoE
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 120
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 121
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 121
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 122
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 122
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 123
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 123
Appendix: Troubleshooting FCoE
Slow draining (< 5 Mbps for 10s straight) ports or stuck ports are
error disabled. shut / no shut is required to bring the port back up.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 124
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 124
Appendix: Troubleshooting FCoE
Gatos Debugging
switch# show hardware internal gatos
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 125
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 125
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 126
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 126
Appendix: Troubleshooting FCoE
Note: shows only non-zero interrupt counters. Includes fatal and non fatal error interrupts
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 127
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 127
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 128
Virtual interface is a concept in Forwarding (LU and AC). MACs and Packet Editing only
understand physical interface.
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 128
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 129
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 129
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 130
Virtual interface is a concept in Forwarding (LU and AC). MACs and Packet Editing only
understand physical interface.
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 130
Appendix: Troubleshooting FCoE
DROP_L2_BIND_CHK_FAILURE Frame was received with an unexpected source CNA logs, tcpdump on CNA to know the frame’s source
MAC address (FC OUI | SID isn’t correct) MAC address; software and hardware LIF dump; software
FLOGI database; FCFIB MAC addresses (show mac-
address-table internal)
DROP_FCF_SW_VSAN_IDX_MISS VSAN index table lookup resulted null VSAN configuration; softwrae and hardware vsan index
tables
DROP_FCF_SW_DOMAIN_IDX_MISS Domain index table lookup resulted null Domain configuration; software and hardware domain
index tables
DROP_FCF_SW_TBL_MISS Unable to find a FC path-select table for Software and hardware vsan index table, domain index
(domain, vsan) tables, and fc path-select tables
DROP_FCF_LOCAL_DOMAIN_MISS STM lookup failed for traffic destined to a VFC Software and hardware mac-address-table
DROP_NO_FABRIC_SELECTED Frame could not be forwarded to the fabric due Software and hardware PIF, LIF, Local Id Table, Multicast
"DROP_FABRIC_SELECT_TO_NULL to improper fabric selection (in PIF, LIF, Local Index Table.
Index Table, Multicast Index Table, etc.) or if the
"DROP_NO_FABRIC_CONNECTIVITY
destination was a reserved ‘drop index’.
DROP_HIT_DROP_PORT_MAP_IDX
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 131
Virtual interface is a concept in Forwarding (LU and AC). MACs and Packet Editing only
understand physical interface.
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 131
Appendix: Troubleshooting FCoE
DROP_PINNED_DEJA_VU_FAILURE End host mode: a frame with a source MAC of a If an error is suspected, software and hardware PIF, LIF
server interface received on a border interface settings for pbp_index; pinning information.
DROP_ST_MISS_NPIV_BORDER Station table entry not found when forwarding Software and hardware mac-address-table
traffic towards VFC in NPV mode.
DROP_PINNED_BORDER_TO_NULL There is no pinned interface; typically this would Software and hardware PIF, LIF settings for pbp_index;
not happen because a server interface is pinning information.
brought down if it can not be pinned to a border
interface
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 132
Virtual interface is a concept in Forwarding (LU and AC). MACs and Packet Editing only
understand physical interface.
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 132
Appendix: Troubleshooting FCoE
switch# show hardware internal gatos asic 13 interrupt match lu ‘lu’ can be substituted with
Interrupt name |Status ‘fw’ or ‘ac’
----------------------------------------------------------------+------
gat_lu_lkup1_INT_VlanXlateKeyTable_tcam0_full_f |Set
gat_lu_lkup1_INT_VlanXlateKeyTable_tcam0_write_on_full_f |Set
gat_lu_lkup1_INT_VlanXlateKeyTable_tcam1_full_f |Set
gat_lu_lkup1_INT_VlanXlateKeyTable_tcam1_write_on_full_f |Set
Done.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 133
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 133
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 134
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 134
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 135
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 135
Appendix: Troubleshooting FCoE
FC Forwarding - Basics
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 136
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 136
Appendix: Troubleshooting FCoE
switch(config-if)# show platform fwm info vlan 7 this command has more options to
look at entities that are per VLAN
vlan 1.7: dump_all 0 verbose 1
vlan 1.7: state 0x0 num_lifs 1 fwm_ptr 0x85da32c pd_ptr 0x85daef4
vlan 1.7: lif list - Eth1/22
vlan 1.7 fwm: age 300, cfg age 300
vlan 1.7 fwm: mac count 0, ipg count 0, macg count 0 vlanif count 1
vlan 1.7 fwm: dyn_macs 0 fwd_vlanifs 1 igmp_sgs 1
vlan 1.7 fwm: pss_flags 0 igmp snoop: 1
vlan 1.7 pd: int-vlan 10 state table idx 10 vacl_label 128
vlan 1.7 pd: flood refs - unicast 7 broadcast 7 multicast 7
vlan 1.7 pd: gatos 9 lif_members: 0
vlan 1.7 pd: if members (VLAN egress masks): 1-2,26
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 137
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 137
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 138
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 138
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 139
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 139
Appendix: Troubleshooting FCoE
ethanalyzer: tool to capture supervisor frames as they are sent out or received (also dumps internal
headers)
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 140
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 140
Appendix: Troubleshooting FCoE
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 141
Appendix: Troubleshooting FCoE
switch# show plat fwm in sg 0.0.0.0 0.0.0.0 1 router group entry on VLAN 1 has
S, G as (0.0.0.0, 0.0.0.0)s
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0: vlan 1.1 pss_flags 2
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0: oifl idx 1
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0: iod list -
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1: vdc 1 gpinif 0, iods
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1: max_iod 4, ref count 58 num_oifs 0
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1: if_indexs -
sg vlan 1.1 sgp sip 0.0.0.0 gip 0.0.0.0 oifl 1 pd: mcast Idx 1 ifmap idx 1 (ref 1)
switch# show platform fwm info vlan 1 all_macgs: lists all MAC group entries in a given VLAN
switch# show plat fwm info vdc 1: lists all oifls in a given vdc (Ethernet or FC)
switch# show plat fwm info oifl <oifl_index> : lists all interfaces in an oifl from an oifl index (from vlan,
vdc, SG, or macg dump)
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 142
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 142
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 143
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 143
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 144
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 144
Appendix: Troubleshooting FCoE
debug platform hardware peek: dumps the content of a specific hardware table entry
debug hardware internal poke: modifies the content of a spefici hardware table entry [Be
careful!]
show platform fwm info <options>: various options to dump software structures indicating
the intended hardware programming for Ethernet and FC forwarding.
show platform fcfib <options>: various options to dump software structures and fib database
for FC forwarding.
show interface <if> counters storm-control: storm control related drops
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 145
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 145
Appendix: Troubleshooting FCoE
POD2# sh cdp ?
all Show interfaces that are CDP enabled
entry Show CDP entries in database
global Show CDP global parameters
interface Show CDP parameters for an interface
neighbors Show CDP neighbors
traffic Show CDP traffic statistics
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 146
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 146
Appendix: Troubleshooting FCoE
Version:
Cisco Internetwork Operating System Software
IOS (tm) s72033_rp Software (s72033_rp-IPSERVICES_WAN-VM), Version
12.2(18)SXF11, RELEASE SOFTWARE (fc1)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 1986-2007 by cisco Systems, Inc.
Compiled Fri 14-Sep-07 23:09 by kellythw
Advertisement Version: 2
Native VLAN: 1 Sent on Native VLAN
Duplex: full
----------------------------------------
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 147
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 147
Appendix: Troubleshooting FCoE
DCBX Overview
DCBX is used to:
Auto-negotiation of capability and configuration
– Priority Flow Control capability and all associated CoS values
enabled with PFC
– FCoE capability
Allows one link peer to push config to other link peer
– FCoE CoS value
Logical Link Up/down signaling of Ethernet and FC
DCBX negotiation failures will result in
vfc not coming up
Per-priority-pause not enabled on CoS values with PFC config
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 148
http://download.intel.com/technology/eedc/dcb_cep_spec.pdf
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 148
Appendix: Troubleshooting FCoE
LDP Transmit and Receive should be enabled on the interface for DCBX negotiation to be successful
switch# show lldp interface ethernet 2/1 | grep tx_en
tx_enabled: TRUE
switch# show lldp interface ethernet 2/1 | grep rx_en
rx_enabled: TRUE
If not enabled please enable lldp transmit & receive using the following commands
switch# conf t
switch(config)# int eth2/3 If lldp disabled vfc will not come up
switch(config-if)# lldp transmit
switch(config-if)# lldp receive
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 149
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 149
Appendix: Troubleshooting FCoE
switch# show lldp interface ethernet 2/2 | grep "No remote peers exist"
No remote peers exist The remote adapter/switch does not support LLDP
Note that this case may also happen because of the peer sending bad LLDP frames. The user can find out
using
show platform software dcbx internal errors | grep "Received bad LLDP Packet"
As of FCS Menlo based CNA will support the following DCBX TLV’s
switch# show platform software dcbx internal info interface ethernet 2/1 | grep "DCX TLV"
DCX TLV's
DCX TLV type: 1 DCX TLV Length: 10 DCX TLV Value DCBX Control TLV
DCX TLV type: 6 DCX TLV Length: 5 DCX TLV Value DCBX Ethernet Logical Link TLV (Sub type 1)
DCX TLV type: 6 DCX TLV Length: 5 DCX TLV Value DCBX FC Logical Link TLV (Sub type 0)
DCX TLV type: 5 DCX TLV Length: 5 DCX TLV Value DCBX FCoE Cos TLV
DCX TLV type: 3 DCX TLV Length: 5 DCX TLV Value DCBX PFC TLV
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 150
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 150
Appendix: Troubleshooting FCoE
Oplin based CNA will support the following DCBX TLV’s today:
DCX TLV's
DCX TLV type: 1 DCX TLV Length: 10 DCX TLV Value --> DCBX Control TLV
DCX TLV type: 2 DCX TLV Length: 28 DCX TLV Value --> DCBX Priority Groups TLV
DCX TLV type: 3 DCX TLV Length: 5 DCX TLV Value --> DCBX PFC TLV
DCX TLV type: 5 DCX TLV Length: 5 DCX TLV Value --> DCBX FCoE Cos TLV
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 151
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 151
Appendix: Troubleshooting FCoE
Error
– Indicates negotiation error.
– Never expected to happen when connected to CNA
– When two NEX-5020’s are connected back-to-back
If PFC is enabled on different CoS values negotiation error can happen
– Operating Config
– Indicates negotiation result
– Absence of operating config indicates that the peer does not support this DCBX TLV or negotiation error
– “remote_feature_tlv_present” indicates whether the remote peer supports this feature TLV or not
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 152
Timers
TX Delay While timer info: Timer type 0, Timer ID 10726,eventID
DCX_TX_EV_TXDELAYWHILE_EXPIRED Is timer active 0time remaining
-1817611244
.
.
.
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 152
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 153
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 153
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 154
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 154
Appendix: Troubleshooting FCoE
Capturing on eth4
2008-04-18 18:34:00.659286 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120
2008-04-18 18:34:05.105217 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120 ---> DCX Initial Negotiation
2008-04-18 18:34:05.106611 00:c0:dd:0a:ad:1d -> fc:fc:fc:ff:ff:fe 4 0 00.00.00 -> ff.ff.fe 0x33e 0xffff FC
ELS FLOGI
2008-04-18 18:34:05.124385 00:0d:ec:6d:83:c0 -> 00:c0:dd:0a:ad:1d 4 0 ff.ff.fe -> 1b.00.00 0x33e 0x4b5 FC
ELS ACC (FLOGI) ---> FLOGI/LSACC for VFC bringup
2008-04-18 18:34:06.105306 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120
2008-04-18 18:34:07.105351 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120
2008-04-18 18:34:08.105441 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120
2008-04-18 18:34:09.105537 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120
2008-04-18 18:34:39.007974 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120
2008-04-18 18:35:09.010407 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 1 0 00:c0:dd:0a:ad:1d -> 01:80:c2:00:00:0e 0x0
0x0 LLDP Chassis Id = 00:c0:dd:0a:ad:1c Port Id = 00:c0:dd:0a:ad:1d TTL = 120 ---> DCX KeepAlives
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 155
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 155
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 156
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 156
Appendix: Troubleshooting FCoE
Failures causing NXOS not Console continuous print error messages every 30 seconds.
be able to come up System LED sets to Flashing Amber. Example of such failure:
properly DRAM, backplane SPROM checksum failure, PCIe enumeration
failure
Failures not fatal and System comes all the way up. Syslog, OBFL and callhome initiated
NXOS can boot up to indicate failure. Example of such failure: OBFL flash, CTS
keystore.
Failure causing port System comes all the way up. Syslog, OBFL and callhome initiated
failures to indicate failure. Example of such failure: ASIC ECC error found
during POST or OHMS
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 157
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 157
Appendix: Troubleshooting FCoE
Environmental Monitoring
switch# show environment
Displays following status:
– Fan
– Temperature
– Power Supply
– Power Usage Summary
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 158
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 158
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 159
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 159
Appendix: Troubleshooting FCoE
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 160
Appendix: Troubleshooting FCoE
Eth 1 2 3 4 5 6
Port ------------------ 8) TestFrontPort :
. . . . . .
Eth 1 2 3 4 5 6
Port ------------------
5) TestFabricPort : . . . . . .
Eth 1 2 3 4 5 6
Port ------------------
. . . . . .
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 161
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 161
Appendix: Troubleshooting FCoE
NOHMs logging
switch# show logging |grep NOHMS
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 162
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 162
Appendix: Troubleshooting FCoE
Show tech
Capture to terminal emulator buffer or log file:
switch# terminal length 0
switch# show tech-support details
`show switchname`
switch
`show system uptime`
System start time: Mon Aug 11 15:33:17 2008
System uptime: 2 days, 0 hours, 46 minutes, 4 seconds
.
.
.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 163
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 163
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 164
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 164
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 165
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 165
Appendix: Troubleshooting FCoE
Core Files
Show cores
switch# show cores
Module-num Process-name PID Core-create-time
---------- ------------ --- ----------------
1 fwm 2834 Aug 13 16:3
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 166
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 166
Appendix: Troubleshooting FCoE
Service: fwm
Description: FWM Daemon
PID: 2834
Exit code: signal 3 (core dumped)
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 167
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 167
Appendix: Troubleshooting FCoE
Appendix E
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 168
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 168
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 169
169
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 169
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 170
170
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 170
Appendix: Troubleshooting FCoE
Switch reboots and comes up on new code and with boot variables set
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 171
171
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 171
Appendix: Troubleshooting FCoE
Helpful commands
Ping, show file filename cksum/md5sum, show install all
impact/status/failure-reason
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 172
172
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 172
Appendix: Troubleshooting FCoE
Recovery Methods
Boot Sequence
See Nexus 5000 Series Switch CLI Software Configuration Guide page 3-2
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 173
173
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 173
Appendix: Troubleshooting FCoE
Within 2 sec
Of power on
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 174
174
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 174
Appendix: Troubleshooting FCoE
Before the boot sequence starts, the BIOS performs internal tests
on the switch. If the tests fail, then the loader does not gain control.
Instead, the BIOS image retains control and prints a message to
the console at 9600 baud every 30 seconds that indicates a failure.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 175
Console Settings
BIOS only operates at 9600 baud, 8 data bits, no parity, and 1 stop bit. The user cannot change the console settings
for the BIOS. This is normally not a problem as the BIOS only interacts with the console when checking for if the
user wants to break into the golden BIOS, as well as output in the case of critical errors such as failed DRAM test.
The console settings for loader, kickstart and system are all stored in CMOS. All three images follow the settings
in the CMOS. There are two places in the system that allow the users to change the console settings stored in the
CMOS
Loader via the CLI command serial
System image via the CLI command line
The factory default for the console settings for the loader, kickstart and system images is 9600, 8 data bits, no
parity, and 1 stop bit.
Recovering from corrupted/forgotten console settings
If the user cannot figure out or forgets the console settings stored in CMOS, or if the CMOS gets corrupted, users
can go back to 9600-8-N-1 on the terminal side and then reboot the switch. When the switch is booting, user can
force the loader to run at the factory default console settings of 9600-8-N-1 by hitting Ctrl-Shift-R repeatedly until
the loader> prompt appears. Note that Ctrl-Shift-R must be entered when the console server settings are at 9600-8-
N-1. The loader gives the user a one second window where it will detect this, and it will not give any sort of
indication that it has started/stopped checking for this keystroke. This is consistent with MDS ideology regarding
loader behavior. The user can then reset the CMOS to factory default settings, or write their own.
175
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 175
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 176
176
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 176
Appendix: Troubleshooting FCoE
The Kickstart CLI can be used to recover from lost password and to
recover form corrupt configs etc.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 177
177
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 177
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 178
178
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 178
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 179
179
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 179
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 180
180
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 180
Appendix: Troubleshooting FCoE
Set IP address
loader> set ip 172.22.42.157 255.255.254.0
Correct - ip addr is 172.22.42.157, mask is 255.255.254.0
Found Intel 82571EB at 0x8020, ROM address 0x9980
Probing...[Intel 82571EB]Ethernet addr: 00:0D:EC:A3:38:40
Address: 172.22.42.157
Netmask: 255.255.254.0
Server: 0.0.0.0
Gateway: 0.0.0.0
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 181
181
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 181
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 182
182
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 182
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 183
183
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 183
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 184
184
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 184
Appendix: Troubleshooting FCoE
Appendix F
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 185
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 185
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 186
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 186
Appendix: Troubleshooting FCoE
Ethanalyzer
Ethanalyzer is a Cisco NX-OS protocol analyzer tool based on the Wireshark
(formerly Ethereal) open source code. Ethanalyzer is a command-line version of
Wireshark that captures and decodes packets. You can use Ethanalyzer to
troubleshoot your network and analyze the control-plane traffic.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 187
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 187
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 188
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 188
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 189
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 189
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 190
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 190
Appendix: Troubleshooting FCoE
SPAN Sources
The Switched Port Analyzer (SPAN) feature (sometimes called port
mirroring or port monitoring) selects network traffic for analysis by a
network analyzer. The network analyzer can be a Cisco SwitchProbe, a
Fibre Channel Analyzer, or other Remote Monitoring (RMON) probes.
SPAN sources refer to the interfaces from which traffic can be monitored.
The Cisco Nexus 5000 Series switch supports Ethernet, virtual Ethernet,
Fibre Channel, virtual Fibre Channel, port channels, SAN port channels,
VLANs, and VSANs as SPAN sources. With VLANs or VSANs, all
supported interfaces in the specified VLAN or VSAN are included as SPAN
sources. You can choose the SPAN traffic in the ingress direction, the
egress direction, or both directions for Ethernet, virtual Ethernet, Fibre
Channel, and virtual Fibre Channel source interfaces:
– Ingress source (Rx)—Traffic entering the switch through this source port is copied
to the SPAN destination port.
– Egress source (Tx)—Traffic exiting the switch through this source port is copied to
the SPAN destination port.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 191
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 191
Appendix: Troubleshooting FCoE
Source Ports
A source port, also called a monitored port, is a switched interface that you monitor for
network traffic analysis. The switch supports any number of ingress source ports (up to
the maximum number of available ports on the switch) and any number of source
VLANs or VSANs.
A source port has these characteristics:
– Can be of any port type: Ethernet, virtual Ethernet, Fibre Channel, virtual Fibre
Channel, port channel, SAN port channel, VLAN, and VSAN.
– Cannot be monitored in multiple SPAN sessions.
– Cannot be a destination port.
– Each source port can be configured with a direction (ingress, egress, or both) to
monitor. For VLAN, VSAN, port channel, and SAN port channel sources, the
monitored direction can only be ingress and applies to all physical ports in the group.
The rx/tx option is not available for VLAN or VSAN SPAN sessions.
– Source ports can be in the same or different VLANs or VSANs.
– For VLAN or VSAN SPAN sources, all active ports in the source VLAN or VSAN are
included as source ports.
– The switch supports a maximum of two egress SPAN source ports.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 192
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 192
Appendix: Troubleshooting FCoE
SPAN Destinations
SPAN destinations refer to the interfaces that monitors source
ports. The Cisco Nexus 5000 Series switch supports Ethernet
and Fibre Channel interfaces as SPAN destinations.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 193
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 193
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 194
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 194
Appendix: Troubleshooting FCoE
Configuration limitations:
Vlan or port-channel cannot be configured as egress source
Vlan or port-channel cannot be a monitor destination
Only two egress sources supported
Only one destination port can be configured for a session
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 195
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 195
Appendix: Troubleshooting FCoE
Wireshark
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 196
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 196
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 197
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 197
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 198
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 198
Appendix: Troubleshooting FCoE
MODES
– NTM (No Truncate Mode) - FC Frames are passed without any modifications.
– ETM (Ethernet Truncate Mode) - Truncates FC frame to max 1496B by reducing payload size from
528 lines to 368 lines.
– STM (Shallow Truncate Mode) - Truncates FC frame to max 256B by reducing payload size from 528
lines to 58 lines.
– DTM (Deep Truncate Mode) - Truncates FC frame to max 64B by reducing payload size from 528
lines to 10 lines.
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 199
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 199
Appendix: Troubleshooting FCoE
© 2008 EMC Corporation. All rights reserved. Appendix: Troubleshooting FCoE - 200
Copyright © 2008 EMC Corporation. Do not Copy - All Rights Reserved. - 200