Cisco Switch Health Monitor
Cisco Switch Health Monitor
Monitoring critical system resources is very important to maintain stability of the network. We
recommend that you monitor the switch CPU, memory, file systems, and environmental resources on a
regular basis.
This workflow discusses the commonly used commands and procedures to monitor and maintain system
health.
Step 1 Use the show version command to retrieve the overall switch status.
If you are only interested in the switch uptime and last reload, you can run a more direct command using
the pipe “|” feature built into Cisco IOS XE (and Cisco IOS) software.
This example shows that Cisco IOS XE release 3.3.2 SE was running for five weeks before a privileged
user initiated a switch reload.
Note The switch is a multicore platform that is different from its predecessors. A single core can experience
high CPU, so it is important to monitor each core when running these commands.
This output shows the five-second, one-minute, and five-minute periods on each CPU core. It also
shows the Forwarding Engine Driver (FED), IOS daemon IOSd, and Wireless Controller Module
(WCM) processes have the highest CPU utilization.
Best Practice User Guide for the Catalyst 3850 and Catalyst 3650 Switch Series
104
System Health Monitoring
Run a System Baseline for Core Resources
Step 3 Use the history command to display a graph of sustained CPU utilization.
This graph helps to formulate patterns. For example, if you observe a spike to 100 percent every 30
minutes, you can conclude that something might be polling the switch on a regular schedule. Examine
your SNMP configuration to help determine the cause.
1111122222222222222222222
111111111111111111111111111111222225555588888888886666666666
100
90
80
70
60
50
40
30 ********************
20 *************************
10 *************************
0....5....1....1....2....2....3....3....4....4....5....5....
0 5 0 5 0 5 0 5 0 5
CPU% per second (last 60 seconds)
Reference:
For detailed information to help troubleshoot your high CPU usage concerns, see the Catalyst 3850
Series Switch High CPU Usage Troubleshooting document.
Best Practice User Guide for the Catalyst 3850 and Catalyst 3650 Switch Series
105
System Health Monitoring
Run a System Baseline for Core Resources
Note An (*) asterisk indicates the default file system. If the file system has a dash (-) or a zero (0) for the
Size(b) field, that indicates that the file system is not present or not recognized.
Step 6 Use the dir filesystem or the show filesystem command to list the files under a specific files system.
When you find crash files, it is important to immediately retrieve them to diagnose a system failure or
unexpected crash.
Best Practice User Guide for the Catalyst 3850 and Catalyst 3650 Switch Series
106
System Health Monitoring
Run a System Baseline for Environmental Resources
This example shows that crash files were created in the directory.
dir crashinfo
Directory of crashinfo:/
Step 8 If your switches are in a stack, run the show environment stack command to view all of the
environmental outputs stack wide.
Although some of settings are adjustable, we recommend leaving the settings with their default values.
Best Practice User Guide for the Catalyst 3850 and Catalyst 3650 Switch Series
107
System Health Monitoring
Other System Monitoring Considerations
Best Practice User Guide for the Catalyst 3850 and Catalyst 3650 Switch Series
108
System Health Monitoring
Other System Monitoring Considerations
Step 10 Use the show spanning-tree detail command to frequently check STP stability.
This command displays network stability information about the number of topology changes within each
VLAN, the last time a TCN was received, and so forth. Frequently monitoring this information is critical
to maintaining overall health of the switch and network.
Best Practice User Guide for the Catalyst 3850 and Catalyst 3650 Switch Series
109
System Health Monitoring
Other System Monitoring Considerations
Best Practice User Guide for the Catalyst 3850 and Catalyst 3650 Switch Series
110