Integrated Management Module Error Messages
Integrated Management Module Error Messages
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Message Severity Description Action
Numeric sensor Ambient Error An upper critical sensor going high Reduce the ambient temperature.
Temp going high (upper has asserted.
critical) has asserted.
Numeric sensor Ambient Error An upper nonrecoverable sensor Reduce the ambient temperature.
Temp going high (upper going high has asserted.
non-recoverable) has
asserted.
Numeric sensor Planar 3.3V Error A lower critical sensor going low (Trained service technician only)
going low (lower critical) has asserted. Replace the system board.
has asserted.
Numeric sensor Planar 3.3V Error An upper critical sensor going high (Trained service technician only)
going high (upper critical) has asserted. Replace the system board.
has asserted.
Numeric sensor Planar 5V Error A lower critical sensor going low (Trained service technician only)
going low (lower critical) has asserted. Replace the system board.
has asserted.
Numeric sensor Planar 5V Error An upper critical sensor going high (Trained service technician only)
going high (upper critical) has asserted. Replace the system board.
has asserted.
Numeric sensor Planar Error A lower critical sensor going low Replace the 3 V battery.
VBAT going low (lower has asserted.
critical) has asserted.
Numeric sensor Fan nA Error A lower critical sensor going low 1. Reseat the failing fan n, which
Tach going low (lower has asserted. is indicated by a lit LED near
critical) has asserted. the fan connector on the system
(n = fan number) board.
2. Replace the failing fan.
(n = fan number)
Numeric sensor Fan nB Error A lower critical sensor going low 1. Reseat the failing fan n, which
Tach going low (lower has asserted. is indicated by a lit LED near
critical) has asserted. the fan connector on the system
(n = fan number) board.
2. Replace the failing fan.
(n = fan number)
The connector System Error An interconnect configuration error Reseat the front video cable on the
board has encountered a has occurred. system board.
configuration error.
i
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Processor CPU nStatus Error A processor failed - IERR condition 1. Make sure that the latest levels
has Failed with IERR. has occurred. of firmware and device drivers
(n = microprocessor are installed for all adapters
number) and standard devices, such as
Ethernet, SCSI, and SAS.
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
2. Run the DSA program for the
hard disk drives and other I/O
devices.
3. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
An Over-Temperature Error An overtemperature condition has 1. Make sure that the fans are
Condition has been occurred for microprocessor n. operating, that there are no
detected on the Processor (n = microprocessor number) obstructions to the airflow, that
CPU nStatus. the air baffles are in place and
(n = microprocessor correctly installed, and that the
number) server cover is installed and
completely closed.
2. Make sure that the heat sink for
microprocessor nis installed
correctly.
3. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
ii
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Processor CPU nStatus Error A processor failed - FRB1/BIST 1. Check for a UEFI firmware
has Failed with FRB1/BIST condition has occurred. update.
condition. Important: Some cluster
(n = microprocessor solutions require specific code
number) levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
2. Make sure that the installed
microprocessors are compatible
with each other (see "Installing
a microprocessor and heat sink"
in the Problem Determination and
Service Guide for information
about microprocessor
requirements).
3. (Trained service technician only)
Reseat microprocessor n.
4. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
The Processor CPU nStatus Error A processor configuration 1. Make sure that the installed
has a Configuration mismatch has occurred. microprocessors are compatible
Mismatch. with each other (see "Installing
(n = microprocessor a microprocessor and heat sink"
number) in the Problem Determination and
Service Guide for information
about microprocessor
requirements).
2. (Trained service technician only)
Replace the incompatible
microprocessor.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
An SM BIOS Uncorrectable Error An SMBIOS uncorrectable CPU 1. Check for a UEFI firmware
CPU complex error for complex error has asserted. update.
Processor CPU nStatus has Important: Some cluster
asserted. solutions require specific code
(n = microprocessor levels or coordinated code
number) updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
2. Make sure that the installed
microprocessors are compatible
with each other (see "Installing
a microprocessor and heat sink"
in the Problem Determination and
Service Guide for information
about microprocessor
requirements).
3. (Trained service technician only)
Reseat microprocessor n.
4. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
Sensor CPU nOverTemp Error A sensor has changed to Critical 1. Make sure that the fans are
has transitioned to critical state from a less severe state. operating, that there are no
from a less severe state. obstructions to the airflow, that
(n = microprocessor the air baffles are in place and
number) correctly installed, and that the
server cover is installed and
completely closed.
2. Make sure that the heat sink for
microprocessor n is installed
correctly.
3. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
iv
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor CPU nOverTemp Error A sensor has changed to 1. Make sure that the fans are
has transitioned to Nonrecoverable state from a less operating, that there are no
non-recoverable from a less severe state. obstructions to the airflow, that
severe state. the air baffles are in place and
(n = microprocessor correctly installed, and that the
number) server cover is installed and
completely closed.
2. Make sure that the heat sink for
microprocessor n is installed
correctly.
3. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
Sensor CPU nOverTemp Error A sensor has changed to Critical 1. Make sure that the fans are
has transitioned to critical state from Nonrecoverable state. operating, that there are no
from a non-recoverable obstructions to the airflow, that
state. the air baffles are in place and
(n = microprocessor correctly installed, and that the
number) server cover is installed and
completely closed.
2. Make sure that the heat sink for
microprocessor nis installed
correctly.
3. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
Sensor CPU nOverTemp Error A sensor has changed to 1. Make sure that the fans are
has transitioned to Nonrecoverable state. operating, that there are no
non-recoverable. obstructions to the airflow, that
(n = microprocessor the air baffles are in place and
number) correctly installed, and that the
server cover is installed and
completely closed.
2. Make sure that the heat sink for
microprocessor nis installed
correctly.
3. (Trained service technician only)
Replace microprocessor n.
(n = microprocessor number)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A bus timeout has occurred Error A bus timeout has occurred. 1. Remove the adapter from the
on system %1. PCI slot that is indicated by a
(%1 = lit LED.
CIM_ComputerSystem.ElementName)
2. Replace the riser-card assembly.
3. Remove all PCI adapters.
4. (Trained service technicians
only) Replace the system board.
A software NMI has Error A software NMI has occurred. 1. Check the device driver.
occurred on system %1.
2. Reinstall the device driver.
(%1 =
CIM_ComputerSystem.ElementName)
The System %1 Error A POST error has occurred. 1. Recover the UEFI firmware
encountered a POST Error. (Sensor = ABR Status) from the backup page:
(%1 =
a. Restart the server.
CIM_ComputerSystem.ElementName)
b. At the prompt, press F3 to
recover the firmware.
2. Update the UEFI firmware to
the latest level.
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
The System %1 Error A POST error has occurred. 1. Update the UEFI firmware on
encountered a POST Error. (Sensor = Firmware Error) the primary page.
(%1 = Important: Some cluster
CIM_ComputerSystem.ElementName) solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
2. (Trained service technician only)
Replace the system board.
vi
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A Uncorrectable Bus Error Error A bus uncorrectable error has 1. Check the system-event log.
has occurred on system %1. occurred.
2. Check the PCI error LEDs.
(%1 = (Sensor = Critical Int PCI)
CIM_ComputerSystem.ElementName) 3. Remove the adapter from the
indicated PCI slot.
4. Check for a UEFI firmware
update.
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
5. (Trained service technician only)
Replace the system board.
A Uncorrectable Bus Error Error A bus uncorrectable error has 1. Check the system-event log.
has occurred on system %1. occurred.
2. Check the microprocessor error
(%1 = (Sensor = Critical Int CPU)
LEDs.
CIM_ComputerSystem.ElementName)
3. Remove the failing
microprocessor from the system
board.
4. Check for a UEFI firmware
update.
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
5. Make sure that the two
microprocessors are matching.
6. (Trained service technician only)
Replace the system board.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A Uncorrectable Bus Error Error A bus uncorrectable error has 1. Check the system-event log.
has occurred on system %1. occurred.
2. Check the DIMM error LEDs.
(%1 = (Sensor = Critical Int DIM)
CIM_ComputerSystem.ElementName) 3. Remove the failing DIMM from
the system board.
4. Check for a UEFI firmware
update.
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
5. Make sure that the installed
DIMMs are supported and
configured correctly.
6. (Trained service technician only)
Replace the system board.
Sensor Sys Board Fault has Error A sensor has changed to Critical 1. Check the system-event log.
transitioned to critical from state from a less severe state.
2. Check for an error LED on the
a less severe state.
system board.
3. Replace any failing device.
4. Check for a UEFI firmware
update.
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
5. (Trained service technician only)
Replace the system board.
viii
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Power Supply (Power Error Power supply nhas failed. 1. If the power-on LED is lit,
Supply: n) has Failed. (n = power supply number) complete the following steps:
(n = power supply number)
a. Reduce the server to the
minimum configuration.
b. Reinstall the components
one at a time, restarting the
server each time.
c. If the error recurs, replace
the component that you just
reinstalled.
2. Reseat power supply n.
3. Replace power supply n.
(n = power supply number)
Sensor PS n Fan Fault has Error A sensor has changed to Critical 1. Make sure that there are no
transitioned to critical from state from a less severe state. obstructions, such as bundled
a less severe state. cables, to the airflow from the
(n = power supply number) power-supply fan.
2. Replace power supply n.
(n = power supply number)
Sensor VT Fault has Error A sensor has changed to 1. Check the power-supply LEDs.
transitioned to Nonrecoverable state.
2. Follow the actions in "Power
non-recoverable.
supply LEDs" in the Problem
Determination and Service Guide.
3. Replace the failing power
supply.
4. (Trained service technician only)
Replace the system board.
Sensor Pwr Rail A Fault Error A sensor has changed to 1. Turn off the server and
has transitioned to Nonrecoverable state. disconnect it from power.
non-recoverable.
2. Remove the optical drive, fans,
hard disk drives, and hard disk
drive backplane.
3. Restart the server.
4. Reinstall each device, one at a
time, starting the server each
time to isolate the failing
device.
5. Replace the failing device.
6. (Trained service technician only)
Replace the system board.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor Pwr Rail B Fault has Error A sensor has changed to 1. Turn off the server and
transitioned to Nonrecoverable state. disconnect it from power.
non-recoverable.
2. Remove the optical drive, fans,
hard disk drives, and hard disk
drive backplane.
3. Restart the server.
4. Reinstall each device, one at a
time, starting the server each
time to isolate the failing
device.
5. Replace the failing device.
6. (Trained service technician only)
Replace the system board.
Sensor Pwr Rail C Fault Error A sensor has changed to 1. Turn off the server and
has transitioned to Nonrecoverable state. disconnect it from power.
non-recoverable.
2. (Trained service technician only)
Remove the SAS/SATA RAID
riser card, the DIMMs in
connectors 1 through 8, and the
microprocessor in socket 1.
3. Restart the server.
4. Reinstall each device, one at a
time, starting the server each
time to isolate the failing
device.
5. Replace the failing device.
6. (Trained service technician only)
Replace the system board.
Sensor Pwr Rail D Fault Error A sensor has changed to 1. Turn off the server and
has transitioned to Nonrecoverable state. disconnect it from power.
non-recoverable.
2. (Trained service technician only)
Remove the microprocessor
from socket 1.
3. Restart the server.
4. Reinstall the microprocessor in
socket 1 and restart the server.
5. (Trained service technician only)
Replace the failing
microprocessor.
6. (Trained service technician only)
Replace the system board.
x
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor Pwr Rail E Fault has Error A sensor has changed to 1. Turn off the server and
transitioned to Nonrecoverable state. disconnect it from power.
non-recoverable.
2. (Trained service technician only)
Remove the PCI riser card from
PCI riser-card connector 2 and
the microprocessor from socket
2.
3. Restart the server.
4. Reinstall each device, one at a
time, starting the server each
time to isolate the failing
device.
5. Replace the failing device.
6. (Trained service technician only)
Replace the system board.
Sensor PS n Therm Fault Error A sensor has changed to Critical 1. Make sure that there are no
has transitioned to critical state from a less severe state. obstructions, such as bundled
from a less severe state. cables, to the airflow from the
(n = power supply number) power-supply fan.
2. Replace power supply n.
(n = power supply number)
Redundancy Cooling Zone Error Redundancy has been lost and is 1. Make sure that the connectors
1 has been reduced. insufficient to continue operation. on fans 1 and 2 are not
damaged.
2. Make sure that the fan 1 and 2
connectors on the system board
are not damaged.
3. Make sure that the fans are
correctly installed.
4. Reseat the fans.
5. Replace the fans.
Sensor RAID Error has Error A sensor has changed to Critical 1. Check the hard disk drive
transitioned to critical from state from a less severe state. LEDs.
a less severe state.
2. Reseat the hard disk drive for
which the status LED is lit.
3. Replace the defective hard disk
drive.
The Drive n Status has Error A drive has been removed. Reseat hard disk drive n.
been removed from unit (n = hard disk drive number)
Drive 0 Status.
(n = hard disk drive
number)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Drive n Status has Error A drive has been disabled because 1. Run the hard disk drive
been disabled due to a of a fault. diagnostic test on drive n.
detected fault.
2. Reseat the following
(n = hard disk drive
components:
number)
a. Hard disk drive
b. Cable from the system
board to the backplane
3. Replace the following
components one at a time, in
the order shown, restarting the
server each time:
a. Hard disk drive
b. Cable from the system
board to the backplane
c. Hard disk drive backplane
(n = hard disk drive number)
Array %1 is in critical Error An array is in Critical state. Replace the hard disk drive that is
condition. (Sensor = Drive n Status) indicated by a lit status LED.
(%1 = (n = hard disk drive number)
CIM_ComputerSystem.ElementName)
Array %1 has failed. Error An array is in Failed state. Replace the hard disk drive that is
(%1 = (Sensor = Drive n Status) indicated by a lit status LED.
CIM_ComputerSystem.ElementName) (n = hard disk drive number)
Memory uncorrectable Error A memory uncorrectable error has 1. If the server failed the POST
error detected for DIMM occurred. memory test, reseat the DIMMs.
All DIMMs on Memory
2. Replace any DIMM that is
Subsystem All DIMMs.
indicated by a lit error LED.
Note: You do not have to
replace DIMMs by pairs.
3. Run the Setup utility to enable
all the DIMMs.
4. Run the DSA memory test.
xii
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory Logging Limit Error The memory logging limit has been 1. Update the UEFI to the latest
Reached for DIMM All reached. level.
DIMMs on Memory Important: Some cluster
Subsystem All DIMMs. solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
2. Reseat the DIMMs and run the
DSA memory test.
3. Replace any DIMM that is
indicated by a lit error LED.
Memory DIMM Error A DIMM configuration error has Make sure that DIMMs are
Configuration Error for All occurred. installed in the correct sequence
DIMMs on Memory and have the same size, type,
Subsystem All DIMMs. speed, and technology.
Memory uncorrectable Error A memory uncorrectable error has 1. If the server failed the POST
error detected for DIMM occurred. memory test, reseat the DIMMs.
One of the DIMMs on
2. Replace any DIMM that is
Memory Subsystem One of
indicated by a lit error LED.
the DIMMs.
Note: You do not have to
replace DIMMs by pairs.
3. Run the Setup utility to enable
all the DIMMs.
4. Run the DSA memory test.
Memory Logging Limit Error The memory logging limit has been 1. Update the UEFI to the latest
Reached for DIMM One of reached. level.
the DIMMs on Memory Important: Some cluster
Subsystem One of the solutions require specific code
DIMMs. levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
2. Reseat the DIMMs and run the
DSA memory test.
3. Replace any DIMM that is
indicated by a lit error LED.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory DIMM Error A DIMM configuration error has Make sure that DIMMs are
Configuration Error for occurred. installed in the correct sequence
One of the DIMMs on and have the same size, type,
Memory Subsystem One of speed, and technology.
the DIMMs.
Memory uncorrectable Error A memory uncorrectable error has 1. If the server failed the POST
error detected for DIMM n occurred. memory test, reseat the DIMMs.
Status on Memory
2. Replace any DIMM that is
Subsystem DIMM n Status.
indicated by a lit error LED.
(n = DIMM number)
Note: You do not have to
replace DIMMs by pairs.
3. Run the Setup utility to enable
all the DIMMs.
4. Run the DSA memory test.
5. (Trained service technician only)
Replace the system board.
Memory Logging Limit Error The memory logging limit has been 1. Update the UEFI to the latest
Reached for DIMM nStatus reached. level.
on Memory Subsystem Important: Some cluster
DIMMnStatus. solutions require specific code
(n = DIMM number) levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
2. Reseat the DIMMs and run the
DSA memory test.
3. Replace any DIMM that is
indicated by a lit error LED.
Memory DIMM Error A DIMM configuration error has Make sure that DIMMs are
Configuration Error for occurred. installed in the correct sequence
DIMM nStatus on Memory and have the same size, type,
Subsystem DIMM nStatus. speed, and technology.
(n = DIMM number)
xiv
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor DIMM n Temp has Error A sensor has changed to Critical 1. Make sure that the fans are
transitioned to critical from state from a less severe state. operating, that there are no
a less severe state. obstructions to the airflow, that
(n = DIMM number) the air baffles are in place and
correctly installed, and that the
server cover is installed and
completely closed.
2. If a fan has failed, complete the
action for a fan failure.
3. Replace DIMM n.
(n = DIMM number)
A PCI PERR has occurred Error A PCI PERR has occurred. 1. Check the riser-card LEDs.
on system %1. (Sensor = All PCI Err)
2. Reseat the affected adapters and
(%1 =
riser card.
CIM_ComputerSystem.ElementName)
3. Update the server and adapter
firmware (UEFI and IMM).
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
4. Remove both adapters.
5. Replace the PCIe adapter.
6. Replace the riser card.
A PCI SERR has occurred Error A PCI SERR has occurred. 1. Check the riser-card LEDs.
on system %1. (Sensor = All PCI Err)
2. Reseat the affected adapters and
(%1 =
riser card.
CIM_ComputerSystem.ElementName)
3. Update the server and adapter
firmware (UEFI and IMM).
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
4. Remove both adapters.
5. Replace the PCIe adapter.
6. Replace the riser card.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI PERR has occurred Error A PCI PERR has occurred. 1. Check the riser-card LEDs.
on system %1. (Sensor = PCI Slot n; n = PCI slot
2. Reseat the affected adapters and
(%1 = number)
riser card.
CIM_ComputerSystem.ElementName)
3. Update the server and adapter
firmware (UEFI and IMM).
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
4. Remove the adapter from slot n.
5. Replace the PCIe adapter.
6. Replace riser card n.
(n = PCI slot number)
A PCI SERR has occurred Error A PCI SERR has occurred. 1. Check the riser-card LEDs.
on system %1. (Sensor = PCI Slot n; n = PCI slot
2. Reseat the affected adapters and
(%1 = number)
riser card.
CIM_ComputerSystem.ElementName)
3. Update the server and adapter
firmware (UEFI and IMM).
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
4. Remove the adapter from slot n.
5. Replace the PCIe adapter.
6. Replace riser card n.
(n = PCI slot number)
xvi
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI PERR has occurred Error A PCI PERR has occurred. 1. Check the riser-card LEDs.
on system %1. (Sensor = One of PCI Err)
2. Reseat the affected adapters and
(%1 =
riser card.
CIM_ComputerSystem.ElementName)
3. Update the server and adapter
firmware (UEFI and IMM).
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
4. Remove both adapters.
5. Replace the PCIe adapter.
6. Replace the riser card.
7. (Trained service technician only)
Replace the system board.
A PCI SERR has occurred Error A PCI SERR has occurred. 1. Check the riser-card LEDs.
on system %1. (Sensor = One of PCI Err)
2. Reseat the affected adapters and
(%1 =
riser card.
CIM_ComputerSystem.ElementName)
3. Update the server and adapter
firmware (UEFI and IMM).
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
4. Remove both adapters.
5. Replace the PCIe adapter.
6. Replace the riser card.
7. (Trained service technician only)
Replace the system board.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Fault in slot System board Error 1. Check the riser-card LEDs.
on system %1.
2. Reseat the affected adapters and
(%1 =
riser card.
CIM_ComputerSystem.ElementName)
3. Update the server and adapter
firmware (UEFI and IMM).
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
4. Remove both adapters.
5. Replace the PCIe adapter.
6. Replace the riser card.
7. (Trained service technician only)
Replace the system board.
Redundancy Bckup Mem Error Redundancy has been lost and is 1. Check the system-event log for
Status has been reduced. insufficient to continue operation. DIMM failure events
(uncorrectable or PFA) and
correct the failures.
2. Re-enable mirroring in the
Setup utility.
Sensor Planar Fault has Error A sensor has changed to Critical (Trained service technician only)
transitioned to critical from state from a less severe state. Replace the system board.
a less severe state.
IMM Network Initialization Info An IMM network has completed No action; information only.
Complete. initialization.
Certificate Authority %1 Error A problem has occurred with the 1. Make sure that the certificate
has detected a %2 SSL Server, SSL Client, or SSL that you are importing is
Certificate Error. Trusted CA certificate that has been correct.
(%1 = imported into the IMM. The
2. Try importing the certificate
imported certificate must contain a
IBM_CertificateAuthority.CADistinguishedName;
again.
%2 = public key that corresponds to the
CIM_PublicKeyCertificate.ElementName) key pair that was previously
generated by the Generate a New
Key and Certificate Signing
Request link.
xviii
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Ethernet Data Rate Info A user has modified the Ethernet No action; information only.
modified from %1 to %2 by port data rate.
user %3.
(%1 =
CIM_EthernetPort.Speed;
%2 =
CIM_EthernetPort.Speed;
%3 = user ID)
Ethernet Duplex setting Info A user has modified the Ethernet No action; information only.
modified from %1 to %2 by port duplex setting.
user %3.
(%1 =
CIM_EthernetPort.FullDuplex;
%2 =
CIM_EthernetPort.FullDuplex;
%3 = user ID)
Ethernet MTU setting Info A user has modified the Ethernet No action; information only.
modified from %1 to %2 by port MTU setting.
user %3.
(%1 =
CIM_EthernetPort.ActiveMaximumTransmissionUnit;
%2 =
CIM_EthernetPort.ActiveMaximumTransmissionUnit;
%3 = user ID)
Ethernet Duplex setting Info A user has modified the Ethernet No action; information only.
modified from %1 to %2 by port MAC address setting.
user %3.
(%1 =
CIM_EthernetPort.NetworkAddresses;
%2 =
CIM_EthernetPort.NetworkAddresses;
%3 = user ID)
Ethernet interface %1 by Info A user has enabled or disabled the No action; information only.
user %2. Ethernet interface.
(%1 =
CIM_EthernetPort.EnabledState;
%2 = user ID)
Hostname set to %1 by Info A user has modified the host name No action; information only.
user %2. of the IMM.
(%1 =
CIM_DNSProtocolEndpoint.Hostname;
%2 = user ID)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
IP address of network Info A user has modified the IP address No action; information only.
interface modified from %1 of the IMM.
to %2 by user %3.
(%1 =
CIM_IPProtocolEndpoint.IPv4Address;
%2 =
CIM_StaticIPAssignmentSettingData.IPAddress;
%3 = user ID)
IP subnet mask of network Info A user has modified the IP subnet No action; information only.
interface modified from %1 mask of the IMM.
to %2 by user %3s.
(%1 =
CIM_IPProtocolEndpoint.SubnetMask;
%2 =
CIM_StaticIPAssignmentSettingData.SubnetMask;
%3 = user ID)
IP address of default Info A user has modified the default No action; information only.
gateway modified from %1 gateway IP address of the IMM.
to %2 by user %3s.
(%1 =
CIM_IPProtocolEndpoint.GatewayIPv4Address;
%2 =
CIM_StaticIPAssignmentSettingData.DefaultGatewayAddress;
%3 = user ID)
OS Watchdog response %1 Info A user has enabled or disabled an No action; information only.
by %2. OS Watchdog.
(%1 = Enabled or Disabled;
%2 = user ID)
DHCP[%1] failure, no IP Info A DHCP server has failed to assign 1. Make sure that the network
address assigned. an IP address to the IMM. cable is connected.
(%1 = IP address,
2. Make sure that there is a DHCP
xxx.xxx.xxx.xxx)
server on the network that can
assign an IP address to the
IMM.
Remote Login Successful. Info A user has successfully logged in No action; information only.
Login ID: %1 from %2 at IP to the IMM.
address %3.
(%1 = user ID; %2 =
ValueMap(CIM_ProtocolEndpoint.ProtocolIFType;
%3 = IP address,
xxx.xxx.xxx.xxx)
Attempting to %1 server Info A user has used the IMM to No action; information only.
%2 by user %3. perform a power function on the
(%1 = Power Up, Power server.
Down, Power Cycle, or
Reset; %2 =
IBM_ComputerSystem.ElementName;
%3 = user ID)
xx
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Security: Userid: '%1' had Error A user has exceeded the maximum 1. Make sure that the correct login
%2 login failures from WEB number of unsuccessful login ID and password are being
client at IP address %3. attempts from a Web browser and used.
(%1 = user ID; %2 = has been prevented from logging in
2. Have the system administrator
MaximumSuccessiveLoginFailures for the lockout period.
reset the login ID or password.
(currently set to 5 in the
firmware); %3 = IP address,
xxx.xxx.xxx.xxx)
Security: Login ID: '%1' had Error A user has exceeded the maximum 1. Make sure that the correct login
%2 login failures from CLI number of unsuccessful login ID and password are being
at %3. attempts from the command-line used.
(%1 = user ID; %2 = interface and has been prevented
2. Have the system administrator
MaximumSuccessiveLoginFailures from logging in for the lockout
reset the login ID or password.
(currently set to 5 in the period.
firmware); %3 = IP address,
xxx.xxx.xxx.xxx)
Remote access attempt Error A user has attempted to log in 1. Make sure that the correct login
failed. Invalid userid or from a Web browser by using an ID and password are being
password received. Userid invalid login ID or password. used.
is '%1' from WEB browser
2. Have the system administrator
at IP address %2.
reset the login ID or password.
(%1 = user ID; %2 = IP
address, xxx.xxx.xxx.xxx)
Remote access attempt Error A user has attempted to log in 1. Make sure that the correct login
failed. Invalid userid or from a Telnet session by using an ID and password are being
password received. Userid invalid login ID or password. used.
is '%1' from TELNET client
2. Have the system administrator
at IP address %2.
reset the login ID or password.
(%1 = user ID; %2 = IP
address, xxx.xxx.xxx.xxx)
The Chassis Event Log Info A user has cleared the IMM event No action; information only.
(CEL) on system %1 log.
cleared by user %2.
(%1 =
CIM_ComputerSystem.ElementName;
%2 = user ID)
IMM reset was initiated by Info A user has initiated a reset of the No action; information only.
user %1. IMM.
(%1 = user ID)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
ENET[0] DHCP-HSTN=%1, Info The DHCP server has assigned an No action; information only.
DN=%2, IP@=%3, SN=%4, IMM IP address and configuration.
GW@=%5, DNS1@=%6.
(%1 =
CIM_DNSProtocolEndpoint.Hostname;
%2 =
CIM_DNSProtocolEndpoint.DomainName;
%3 =
CIM_IPProtocolEndpoint.IPv4Address;
%4 =
CIM_IPProtocolEndpoint.SubnetMask;
%5 = IP address,
xxx.xxx.xxx.xxx; %6 = IP
address, xxx.xxx.xxx.xxx)
ENET[0] Info An IMM IP address and No action; information only.
IP-Cfg:HstName=%1, configuration have been assigned
IP@%2, NetMsk=%3, using client data.
GW@=%4.
(%1 =
CIM_DNSProtocolEndpoint.Hostname;
%2 =
CIM_StaticIPSettingData.IPv4Address;
%3 =
CIM_StaticIPSettingData.SubnetMask;
%4 =
CIM_StaticIPSettingData.DefaultGatewayAddress)
LAN: Ethernet[0] interface Info The IMM Ethernet interface has No action; information only.
is no longer active. been disabled.
LAN: Ethernet[0] interface Info The IMM Ethernet interface has No action; information only.
is now active. been enabled.
DHCP setting changed to Info A user has changed the DHCP No action; information only.
by user %1. mode.
(%1 = user ID)
IMM: Configuration %1 Info A user has restored the IMM No action; information only.
restored from a configuration by importing a
configuration file by user configuration file.
%2.
(%1 =
CIM_ConfigurationData.ConfigurationName;
%2 = user ID)
xxii
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Watchdog %1 Screen Error An operating-system error has 1. Reconfigure the watchdog timer
Capture Occurred. occurred, and the screen capture to a higher value.
(%1 = OS Watchdog or was successful.
2. Make sure that the IMM
Loader Watchdog)
Ethernet over USB interface is
enabled.
3. Reinstall the RNDIS or
cdc_ether device driver for the
operating system.
4. Disable the watchdog.
5. Check the integrity of the
installed operating system.
Watchdog %1 Failed to Error An operating-system error has 1. Reconfigure the watchdog timer
Capture Screen. occurred, and the screen capture to a higher value.
(%1 = OS Watchdog or failed.
2. Make sure that the IMM
Loader Watchdog)
Ethernet over USB interface is
enabled.
3. Reinstall the RNDIS or
cdc_ether device driver for the
operating system.
4. Disable the watchdog.
5. Check the integrity of the
installed operating system.
6. Update the IMM firmware.
Important: Some cluster
solutions require specific code
levels or coordinated code
updates. If the device is part of
a cluster solution, verify that
the latest level of code is
supported for the cluster
solution before you update the
code.
Running the backup IMM Error The IMM has resorted to running Update the IMM firmware.
main application. the backup main application. Important: Some cluster solutions
require specific code levels or
coordinated code updates. If the
device is part of a cluster solution,
verify that the latest level of code is
supported for the cluster solution
before you update the code.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Please ensure that the IMM Error The server does not support the Update the IMM firmware to a
is flashed with the correct installed IMM firmware version. version that the server supports.
firmware. The IMM is Important: Some cluster solutions
unable to match its require specific code levels or
firmware to the server. coordinated code updates. If the
device is part of a cluster solution,
verify that the latest level of code is
supported for the cluster solution
before you update the code.
IMM reset was caused by Info The IMM has been reset because a No action; information only.
restoring default values. user has restored the configuration
to its default settings.
IMM clock has been set Info The IMM clock has been set to the No action; information only.
from NTP server %1. date and time that is provided by
(%1 = the Network Time Protocol server.
IBM_NTPService.ElementName)
SSL data in the IMM Error There is a problem with the 1. Make sure that the certificate
configuration data is certificate that has been imported that you are importing is
invalid. Clearing into the IMM. The imported correct.
configuration data region certificate must contain a public
2. Try to import the certificate
and disabling SSL+H25. key that corresponds to the key
again.
pair that was previously generated
through the Generate a New Key
and Certificate Signing Request
link.
Flash of %1 from %2 Info A user has successfully updated No action; information only.
succeeded for user %3. one of the following firmware
(%1 = components:
CIM_ManagedElement.ElementName; v IMM main application
%2 = Web or LegacyCLI;
v IMM boot ROM
%3 = user ID)
v UEFI firmware
v Diagnostics
v System power backplane
v Remote expansion enclosure
power backplane
v Integrated service processor
v Remote expansion enclosure
processor
Flash of %1 from %2 failed Info An attempt to update a firmware Try to update the firmware again.
for user %3. component from the interface and
(%1 = IP address has failed.
CIM_ManagedElement.ElementName;
%2 = Web or LegacyCLI;
%3 = user ID)
xxiv
Table 1. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See PARTS LISTING in the Problem Determination and Service Guide to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Chassis Event Log Info The IMM event log is 75% full. To avoid losing older log entries,
(CEL) on system %1 is 75% When the log is full, older log save the log as a text file and clear
full. entries are replaced by newer ones. the log.
(%1 =
CIM_ComputerSystem.ElementName)
The Chassis Event Log Info The IMM event log is full. When To avoid losing older log entries,
(CEL) on system %1 is the log is full, older log entries are save the log as a text file and clear
100% full. replaced by newer ones. the log.
(%1 =
CIM_ComputerSystem.ElementName)
%1 Platform Watchdog Error A Platform Watchdog Timer 1. Reconfigure the watchdog timer
Timer expired for %2. Expired event has occurred. to a higher value.
(%1 = OS Watchdog or
2. Make sure that the IMM
Loader Watchdog; %2 = OS
Ethernet over USB interface is
Watchdog or Loader
enabled.
Watchdog)
3. Reinstall the RNDIS or
cdc_ether device driver for the
operating system.
4. Disable the watchdog.
5. Check the integrity of the
installed operating system.
IMM Test Alert Generated Info A user has generated a test alert No action; information only.
by %1. from the IMM.
(%1 = user ID)
Security: Userid: '%1' had Error A user has exceeded the maximum 1. Make sure that the correct login
%2 login failures from an number of unsuccessful login ID and password are being
SSH client at IP address attempts from SSH and has been used.
%3. prevented from logging in for the
2. Have the system administrator
(%1 = user ID; %2 = lockout period.
reset the login ID or password.
MaximumSuccessiveLoginFailures
(currently set to 5 in the
firmware); %3 = IP address,
xxx.xxx.xxx.xxx)