01SV860 063 056.readme
01SV860 063 056.readme
01SV860 063 056.readme
--------------------------------------------------------------------------------
--
Contents
For IBM i customers who have systems with machine type model 8286-41A or
8286-42A, firmware update has a prerequisite on partitions running IBM i
operating system that own physical I/O.
For IBM i 7.1, the following minimum code levels are prerequisites:
IBM i 7.1 TR PTF Group SF99707 Level 9 + Cumulative PTF Package C4283710 +
HIPER PTF Group
For IBM i 7.2, the following minimum code levels are prerequisites:
IBM i 7.2 TR PTF Group SF99717 Level 1 + Cumulative PTF Pacakge C4276720 +
HIPER PTF Group
For IBM i 7.3,
- All IBM i 7.3 code levels are compatible with this firmware update.
Note 1: These code levels are not a requirement for IBM i partitions that are
a client of VIOS.
Note 2: These IBM i code levels are listed as prerequisites for the feature
code EMX0 expansion drawer. If this firmware release has already been applied,
the above IBM i code level should be applied on IBM i partitions in order to
maintain system stability.
Note: The file names and service pack levels used in the following examples
are for clarification only, and are not necessarily levels that have been, or
will be released.
System firmware file naming convention:
01SVxxx_yyy_zzz
* The service pack level (yyy) and the last disruptive service pack level
(zzz) are the same. Example: SV830_040_040 is disruptive, no
matter what level of SV830 is currently installed on the system.
* The service pack level (yyy) currently installed on the system is lower
than the last disruptive service pack level (zzz) of the service pack to be
installed. Example: Currently installed service pack is
SV830_040_040 and new service pack is SV830_050_045.
An installation is concurrent if:
The release level (xxx) is the same, and
The service pack level (yyy) currently installed on the system is the same or
higher than the last disruptive service pack level (zzz) of the service pack to
be installed.
Example: Currently installed service pack is SV830_040_040, new service pack
is SV830_071_040.
3.1 Firmware Information and Description
* DEFERRED: A problem was fixed for a Field Core Override (FCO) error that
causes a processor chip without functional cores to be guarded with a SRC
B111BA24 error logged and by guard association causes all the memory and I/O
resources behind the processor chip to be lost for the current IPL. This
problem is triggered by a system being manufactured with one or more feature
codes of #2319 (Factory Deconfiguration of 1-core) to assist with optimization
of software licensing. For more information on Field Core Override, refer to
IBM Knowledge Center:
http://www.ibm.com/support/knowledgecenter/POWER8/p8hby/fieldcore.htm. The
error only occurs in systems where the total number of active cores is less
than the number of processor chips. When the fix is applied on a system that
has lost memory or I/O resources due to the errant processor guard, the system
must be re-IPLed with the guard removed from the processor to recover the
resources.
Without the fix, the problem may be circumvented by the following four steps:
1) Power off the system.
2) Use the Field Core Override function to increase the number of active
processor cores in the system. The Advanced System Management Interface (ASMI)
"System Configuration -> Hardware Deconfiguration -> Field Core Override" panel
shows the number of cores that are active in the system and it can be used to
increase the number of active processor cores in the system.
3) Unguard the failed processor. Use the ASMI "System Configuration ->
Hardware Deconfiguration -> Clear All Deconfiguration Errors" panel to restore
the guarded processor.
4) IPL with the increased number of active processor cores and the unguarded
processor.
This problem does not pertain to the IBM Power System E850 (8408-44E) model.
SV860_056_056 / FW860.10
11/18/16 Impact: New Severity: New
The folllowing pertains to Power System S812L (8247-21L), Power System S822L
(8247-22L), Power System S824L (8247-42L), Power System S822 (8284-22A), Power
System S814 (8286-41A), Power System S824 (8286-42A) andPower System E850C
(8408-44E)servers only. New features and functions
* A problem was fixed for a failed IPL with SRC UE BC8A090F that does not
have a hardware callout or a guard of the failing hardware. The system may be
recovered by guarding out the processor associated with the error and re-IPLing
the system. With the fix, the bad processor core is guarded and the system is
able to IPL.
* A problem was fixed for an Operations Panel Function 04 (Lamp test) during
an IPL causing the IPL to fail. With the fix, the lamp test request is
rejected during the IPL until the hypervisor is available. The lamp test can
be requested without problems anytime after the system is powered on to
hypervisor ready or an OS is running in a partition.
* A problem was fixed for On-Chip Controller (OCC) errors that had excessive
callouts for processor FRUs. Many of the OCC errors are recoverable and do not
required that the processor be called out and guarded. With the fix, the
processors will only be called out for OCC errors if there are three or more
OCC failures during a time period of a week.
* A problem was fixed for the On-Chip Controller (OCC) incorrectly calling
out processors with SRC B1112A16 for L4 Cache DIMM failures with SRC B124E504.
This false error logging can occur if the DIMM slot that is failing is adjacent
to two unoccupied DIMM slots.
* A problem was fixed for device time outs during a IPL logged with a SRC
B18138B4. This error is intermittent and no action is needed for the error
log. The service processor hardware server has allotted more time of the
device transactions to allow the transactions to complete without a time-out
error.
* Support for 6 core processor with FC #8A2225 and CCIN 54E1 extended for
use in the Power System S822L (8247-22L). Support was already in place for
this processor since FW810.20 for the S822 (8284-22A).
* For the IBM Power System E850 (8408-44E) system, a problem was fixed for
the incorrect values for the Idle Power Saver (IPS) mode call home data. The
call home "max" is reported much lower numbers than what the On-chip
Controllers (OCC) read for the IPS. This problem only affects 4-socket systems
as it is caused by an integer overflow of the summation of the IPS value from
all OCCs in the system.
System firmware changes that affect certain systems
* DISRUPTIVE: On systems using the PowerVM firmware, a problem was fixed for
an "Incomplete" state caused by initiating a resource dump with selector macros
from NovaLink (vio -dump -lp 1 -fr). The failure causes a communication
process stack frame, HVHMCCMDRTRTASK, size to be exceeded with a hypervisor
page fault that disrupts the NovalLink and/or HMC communications. The recovery
action is to re-IPL the CEC but that will need to be done without the
assistance of the management console. For each partition that has a OS running
on the system, shut down each partition from the OS. Then from the Advanced
System Management Interface (ASMI), power off the managed system.
Alternatively, the system power button may also be used to do the power off.
If the management console Incomplete state persists after the power off, the
managed system should be rebuilt from the management console. For more
information on management console recovery steps, refer to this IBM Knowledge
Center link:
https://www.ibm.com/support/knowledgecenter/en/POWER7/p7eav/aremanagedsystemstat
e_incomplete.htm
. The fix is disruptive because the size of the PowerVM hypervisor must be
increased to accommodate the over-sized stack frame of the failing task.
* DEFERRED: On systems using the PowerVM firmware, a problem was fixed for a
CAPI function unavailable condition on a system with the maximum number of CAPI
adapters and partitions. Not enough bytes were allocated for CAPI for the
maximum configuration case. The problem may be circumvented by reducing the
number of active partitions or CAPI adapters. The fix is deferred because the
size of the hypervisor must be increased to provide the additional CAPI space.
* DEFERRED: On systems using PowerVM firmware, a problem was fixed for
cable card capable PCI slots that fail during the IPL. Hypervisor I/O Bus
Interface UE B7006A84 is reported for each cable card capable PCI slot that
doesn't contain a PCIe3 Optical Cable Adapter for the PCIe Expansion Drawer
(feature code #EJ05). PCI slots containing a cable card will not report an
error but will not be functional. The problem can be resolved by performing an
AC cycle of the system. The trigger for the failure is the I2C devices used to
detect the cable cards are not coming out of the power on reset process in the
correct state due to a race condition.
* On systems using PowerVM firmware, a problem was fixed for network issues,
causing critical situations for customers, when an SR-IOV logical port or vNIC
is configured with a non-zero Port VLAN ID (PVID). This fix updates adapter
firmware to 10.2.252.1922, for the following Feature Codes: EN15, EN16, EN17,
EN18, EN0H, EN0J, EL38, EN0M, EN0N, EN0K, EN0L, and EL3C.
The SR-IOV adapter firmware level update for the shared-mode adapters happens
under user control to prevent unexpected temporary outages on the adapters. A
system reboot will update all SR-IOV shared-mode adapters with the new firmware
level. In addition, when an adapter is first set to SR-IOV shared mode, the
adapter firmware is updated to the latest level available with the system
firmware (and it is also updated automatically during maintenance operations,
such as when the adapter is stopped or replaced). And lastly, selective manual
updates of the SR-IOV adapters can be performed using the Hardware Management
Console (HMC). To selectively update the adapter firmware, follow the steps
given at the IBM Knowledge Center for using HMC to make the updates:
https://www.ibm.com/support/knowledgecenter/HW4M4/p8efd/p8efd_updating_sriov_fir
mware.htm
.
Note: Adapters that are capable of running in SR-IOV mode, but are currently
running in dedicated mode and assigned to a partition, can be updated
concurrently either by the OS that owns the adapter or the managing HMC (if OS
is AIX or VIOS and RMC is running).
* On systems using the PowerVM firmware, a problem was fixed for a Live
Partition Mobility migration that resulted in the source managed system going
to the management console Incomplete state after the migration to the target
system was completed. This problem is very rare and has only been detected
once.. The problem trigger is that the source partition does not halt execution
after the migration to the target system. The management console went to the
Incomplete state for the source managed system when it failed to delete the
source partition because the partition would not stop running. When this
problem occurred, the customer network was running very slowly and this may
have contributed to the failure. The recovery action is to re-IPL the source
system but that will need to be done without the assistance of the management
console. For each partition that has a OS running on the source system, shut
down each partition from the OS. Then from the Advanced System Management
Interface (ASMI), power off the managed system. Alternatively, the system
power button may also be used to do the power off. If the management console
Incomplete state persists after the power off, the managed system should be
rebuilt from the management console. For more information on management
console recovery steps, refer to this IBM Knowledge Center link:
https://www.ibm.com/support/knowledgecenter/en/POWER7/p7eav/aremanagedsystemstat
e_incomplete.htm
* On systems using the PowerVM firmware, a fix was made to provide an option
to change the ordering of PCIe Host Bridge (PHB) devices on Power 8 systems to
match the discovery order on Power 7 systems.
* On systems using PowerVM firmware, a problem was fixed for a shared
processor pool partition showing an incorrect zero "Available Pool Processor"
(APP) value after a concurrent firmware update. The zero APP value means that
no idle cycles are present in the shared processor pool but in this case it
stays zero even when idle cycles are available. This value can be displayed
using the AIX "lparstat" command. If this problem is encountered, the
partitions in the affected shared processor pool can be dynamically moved to a
different shared processor pool. Before the dynamic move, the "uncapped"
partitions should be changed to "capped" to avoid a system hang. The old
affected pool would continue to have the APP error until the system is re-IPLed.
* On systems using PowerVM firmware, a problem was fixed for a latency time
of about 2 seconds being added to a target Live Partition Mobility (LPM)
migration system when there is a latency time check failure. With the fix, in
the case of a latency time check failure, a much smaller default latency is
used instead of two seconds. This error would not be noticed if the customer
system is using a NTP time server to maintain the time.
* On systems with OPAL firmware, a problem was fixed for misaligned mapped
interrupts to virtual PCI devices that could cause a PB_CENT_CRESP_ADDR_ERROR
checkstop.
* On systems with OPAL firmware, a problem was fixed for a PXE (Preboot
eXecution Environment) boot (also known as network boot) hang that occurred
when a network server was down. With the fix, the boot is able to recover so
that alternative methods of booting can be selected using petitboot menu items.
* A problem was fixed for PCI Host Bridge (PHB) "link down" Endpoint
Recoverable errors that became fatal exceptions when not handled by the CAPI
adapters. With the fix, the recoverable errors are now detected by the CAPI
adapters to allow for run-time link recovery.
* On systems using PowerVM firmware, a rare problem was fixed for a system
hang that can occur when dynamically moving "uncapped" partitions to a
different shared processor pool. To prevent a system hang, the "uncapped"
partitions should be changed to "capped" before doing the move.
* On systems using the PowerVM firmware, support was added fora new utility
option for the System Management Services (SMS) menus. This is the SMS SAS I/O
Information Utility. It has been introduced to allow an user to get additional
information about the attached SAS devices. The utility is accessed by
selecting option 3 (I/O Device Information) from the main SMS menu, and then
selecting the option for "SAS Device Information".
* On systems using the PowerVM hypervisor firmware and Novalink, a problem
was fixed for a NovaLink installation error where the hypervisor was unable to
get the maximum logical memory buffer (LMB) size from the service processor.
The maximum supported LMB size should be 0xFFFFFFFF but in some cases it was
initialized to a value that was less than the amount of configured memory,
causing the service processor read failure with error code 0X00000134.
* On systems using the PowerVM hypervisor firmware and CAPI adapters, a
problem was fixed for CAPI adapter error recovery. When the CAPI adapter goes
into the error recovery state, the Memory Mapped I/O (MMIO) traffic to the
adapter from the OS continues, disrupting the recovery. With the fix, the MMIO
and DMA traffic to the adapter are now frozen until the CAPI adapter is fully
recovered. If the adapter becomes unusable because of this error, it can be
recovered using concurrent maintenance steps from the HMC, keeping the adapter
in place during the repair. The error has a low frequency since it only occurs
when the adapter has failed for another reason and needs recovery.
* On systems using the PowerVM hypervisor firmware, when using affinity
groups, if the group includes a VIOS, ensure the group is placed in the same
drawer where the VIOS physical I/O is located. Prior to this change, if the
VIOS was in an affinity group with other partitions, the partitions placement
could over-ride the VIOS adapter placement rules and the VIOS could end up in a
different drawer from the IO adapters.
* On systems using PowerVM firmware, a problem was fixed to improve error
recovery when attempting to boot an iSCSI target backed by a drive formatted
with a block size other than 512 bytes. Instead of stopping on this error, the
boot attempt fails and then continues with the next potential boot device.
Information regarding the reason for the boot failure is available in an error
log entry. The 512 byte block size for backing devices for iSCSI targets is a
partition firmware requirement.
* On systems using PowerVM firmware, a problem was fixed for a false thermal
alarm in the active optical cables (AOC) for the PCIe3 expansion drawer with
SRCs B7006AA6 and B7006AA7 being logged every 24 hours. The AOC cables have
feature codes of #ECC6 through #ECC9, depending on the length of the cable.
The SRCs should be ignored as they call for the replacement of the cable, cable
card, or the expansion drawer module. With the fix, the false AOC thermal
alarms are no longer reported.
* On systems using PowerVM firmware that have an attached HMC, a problem was
fixed for a Live Partition Mobility migration that resulted in a system hang
when an EEH error occurred simultaneously with a request for a page migration
operation. On the HMC, it shows an incomplete state for the managed system
with reference code A181D000. The recovery action is to re-IPL the source
system but that will need to be done without the assistance of the HMC. From
the Advanced System Management Interface (ASMI), power off the managed
system. Alternatively, the system power button may also be used to do the
power off. If the HMC Incomplete state persists after the power off, the
managed system should be rebuilt from the HMC. For more information on HMC
recovery steps, refer to this IBM Knowledge Center link:
https://www.ibm.com/support/knowledgecenter/en/POWER7/p7eav/aremanagedsystemstat
e_incomplete.htm
* On systems using the OPAL firmware, a problem was fixed for fundamental PCI
resets at boot time causing the PCI adapters to not be usable in the Linux OS.
No errors occur in the skiboot but the adapters are not configurable once the
OS is reached.
* On systems using the OPAL firmware, a problem was fixed for time-out errors
during the power off of PCI slots with " Timeout powering off slot ...
FIRENZE-PCI: Wrong state 00000000 on slot" error message during a power off of
the system. SV860_039_039 / FW860.00
11/02/16 Impact: New Severity:
NewThe
folllowing pertains toPower System E850C (8408-44E) servers only.
New Features and Functions
NOTE:
* GA Level
Four FW840 features that have been disabled for the 860.00 GA are listed
below. These will be re-enabled for the 860.10 service pack:
1. Support disabled for Live Partition Mobility (LPM) operations.
2. Support disabled for partition Suspend and Resume from the HMC.
3. Support disabled for partition Remote Restart.
4. Support disabled for PowerVM vNIC. PowerVM vNIC combined many of the best
features of SR-IOV and PowerVM SEA to provide a network solution with options
for advanced functions such as Live Partition Mobility along with better
performance and I/O efficiency when compared to PowerVM SEA. In addition
PowerVM vNIC provided users with bandwidth control (QoS) capability by
leveraging SR-IOV logical ports as the physical interface to the network.
* New features that have been disabled: vNIC failover; new redundant path
LPM function; and PCIe cable recovery on a link to the PCIe3 expansion
drawer.
* Do not use the following functions. They are not disabled but should not
be used as the implementations and testing has not been completed for 860.00:
1. SMS SAS I/O Information utility. If a non-SCDD (Self Configuring Device
Data) drive is attached to a controller and the utility is used to look at
devices attached to the controller, a Default Catch condition will occur due to
a partition firmware data stack underflow. This utility is accessed by
selecting option 3 (I/O Device Information) from the main SMS menu, and then
selecting option 2 (SAS Device Information).
2. 32TB Max Memory Enablement for partitions.
3. PowerVM NovaLink enhancements. For more information, refer to IBM
Knowledge Center:
http://www.ibm.com/support/knowledgecenter/POWER8/p8eig/p8eig_kickoff.htm
4. PowerVM change to support HDDW using 64K pages
5. IBM Power System E850(8408-44E) concurrent add of the PCIe expansion
drawer (#EMX0).
6. IBM Power System E850(8408-84E) concurrent add of PCIe3 Optical Cable
Adapter for PCIe3 Expansion Drawer (F/C #EJ08)
7. Enforcement of limits to IBM i support on IBM Power System S822 (8284-22A)
8. Dynamic TCE memory allocation for SR-IOV adapters
9. Dynamic Toggle of SRR
10. Power Boot List Management Platform Support
11. SAP HANA (#EPVR) enhancements - Solution edition for SAP HANA 3.65 GHz +
12 Activations
12. HMC new gui enhancements
13. LPAR DR Restart
14. HMC override for Port vs LUN level validation
15. SNMP traps for system state
16. HMC Option to boot without IPv6 Support
17. PCIe3 3D Graphics Adapter x16 (#EC51) boot support (for Linux only)
18. Non-volatile Memory Express (NVMe) boot
19. Service processor security updates
20. vHMC support for DHCP server configuration
* Support for the IBM Power System E850 (8408-44E). Similar in many respects
to the 8408-E8E but upgraded with faster processors (4.223GHz, 10C 3.957GHz,
12C 3.658GHz ) with a maximum of 48 cores and an upgrade in memory to DDR4 with
expanded capacity to 4 TB with 128 GB Dimms available. As with 8408-E8E, there
is no IBM i or OPAL support. Operating System offerings for PowerVM
partitions are AIX and Linux (RHEL, SLES, and Ubuntu).
--------------------------------------------------------------------------------
--
6.0 Installing the Firmware
The method used to install new firmware will depend on the release level of
firmware which is currently installed on your server. The release level can be
determined by the prefix of the new firmware's filename.Example: SVxxx_yyy_zzz
Where xxx = release level
* If the release level will stay the same (Example: Level SV830_040_040 is
currently installed and you are attempting to install level SV830_071_040) this
is considered an update.
* If the release level will change (Example: Level SV830_040_040 is currently
installed and you are attempting to install level SV840_050_050) this is
considered an upgrade.
(http://www-01.ibm.com/support/knowledgecenter/8286-42A/p8ha1/updupdates.htm)
Instructions for installing firmware updates and upgrades can be found at
http://www-01.ibm.com/support/knowledgecenter/9119-MHE/p8ha1/updupdates.htm IBM
i Systems:
For information concerning IBM i Systems, go to the following URL to access
Fix Central:
http://www-933.ibm.com/support/fixcentral/
Choose "Select product", under Product Group specify "System i", under
Product specify "IBM i", then Continue and specify the desired firmware PTF
accordingly.
7.0 Firmware History
The complete Firmware Fix History for this Release Level can be reviewed at
the following url:
http://download.boulder.ibm.com/ibmdl/pub/software/server/firmware/SVQ-Firmware-
Hist.html