Aix Led Codes
Aix Led Codes
24-April-2003
Beginning with AIX5L, IBM discontinued the publication of the Messages Guide and
Reference whitebook [SC23-4129]. This info is [only] available using a web applet:
http://www16.boulder.ibm.com/pseries/en_US/infocenter/base/errorsearch.htm
There are instances when this solution is simply insufficient. I have created this unofficial
document in the spirit of the Messages Guide and Reference. Use it at your own risk. All
errors are mine and mine alone.
This page provides descriptions for the numbers and characters that display on the operator panel and
descriptions of the location codes used to identify a particular item. Information is available about the
following codes:
AB-CD-EF-GH
AB-CD-EF-G,H
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
● The AB value identifies a bus type or PCI parent bus as assigned by the firmware.
● The CD value identifies adapter number, adapter's devfunc number, or physical location. The
devfunc number is defined as the PCI device number times 8, plus the function number.
● The EF value identifies a connector.
● The GH value identifies a port, address, device, or FRU.
Adapters and cards are identified only with AB-CD. The possible values for AB are:
00 Processor bus
01 ISA bus
02 EISA bus
03 MCA bus
04 PCI bus used in the case where the PCI bus cannot be identified
05 PCMCIA buses
xy For PCI adapters where x is equal to or greater than 1. The x and y are characters in the range of 0-9,
A-H, J-N, P-Z (O, I, and lower case are omitted) and are equal to the parent bus's ibm, aix-loc Open
Firmware Property.
The possible values for CD depend on the adapter or card are as follows:
● For pluggable PCI adapters/cards, CD is the device's devfunc number (PCI device number times
8, plus the function number). The C and D are characters in the range of 0-9, and A-F (hex
numbers). This allows the location code to uniquely identify multiple adapters on individual PCI
cards.
For pluggable ISA adapters, CD is equal to the order in which the ISA cards defined or
configured, either by SMIT or the ISA Adapter Configuration Service Aid.
For integrated ISA adapters, CD is equal to a unique code identifying the ISA adapter. In most
cases, this is equal to the adapter's physical location code. In cases where a physical location code
is not available, CD is FF.
● EF is the connector ID. It is used to identify a connector on the adapter to which a resource is
attached.
● GH is used to identify a port, device, or FRU. For example:
❍ For async devices, GH defines the port on the fanout box. The values are 00 to 15.
For the integrated adapters, EF-GH is the same as the definition for the pluggable adapters. For example,
the location code for a diskette drive is 01-D1-00-00. A second diskette drive is 01-D1-00-01.
AB-CD-EF-G,H
| | | | |
| | | | Logical Unit address of the SCSI Device
| | | Control Unit Address of the SCSI Device
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
Where:
There is also a bus location code that is generated as '00-xxxxxxxx' where xxxxxxxx is equivalent to the
node's unit address. Refer to the system unit service guide for additional information.
Location Codes for CHRP Model Architecture System Units Top of page
Note:
You need to know which system architecture the system unit on which you are working uses. If
you are working with a RSPC model use the Location Codes for RSPC Model Architecture
System Units. If you do not know which model you have, refer to Determining System
Architecture in Diagnostic Information for Multiple Bus Systems before proceeding.
The (CHRP) system unit uses Physical Location Codes in conjunction with AIX Location Codes to
provide mapping of the failing field replaceable units. The location codes are produced by the system
unit's firmware and the AIX operating system.
Note:
Some systems might produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, use the three
rightmost digits.
c00
AIX Install/Maintenance loaded successfully.
c01
Insert the first diagnostic diskette.
c02
Diskettes inserted out of sequence.
c03
The wrong diskette is in diskette drive.
c04
The loading stopped with an irrecoverable error.
c05
A diskette error occurred.
c06
The rc.boot configuration shell script is unable to determine type of boot.
c07
Insert the next diagnostic diskette.
c08
RAM file system started incorrectly.
c09
The diskette drive is reading or writing a diskette.
c20
An unexpected halt occurred, and the system is configured to enter the kernel debug program
instead of entering a system dump.
c21
The ifconfig command was unable to configure the network for the client network host.
c22
The tftp command was unable to read client's ClientHostName info file during a client network
boot.
c24
Unable to read client's ClientHostName.info file during a client network boot.
c25
Client did not mount remote miniroot during network install.
c26
Client did not mount the /usr file system during the network boot.
c29
The system was unable to configure the network device.
c31
Select the console display for the diagnostics. To select No console display, set the key mode
switch to Normal then to Service. The diagnostic programs then load and run the diagnostics
automatically. If you continue to get the message, check the cables and make sure you are using
the serial port.
c32
A directly attached display (HFT) was selected.
c33
A TTY terminal attached to serial ports S1 or S2 was selected.
c34
A file was selected. The console messages store in a file.
c35
No console found.
c40
Configuration files are being restored.
c41
Could not determine the boot type or device.
c42
Extracting data files from diskette.
c43
Cannot access the boot/install tape.
c44
Initializing installation database with target disk information.
c45
Cannot configure the console.
c46
Normal installation processing.
c47
Could not create a physical volume identifier (PVID) on disk.
c48
Prompting you for input.
c49
Could not create or form the JFS log.
c50
Creating root volume group on target disks.
c51
No paging devices were found.
c52
Changing from RAM environment to disk environment.
c53
Not enough space in the /tmp directory to do a preservation installation.
c54
Installing either BOS or additional packages.
c55
Could not remove the specified logical volume in a preservation installation.
c56
Running user-defined customization.
c57
Failure to restore BOS.
c58
Displaying message to turn the key.
c59
Could not copy either device special files, device ODM, or volume group information from RAM
to disk.
c61
Failed to create the boot image.
c62
Loading platform dependent debug files.
c63
Loading platform dependent data files.
c64
Failed to load platform dependent data files.
c70
Problem Mounting diagnostic CD-ROM disc.
c99
Diagnostics have completed. This code is only used when there is no console.
Fxx
(xx is any number) Refer to Firmware chapter of the service manual.
The following dump progress indicators, or dump status codes, are part of a Type 102 message.
Note:
When a lowercase c is listed, it displays in the lower half of the character position. Some systems
produce 4-digit codes, the two leftmost positions can have a blanks or zeros. Use the two
rightmost digits.
0c0
The dump completed successfully.
0c1
The dump failed due to an I/O error.
0c2
A dump, requested by the user, is started.
0c3
The dump is inhibited.
0c4
The dump device is not large enough.
0c5
The dump did not start, or the dump crashed.
0c6
Dumping to a secondary dump device.
0c7
Reserved.
0c8
The dump function is disabled.
0c9
A dump is in progress.
0cc
Unknown dump failure
Crash Codes
Note:
Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, use the three
rightmost digits.
The crash codes that follow are part of a Type 102 message. These crash codes are grouped into three
categories:
Category 1
Dump analysis is the appropriate first action in Problem Determination, begin the Problem
Category 2
Dump analysis most likely will not aid in Problem Determination, begin the Problem
Determination process with hardware support.
Category 3
Both software and hardware support may be needed in Problem Determination, go to MAP 0070:
888 Sequence in Operator Panel Display in Diagnostic Information for Multiple Bus Systemsto
assist in problem isolation.
Category 1
300
Data storage interrupt from the processor.
32x
Data storage interrupt because of an I/O exception from IOCC.
38x
Data storage interrupt because of an I/O exception from SLA.
400
Instruction storage interrupt.
700
Program interrupt.
Category 2
200
Machine check because of a memory bus error.
201
Machine check because of a memory timeout.
202
Machine check because of a memory card failure.
203
Machine check because of a out of range address.
204
Machine check because of an attempt to write to ROS.
205
Machine check because of an uncorrectable address parity.
206
Machine check because of an uncorrectable ECC error.
207
Machine check because of an unidentified error.
208
Machine check due to an L2 uncorrectable ECC.
500
External interrupt because of a scrub memory bus error.
501
External interrupt because of an unidentified error.
51x
External interrupt because of a DMA memory bus error.
52x
External interrupt because of an IOCC channel check.
53x
External interrupt from an IOCC bus timeout; x represents the IOCC number.
54x
External interrupt because of an IOCC keyboard check.
800
Floating point is not available.
Category 3
000
Unexpected system interrupt.
558
600
AIX 4.3.3.3 and above: Alignment Interrupt. If pre-AIX 4.3.3.3: AIX has crashed because the
Portability Assist Layer (PAL) for this machine type has detected a problem.
605
AIX has crashed because the Portability Assist Layer (PAL) for this machine type has detected a
problem (AIX 4.3.3.3 and above).
The numbers in this list display on the operator panel as the system loads the operating system and
prepares the hardware by loading software drivers.
Note:
Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, use the three
rightmost digits.
2E6
The PCI Differential Ultra SCSI adapter or the Universal PCI Differential Ultra SCSI adapter
being configured.
2E7
Configuration method unable to determine if the SCSI adapter type is SE or DE type.
440
9.1GB Ultra SCSI Disk Drive being identified or configured.
441
18.2GB Ultra SCSI Disk Drive being identified or configured.
444
2-Port Multiprotocol PCI Adapter (ASIC) being identified or configured.
447
PCI 64-bit Fibre Channel Arbitrated Loop Adapter being configured.
500
Querying Standard I/O slot.
501
Querying card in Slot 1.
502
Querying card in Slot 2.
503
Querying card in Slot 3.
504
Querying card in Slot 4.
505
Querying card in Slot 5.
506
Querying card in Slot 6.
507
Querying card in Slot 7.
508
Querying card in Slot 8.
510
Starting device configuration.
511
Device configuration completed.
512
Restoring device configuration files from media.
513
Restoring basic operating system installation files from media.
516
Contacting server during network boot.
517
Mounting client remote file system during network IPL.
518
Remote mount of the root (/) and /usr file systems failed during network boot.
520
Bus configuration running.
521
/etc/init invoked cfgmgr with invalid options; /etc/inithas been corrupted or incorrectly modified
(irrecoverable error).
522
The configuration manager has been invoked with conflicting options (irrecoverable error).
523
The configuration manager is unable to access the ODM database (irrecoverable error).
524
The configuration manager is unable to access the config.rules object in the ODM database
(irrecoverable error).
525
The configuration manager is unable to get data from a customized device object in the ODM
database (irrecoverable error).
526
The configuration manager is unable to get data from a customized device driver object in the
ODM database ( irrecoverable error).
527
The configuration manager was invoked with the phase 1 flag; running phase 1 at this point is not
permitted (irrecoverable error).
528
The configuration manager cannot find sequence rule, or no program name was specified in the
ODM database (irrecoverable error).
529
The configuration manager is unable to update ODM data (irrecoverable error).
530
The program savebase returned an error.
531
The configuration manager is unable to access the PdAt object class (irrecoverable error).
532
There is not enough memory to continue (malloc failure); irrecoverable error.
533
The configuration manager could not find a configuration method for a device.
534
The configuration manager is unable to acquire database lock (irrecoverable error).
535
HIPPI diagnostics interface driver being configured.
536
The configuration manager encountered more than one sequence rule specified in the same phase
(irrecoverable error).
537
The configuration manager encountered an error when invoking the program in the sequence rule.
538
The configuration manager is going to invoke a configuration method.
539
The configuration method has terminated, and control has returned to the configuration manager.
541
A DLT tape device is being configured.
549
Console could not be configured for the Copy a System Dump Menu.
551
IPL vary-on is running.
552
IPL vary-on failed.
553
IPL phase 1 is complete.
554
The boot device could not be opened or read, or unable to define NFS swap device during network
boot.
555
An ODM error occurred when trying to vary-on the rootvg, or unable to create an NFS swap
device during network boot.
556
Logical Volume Manager encountered error during IPL vary-on.
557
The root filesystem does not mount.
558
There is not enough memory to continue the system IPL.
559
Less than 2 M bytes of good memory are available to load the AIX kernel.
569
FCS SCSI protocol device is being configured (32 bits).
570
Virtual SCSI devices being configured.
571
HIPPI common function device driver being configured.
572
HIPPI IPI-3 master transport driver being configured.
573
HIPPI IPI-3 slave transport driver being configured.
574
HIPPI IPI-3 transport services user interface device driver being configured.
575
A 9570 disk-array driver being configured.
576
Generic async device driver being configured.
577
Generic SCSI device driver being configured.
578
Generic commo device driver being configured.
579
Device driver being configured for a generic device.
580
HIPPI TCPIP network interface driver being configured.
581
Configuring TCP/IP.
582
Configuring Token-Ring data link control.
583
Configuring an Ethernet data link control.
584
Configuring an IEEE Ethernet data link control.
585
Configuring an SDLC MPQP data link control.
586
Configuring a QLLC X.25 data link control.
587
Configuring a NETBIOS.
588
Configuring a Bisync Read-Write (BSCRW).
589
SCSI target mode device being configured.
590
Diskless remote paging device being configured.
591
Configuring an LVM device driver.
592
Configuring an HFT device driver.
593
Configuring SNA device drivers.
594
Asynchronous I/O being defined or configured.
595
X.31 pseudo-device being configured.
596
SNA DLC/LAPE pseudo-device being configured.
597
OCS software being configured.
598
OCS hosts being configured during system reboot.
599
Configuring FDDI data link control.
59B
FCS SCSI protocol device being configured (64 bits).
5C0
Streams-based hardware drive being configured.
5C1
Streams-based X.25 protocol being configured.
5C2
Streams-based X.25 COMIO emulator driver being configured
5C3
Streams-based X.25 TCP/IP interface driver being configured.
5C4
FCS adapter device driver being configured.
5C5
SCB network device driver for FCS being configured.
5C6
AIX SNA channel being configured.
600
Starting network boot portion of /sbin/rc.boot.
602
Configuring network parent devices.
603
/usr/lib/methods/defsys, /usr/lib/methods/cfgsys, or /usr/lib/methods/cfgbus failed.
604
Configuring physical network boot device.
605
Configuration of physical network boot device failed.
606
Running /usr/sbin/ifconfig on logical network boot device.
607
/usr/sbin/ifconfig failed.
608
Attempting to retrieve the client.info file with tftp.Note that a flashing 608 indicates multiple
attempt(s) to retrieve the client_info file are occurring.
609
The client.info file does not exist or it is zero length.
60B
18.2GB 68-pin LVD SCSI Disk Drive being configured.
610
Attempting remote mount of NFS file system.
611
Remote mount of the NFS file system failed.
612
Accessing remote files; unconfiguring network boot device.
614
Configuring local paging devices.
615
Configuration of a local paging device failed.
616
Converting from diskless to dataless configuration.
617
Diskless to dataless configuration failed.
618
Configuring remote (NFS) paging devices.
619
Configuration of a remote (NFS) paging device failed.
61B
36.4GB 80-pin LVD SCSI Disk Drive being configured.
61D
36.4GB 80-pin LVD SCSI Disk Drive being configured.
61E
18.2GB 68-pin LVD SCSI Disk Drive being configured.
620
Updating special device files and ODM in permanent filesystem with data from boot RAM
filesystem.
621
9.1 GB LVD 80-pin SCSI Drive being configured.
622
Boot process configuring for operating system installation.
62D
9.1GB 68-pin LVD SCSI Disk Drive being configured.
62E
9.1GB 68-pin LVD SCSI Disk Drive being configured.
636
TURBROWAYS 622 Mbps PCI MMF ATM Adapter.
637
Dual Channel PCI-2 Ultra2 SCSI Adapter being configured.
638
4.5GB Ultra SCSI Single Ended Disk Drive being configured.
639
9.1GB 10K RPM Ultra SCSI Disk Drive (68-pin).
63A
See 62D.
63B
9.1GB 80-pin LVD SCSI Disk Drive being configured.
63C
See 60B.
63D
18.2GB 80-pin LVD SCSI Disk Drive being configured.
63E
63F
See 61B.
640
9.1GB 10K RPM Ultra SCSI Disk Drive (80-pin).
646
High-Speed Token-Ring PCI Adapter being configured.
64A
See 62E.
64B
9.1GB 80-pin LVD SCSI Disk Drive being configured.
64C
See 61E.
64D
18.2 GB LVD 80-pin Drive/Carrier being configured.
64E
36.4GB 68-pin LVD SCSI Disk Drive being configured.
64F
See 61D.
650
IBM SCSD disk drive being configured.
653
18.2GB Ultra-SCSI 16-bit Disk Drive being configured.
655
GXT130P Graphics adapter being configured.
657
GXT2000P graphics adapter being configured.
658
659
2102 Fibre Channel Disk Subsystem Controller Drawer being identified or configured.
660
2102 Fibre Channel Disk Array being identified or configured.
662
Ultra2 Integrated SCSI controller.
663
The ARTIC960RxD Digital Trunk Quad PCI Adapter or the ARTIC960RxF Digital Trunk
Resource Adapter being configured.
664
32x (MAX) SCSI-2 CD-ROM drive being configured.
667
PCI 3-Channel Ultra2 SCSI RAID Adapter being configured.
669
PCI Gigabit Ethernet Adapter being configured.
66C
10/100/1000 Base-T EthernetPCI Adapter.
66D
PCI 4-Channel Ultra-3 SCSI RAID Adapter.
66E
4.7 GB DVD-RAM drive.
674
ESCON(R) Channel PCI Adapter being configured.
677
PCI 32-bit Fibre Channel Arbitrated Loop Adapter being configured.
67B
PCI Cryptographic Coprocessor being configured.
682
20x (MAX) SCSI-2 CD-ROM Drive being configured.
689
4.5GB Ultra SCSI Single Ended Disk Drive being configured.
68C
20 GB 4-mm Tape Drive being configured.
68E
POWER GXT6000P PCI Graphics Adapter.
690
9.1GB Ultra SCSI Single Ended Disk Drive being configured.
69b
64-bit/66MHz PCI ATM 155 MMF PCI adapter being configured.
69d
64-bit/66MHz PCI ATM 155 UTP PCI adapter being configured.
6CC
SSA disk drive being configured.
700
A 1.1 GB 8-bit SCSI disk drive being identified or configured.
701
A 1.1 GB 16-bit SCSI disk drive being identified or configured.
702
A 1.1 GB 16-bit differential SCSI disk drive being identified or configured.
703
A 2.2 GB 8-bit SCSI disk drive being identified or configured.
704
A 2.2 GB 16-bit SCSI disk drive being identified or configured.
705
The configuration method for the 2.2 GB 16-bit differential SCSI disk drive is being run. If an
irrecoverable error occurs, the system halts.
706
A 4.5 GB 16-bit SCSI disk drive being identified or configured.
707
A 4.5 GB 16-bit differential SCSI disk drive being identified or configured.
708
A L2 cache being identified or configured.
710
POWER GXT150M graphics adapter being identified or configured.
711
Unknown adapter being identified or configured.
712
Graphics slot bus configuration is executing.
713
The IBM ARTIC960 device being configured.
714
A video capture adapter being configured.
715
The Ultramedia Services audio adapter being configured. (this number displays briefly on the
panel).
717
TP Ethernet Adapter being configured.
718
GXT500 Graphics Adapter being configured.
720
Unknown read/write optical drive type being configured.
721
Unknown disk or SCSI device being identified or configured.
722
723
Unknown CD-ROM being identified or configured.
724
Unknown tape drive being identified or configured.
725
Unknown display adapter being identified or configured.
726
Unknown input device being identified or configured.
727
Unknown async device being identified or configured.
728
Parallel printer being identified or configured.
729
Unknown parallel device being identified or configured.
730
Unknown diskette drive being identified or configured.
731
PTY being identified or configured.
732
Unknown SCSI initiator type being configured.
733
7GB 8 mm tape drive being configured.
734
4x SCSI-2 640 MB CD-ROM Drive being configured.
736
Quiet Touch keyboard and speaker cable being configured.
741
745
16GB 4 mm Tape Auto Loader being configured.
746
SCSI-2 Fast/Wide PCI Adapter being configured.
747
SCSI-2 Differential Fast/Wide PCI Adapter being configured.
749
7331 Model 205 Tape Library being configured.
751
SCSI 32-bit SE F/W RAID Adapter being configured.
754
1.1GB 16-bit SCSI disk drive being configured.
755
2.2GB 16-bit SCSI disk drive being configured.
756
4.5GB 16-bit SCSI disk drive being configured.
757
External 13GB 1.5M/s 1/4 inch tape being configured.
763
SP Switch MX Adapter being configured.
764
SP System Attachment Adapter being configured.
772
4.5GB SCSI F/W Disk Drive being configured.
773
9.1GB SCSI F/W Disk Drive being configured.
774
776
PCI Token-Ring Adapter being identified or configured.
777
10/100 Ethernet Tx PCI Adapter being identified or configured.
778
POWER GXT3000P 3D PCI Graphics adapter being configured.
77B
4-Port 10/100 Ethernet Tx PCI Adapter being identified or configured.
77c
A 1.0 GB 16-bit SCSI disk drive being identified or configured.
783
4 mm DDS-2 Tape Autoloader being configured.
789
2.6 GB External Optical Drive being configured.
78B
POWER GXT4000P PCI Graphics Adapter.
78C
PCI bus configuration executing.
78D
GXT300P 2D Graphics adapter being configured.
790
Multi-bus Integrated Ethernet Adapter being identified or configured.
797
TURBOWAYS(R) 155 UTP/STP ATM Adapter being identified or configured.
798
Video streamer adapter being identified or configured.
799
79c
ISA bus configuration executing.
7C0
CPU/System Interface being configured.
7C1
Business Audio Subsystem being identified or configured.
7cc
PCMCIA bus configuration executing.
800
TURBOWAYS 155 MMF ATM Adapter being identified or configured.
803
7336 Tape Library robotics being configured.
804
8x Speed SCSI-2 CD-ROM Drive being configured.
806
POWER GXT800 PCI Graphics adapter being configured.
807
SCSI Device Enclosure being configured.
80c
SSA 4-Port Adapter being identified or configured.
811
Processor complex being identified or configured.
812
Memory being identified or configured.
813
Battery for time-of-day, NVRAM, and so on being identified or configured, or system I/O control
logic being identified or configured.
814
NVRAM being identified or configured.
815
Floating-point processor test.
816
Operator panel logic being identified or configured.
817
Time-of-day logic being identified or configured.
819
Graphics input device adapter being identified or configured.
821
Standard keyboard adapter being identified or configured.
823
Standard mouse adapter being identified or configured.
824
Standard tablet adapter being identified or configured.
825
Standard speaker adapter being identified or configured.
826
Serial Port 1 adapter being identified or configured.
827
Parallel port adapter being identified or configured.
828
Standard diskette adapter being identified or configured.
831
3151 adapter being identified or configured, or Serial Port 2 being identified or configured.
834
64-port async controller being identified or configured.
835
16-port async concentrator being identified or configured.
836
128-port async controller being identified or configured.
837
16-port remote async node being identified or configured.
838
Network Terminal Accelerator Adapter being identified or configured.
839
7318 Serial Communications Server being configured.
840
PCI Single-Ended Ultra SCSI Adapter being configured.
841
8-port async adapter (EIA-232) being identified or configured.
842
8-port async adapter (EIA-422A) being identified or configured.
843
8-port async adapter (MIL-STD 188) being identified or configured.
844
7135 RAIDiant Array disk drive subsystem controller being identified or configured.
845
7135 RAIDiant Array disk drive subsystem drawer being identified or configured.
846
RAIDiant Array SCSI 1.3GB Disk Drive being configured.
847
16-port serial adapter (EIA-232) being identified or configured.
848
16-port serial adapter (EIA-422) being identified or configured.
849
X.25 Interface Coprocessor/2 adapter being identified or configured.
850
Token-Ring network adapter being identified or configured.
851
T1/J1 Portmaster(R) adapter being identified or configured.
852
Ethernet adapter being identified or configured.
854
3270 Host Connection Program/6000 connection being identified or configured.
855
Portmaster Adapter/A being identified or configured.
857
FSLA adapter being identified or configured.
858
5085/5086/5088 adapter being identified or configured.
859
FDDI adapter being identified or configured.
85c
Token-Ring High-Performance LAN adapter being identified or configured.
861
Optical adapter being identified or configured.
862
Block Multiplexer Channel Adapter being identified or configured.
865
ESCON Channel Adapter or emulator being identified or configured.
866
SCSI adapter being identified or configured.
867
Async expansion adapter being identified or configured.
868
SCSI adapter being identified or configured.
869
SCSI adapter being identified or configured.
870
Serial disk drive adapter being identified or configured.
871
Graphics subsystem adapter being identified or configured.
872
Grayscale graphics adapter being identified or configured.
874
Color graphics adapter being identified or configured.
875
Vendor generic communication adapter being configured.
876
8-bit color graphics processor being identified or configured.
877
POWER Gt3(TM)/POWER Gt4(TM) being identified or configured.
878
POWER Gt4 graphics processor card being configured.
879
24-bit color graphics card, MEV2 being configured.
880
POWER Gt1(TM) adapter being identified or configured.
887
Integrated Ethernet adapter being identified or configured.
889
SCSI adapter being identified or configured.
890
SCSI-2 Differential Fast/Wide and Single-Ended Fast/Wide Adapter/A being configured.
891
Vendor SCSI adapter being identified or configured.
892
Vendor display adapter being identified or configured.
893
Vendor LAN adapter being identified or configured.
894
Vendor async/communications adapter being identified or configured.
895
Vendor IEEE 488 adapter being identified or configured.
896
Vendor VME bus adapter being identified or configured.
897
S/370(TM) Channel Emulator adapter being identified or configured.
898
POWER Gt1x(TM) graphics adapter being identified or configured.
899
3490 attached tape drive being identified or configured.
89c
A multimedia SCSI CD-ROM being identified or configured.
900
GXT110P Graphics Adapter being identified or configured.
901
Vendor SCSI device being identified or configured.
902
Vendor display device being identified or configured.
903
Vendor async device being identified or configured.
904
Vendor parallel device being identified or configured.
905
Vendor other device being identified or configured.
908
POWER GXT1000 Graphics subsystem being identified or configured.
910
1/4GB Fiber Channel/266 Standard Adapter being identified or configured.
911
Fiber Channel/1063 Adapter Short Wave being configured.
912
2.0GB SCSI-2 differential disk drive being identified or configured.
913
1.0GB differential disk drive being identified or configured.
914
5GB 8 mm differential tape drive being identified or configured.
915
4GB 4 mm tape drive being identified or configured.
916
Non-SCSI vendor tape adapter being identified or configured.
917
A 2.0 GB 16-bit differential SCSI disk drive being identified or configured.
918
A 2 GB 16-bit single-ended SCSI disk drive being identified or configured.
920
Bridge Box being identified or configured.
921
101 keyboard being identified or configured.
922
102 keyboard being identified or configured.
923
Kanji keyboard being identified or configured.
924
Two-button mouse being identified or configured.
925
Three-button mouse being identified or configured.
926
5083 tablet being identified or configured.
927
5083 tablet being identified or configured.
928
Standard speaker being identified or configured.
929
Dials being identified or configured.
930
Lighted program function keys (LPFK) being identified or configured.
931
IP router being identified or configured.
933
Async planar being identified or configured.
934
Async expansion drawer being identified or configured.
935
3.5-inch diskette drive being identified or configured.
936
5.25-inch diskette drive being identified or configured.
937
An HIPPI adapter being configured.
938
Serial HIPPI PCI adapter being configured.
942
POWER GXT 100 graphics adapter being identified or configured.
943
A 3480 or 3490 control unit attached to a System/370 Channel Emulator/A adapter are being
identified or configured.
944
100MB ATM adapter being identified or configured.
945
1.0GB SCSI differential disk drive being identified or configured.
946
Serial port 3 adapter being identified or configured.
947
A 730MB SCSI disk drive being configured.
948
Portable disk drive being identified or configured.
949
Unknown direct bus-attach device being identified or configured.
950
Missing SCSI device being identified or configured.
951
670MB SCSI disk drive being identified or configured.
952
355MB SCSI disk drive being identified or configured.
953
320MB SCSI disk drive being identified or configured.
954
400MB SCSI disk drive being identified or configured.
955
857MB SCSI disk drive being identified or configured.
956
670MB SCSI disk drive electronics card being identified or configured.
957
120 MB DBA disk drive being identified or configured.
958
160 MB DBA disk drive being identified or configured.
959
160 MB SCSI disk drive being identified or configured.
960
1.37GB SCSI disk drive being identified or configured.
964
Internal 20 GB 8 mm tape drive identified or configured.
968
1.0 GB SCSI disk drive being identified or configured.
970
Half-inch, 9-track tape drive being identified or configured.
971
150 MB 1/4-inch tape drive being identified or configured.
972
2.3 GB 8 mm SCSI tape drive being identified or configured.
973
Other SCSI tape drive being identified or configured.
974
CD-ROM drive being identified or configured.
975
An optical disk drive being identified or configured.
977
M-Audio Capture and Playback Adapter being identified or configured.
981
540MB SCSI-2 single-ended disk drive being identified or configured.
984
1GB 8-bit disk drive being identified or configured.
985
M-Video Capture Adapter being identified or configured.
986
2.4GB SCSI disk drive being identified or configured.
987
An Enhanced SCSI CD-ROM drive being identified or configured.
989
200MB SCSI disk drive being identified or configured.
990
2.0GB SCSI-2 single-ended disk drive being identified or configured.
991
525MB 1/4-inch cartridge tape drive being identified or configured.
994
5 GB 8 mm tape drive being identified or configured.
995
1.2GB 1/4 inch cartridge tape drive being identified or configured.
996
A single-port, multiprotocol communications adapter being identified or configured.
997
FDDI adapter being identified or configured.
998
2.0 GB 4 mm tape drive being identified or configured.
999
7137 or 3514 Disk Array Subsystem being configured.
D46
Token-Ring cable
D81
T2 Ethernet Adapter being configured.
2530
10/100 Mbps Ethernet PCI Adapter II being configured.
The P2/E1 is the physical location code indicating an Ethernet port built into the P2 planar.
The 10-60 is an AIX location code indicating a PCI parent bus of 10, and a devfunc number of 60
(for more information , see AIX Location Codes).
These physical location codes can appear in many places while running diagnostics; for instance,
Physical location codes provide a mapping of logical functions in a platform (or expansion sites for
logical functions, such as connectors or ports) to their specific locations within the physical structure of
the platform.
The format for the location code is a string of alphanumeric characters separated by a dash (-), slash (/),
pound sign (#), or period (.). The base location is all of the information before the slash (/) or pound sign
(#). It identifies a device that is connected or plugged into the parent. Extended location information
follows the slash (/). It identifies a device that is part of the parent, a connector, or a cable. Cable
information follows the pound sign (#). It identifies a cable that is connected to the parent. The following
are examples:
The period (.) is used to identify sub-locations such as memory DIMMs on a base memory card or a
specific SCSI address. The following are examples:
Depending on the AIX and firmware levels, AIX Diagnostics may include extended location information
when identifying a planar or card. The extended location information or cable information is always
included when identifying a cable or connector. Location codes with extended location information that
display without a description identifying the devices, always identify the cable attached to the port.
The following table lists the assigned values for the location type prefixes. In most cases, the prefix value
assignments were chosen to provide some mnemonic characteristic, so that they would be easier to
remember. The underlined characters in the description field are intended to illustrate this mnemonic
relationship.
Prefix Value
Description (n=instance #)
Rack or drawer unit Un
Drawer unit mounted in a rack Un.n (U0.n if rack
cannot be sensed by
firmware)
Single enclosure platform (No enclosure location
code)
Planar (backplane, system, I/O) Pn
Planar riser card, extender Pn.n
Power/voltage supply, voltage regulator, backup battery Vn
Fan/sensor Fn
LED/LCD operator panel Ln
or
Logical device address n relative to adapter port
CPU/cache card (or pluggable module if on planar) Cn
CPU/cache module on CPU card (if pluggable) Cn.n
Memory card or SIMM/DIMM on planar Mn
Memory SIMM/DIMM on memory card Mn.n
Other extra-function base system cards (for example, service processor) Xn
I/O adapter In
Pluggable modules or daughter cards on I/O adapter In.n
Device in Bay n Dn
Ports/Connectors:
Graphics/video connector Gn
Keyboard/keyboard connector Kn
Mouse/mouse connector On
Serial port Sn
Parallel port Rn
Ethernet connector En
Token Ring connector Tn
SCSI (pronounced scuzzy) connector Zn
Other I/O ports or connectors Qn
SCSI device addresses (including SSA (Serial Storage Architecture))
Primary address (SCSI control unit ID) An
Primary and secondary address (SCSI ID and LUN (Logical Unit Number)) An.n
SCSI device location in SCSI Enclosure Services (SES)
SCSI bank Bn
SCSI bank and bay Bn.n
Undefined prefixes (reserved) H, J, N, Y
Unique device address, this address remains constant independent of which Wn
port the device is attached to.
Location Codes for RSPC Model Architecture System Units Top of page
Notes:
1. RSPC systems are only supported with AIX or Diagnostic versions below 5.2.0
2. You need to know which system architecture the system unit on which you are working uses. If
you are working with a CHRP model, use the Location Codes for CHRP Model Architecture
System Units. If you do not know which model you have, refer to Determining System
Architecture in Diagnostic Information for Multiple Bus Systems before proceeding.
Because the same diagnostic programs are used on all system units, a location code is used to physically
locate a failing device or unit. The location code is displayed along with the service request number
(SRN) when the diagnostic programs isolate a failure. If the location code is not known, you can run the
Display Previous Diagnostic Results service aid to display the results of the last time the diagnostic
programs were run.
AB-CD-EF-GH non-SCSI
AB-CD-EF-G,H SCSI
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| Slot or Adapter Number
Bus Type
For systems that have memory modules that plug directly into the system planar, the
location code is 00-00-00-GH where GH is the memory module slot. For systems that have
memory cards with memory modules, the location code is 00-CD-EF-GH, where CD is the
card slot and GH is the memory module slot.
❍ For L2 caches, GH defines the cache. Values for GH are 1 through 16.
❍ For PCMCIA devices, GH defines the PCMCIA. Values for GH are 1 through 16.
❍ For async devices, GH defines the port on the fanout box. Values are 00 to 15.
❍ For a diskette drive, H defines which diskette drive 1 or 2. G is always 0.
❍ For all other devices, GH is equal to 00.
For integrated adapters, EF-GH is the same as the definition for a pluggable adapter. For example, the
location code for a diskette drive is 01-A0-00-00. A second diskette drive is 01-A0-00-01.
AB-CD-EF-G,H
| | | | |
| | | | Logical Unit Address of SCSI Device
| | | Control Unit Address of SCSI Device
| | Connector ID
| Slot or Adapter Number
Bus Type
Where:
Adapters and cards are identified with only AB-CD. The possible values for AB are as follows:
For pluggable adapters or cards, this is a two-digit slot number in the range from 01 to 99. However, in
the case of ISA cards these numbers do not actually correspond to the physical slot numbers. They simply
are based on the order in which the ISA cards are defined or configured, either by SMIT or the ISA
Adapter Configuration Service Aid.
For integrated adapters, the first character (C) is a letter in the range from A to Z. This letter is based on
the order in which the integrated adapters are defined in residual data. This ensures unique location codes
for the integrated adapters. The second character (D) is set to 0.
Processor-PCI bus
00-00 PCI bus
Memory module in system planar
00-00-00-01
Memory module in card
00-0A-00-01
Integrated PCI adapters
04-A0 ISA bus (Integrated PCI-ISA bridge)
04-B0 Secondary PCI bus (Integrated PCI-PCI bridge)
04-C0 Integrated PCI SCSI controller
Non-integrated PCI adapters
04-01 Any PCI card in slot 1
04-02 Any PCI card in slot 2
Integrated ISA adapters
01-A0 Diskette adapter
01-B0 Parallel port adapter
01-C0 Serial port 1 adapter
This page provides descriptions for the numbers and characters that display on the operator panel and
descriptions of the location codes used to identify a particular item. Information is available about the
following codes:
AB-CD-EF-GH
AB-CD-EF-G,H
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
● The AB value identifies a bus type or PCI parent bus as assigned by the firmware.
● The CD value identifies adapter number, adapter's devfunc number, or physical location. The
devfunc number is defined as the PCI device number times 8, plus the function number.
● The EF value identifies a connector.
● The GH value identifies a port, address, device, or FRU.
Adapters and cards are identified only with AB-CD. The possible values for AB are:
00 Processor bus
01 ISA bus
02 EISA bus
03 MCA bus
04 PCI bus used in the case where the PCI bus cannot be identified
05 PCMCIA buses
xy For PCI adapters where x is equal to or greater than 1. The x and y are characters in the range of 0-9,
A-H, J-N, P-Z (O, I, and lower case are omitted) and are equal to the parent bus's ibm, aix-loc Open
Firmware Property.
The possible values for CD depend on the adapter or card are as follows:
● For pluggable PCI adapters/cards, CD is the device's devfunc number (PCI device number times
8, plus the function number). The C and D are characters in the range of 0-9, and A-F (hex
numbers). This allows the location code to uniquely identify multiple adapters on individual PCI
cards.
For pluggable ISA adapters, CD is equal to the order in which the ISA cards defined or
configured, either by SMIT or the ISA Adapter Configuration Service Aid.
For integrated ISA adapters, CD is equal to a unique code identifying the ISA adapter. In most
cases, this is equal to the adapter's physical location code. In cases where a physical location code
is not available, CD is FF.
● EF is the connector ID. It is used to identify a connector on the adapter to which a resource is
attached.
● GH is used to identify a port, device, or FRU. For example:
❍ For async devices, GH defines the port on the fanout box. The values are 00 to 15.
For the integrated adapters, EF-GH is the same as the definition for the pluggable adapters. For example,
the location code for a diskette drive is 01-D1-00-00. A second diskette drive is 01-D1-00-01.
AB-CD-EF-G,H
| | | | |
| | | | Logical Unit address of the SCSI Device
| | | Control Unit Address of the SCSI Device
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
Where:
There is also a bus location code that is generated as '00-xxxxxxxx' where xxxxxxxx is equivalent to the
node's unit address. Refer to the system unit service guide for additional information.
Location Codes for CHRP Model Architecture System Units Top of page
Note:
You need to know which system architecture the system unit on which you are working uses. If
you are working with a RSPC model use the Location Codes for RSPC Model Architecture
System Units. If you do not know which model you have, refer to Determining System
Architecture in Diagnostic Information for Multiple Bus Systems before proceeding.
The (CHRP) system unit uses Physical Location Codes in conjunction with AIX Location Codes to
provide mapping of the failing field replaceable units. The location codes are produced by the system
unit's firmware and the AIX operating system.
Note:
Some systems might produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, use the three
rightmost digits.
c00
AIX Install/Maintenance loaded successfully.
c01
Insert the first diagnostic diskette.
c02
Diskettes inserted out of sequence.
c03
The wrong diskette is in diskette drive.
c04
The loading stopped with an irrecoverable error.
c05
A diskette error occurred.
c06
The rc.boot configuration shell script is unable to determine type of boot.
c07
Insert the next diagnostic diskette.
c08
RAM file system started incorrectly.
c09
The diskette drive is reading or writing a diskette.
c20
An unexpected halt occurred, and the system is configured to enter the kernel debug program
instead of entering a system dump.
c21
The ifconfig command was unable to configure the network for the client network host.
c22
The tftp command was unable to read client's ClientHostName info file during a client network
boot.
c24
Unable to read client's ClientHostName.info file during a client network boot.
c25
Client did not mount remote miniroot during network install.
c26
Client did not mount the /usr file system during the network boot.
c29
The system was unable to configure the network device.
c31
Select the console display for the diagnostics. To select No console display, set the key mode
switch to Normal then to Service. The diagnostic programs then load and run the diagnostics
automatically. If you continue to get the message, check the cables and make sure you are using
the serial port.
c32
A directly attached display (HFT) was selected.
c33
A TTY terminal attached to serial ports S1 or S2 was selected.
c34
A file was selected. The console messages store in a file.
c35
No console found.
c40
Configuration files are being restored.
c41
Could not determine the boot type or device.
c42
Extracting data files from diskette.
c43
Cannot access the boot/install tape.
c44
Initializing installation database with target disk information.
c45
Cannot configure the console.
c46
Normal installation processing.
c47
Could not create a physical volume identifier (PVID) on disk.
c48
Prompting you for input.
c49
Could not create or form the JFS log.
c50
Creating root volume group on target disks.
c51
No paging devices were found.
c52
Changing from RAM environment to disk environment.
c53
Not enough space in the /tmp directory to do a preservation installation.
c54
Installing either BOS or additional packages.
c55
Could not remove the specified logical volume in a preservation installation.
c56
Running user-defined customization.
c57
Failure to restore BOS.
c58
Displaying message to turn the key.
c59
Could not copy either device special files, device ODM, or volume group information from RAM
to disk.
c61
Failed to create the boot image.
c62
Loading platform dependent debug files.
c63
Loading platform dependent data files.
c64
Failed to load platform dependent data files.
c70
Problem Mounting diagnostic CD-ROM disc.
c99
Diagnostics have completed. This code is only used when there is no console.
Fxx
(xx is any number) Refer to Firmware chapter of the service manual.
The following dump progress indicators, or dump status codes, are part of a Type 102 message.
Note:
When a lowercase c is listed, it displays in the lower half of the character position. Some systems
produce 4-digit codes, the two leftmost positions can have a blanks or zeros. Use the two
rightmost digits.
0c0
The dump completed successfully.
0c1
The dump failed due to an I/O error.
0c2
A dump, requested by the user, is started.
0c3
The dump is inhibited.
0c4
The dump device is not large enough.
0c5
The dump did not start, or the dump crashed.
0c6
Dumping to a secondary dump device.
0c7
Reserved.
0c8
The dump function is disabled.
0c9
A dump is in progress.
0cc
Unknown dump failure
Crash Codes
Note:
Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, use the three
rightmost digits.
The crash codes that follow are part of a Type 102 message. These crash codes are grouped into three
categories:
Category 1
Dump analysis is the appropriate first action in Problem Determination, begin the Problem
Category 2
Dump analysis most likely will not aid in Problem Determination, begin the Problem
Determination process with hardware support.
Category 3
Both software and hardware support may be needed in Problem Determination, go to MAP 0070:
888 Sequence in Operator Panel Display in Diagnostic Information for Multiple Bus Systemsto
assist in problem isolation.
Category 1
300
Data storage interrupt from the processor.
32x
Data storage interrupt because of an I/O exception from IOCC.
38x
Data storage interrupt because of an I/O exception from SLA.
400
Instruction storage interrupt.
700
Program interrupt.
Category 2
200
Machine check because of a memory bus error.
201
Machine check because of a memory timeout.
202
Machine check because of a memory card failure.
203
Machine check because of a out of range address.
204
Machine check because of an attempt to write to ROS.
205
Machine check because of an uncorrectable address parity.
206
Machine check because of an uncorrectable ECC error.
207
Machine check because of an unidentified error.
208
Machine check due to an L2 uncorrectable ECC.
500
External interrupt because of a scrub memory bus error.
501
External interrupt because of an unidentified error.
51x
External interrupt because of a DMA memory bus error.
52x
External interrupt because of an IOCC channel check.
53x
External interrupt from an IOCC bus timeout; x represents the IOCC number.
54x
External interrupt because of an IOCC keyboard check.
800
Floating point is not available.
Category 3
000
Unexpected system interrupt.
558
600
AIX 4.3.3.3 and above: Alignment Interrupt. If pre-AIX 4.3.3.3: AIX has crashed because the
Portability Assist Layer (PAL) for this machine type has detected a problem.
605
AIX has crashed because the Portability Assist Layer (PAL) for this machine type has detected a
problem (AIX 4.3.3.3 and above).
The numbers in this list display on the operator panel as the system loads the operating system and
prepares the hardware by loading software drivers.
Note:
Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, use the three
rightmost digits.
2E6
The PCI Differential Ultra SCSI adapter or the Universal PCI Differential Ultra SCSI adapter
being configured.
2E7
Configuration method unable to determine if the SCSI adapter type is SE or DE type.
440
9.1GB Ultra SCSI Disk Drive being identified or configured.
441
18.2GB Ultra SCSI Disk Drive being identified or configured.
444
2-Port Multiprotocol PCI Adapter (ASIC) being identified or configured.
447
PCI 64-bit Fibre Channel Arbitrated Loop Adapter being configured.
500
Querying Standard I/O slot.
501
Querying card in Slot 1.
502
Querying card in Slot 2.
503
Querying card in Slot 3.
504
Querying card in Slot 4.
505
Querying card in Slot 5.
506
Querying card in Slot 6.
507
Querying card in Slot 7.
508
Querying card in Slot 8.
510
Starting device configuration.
511
Device configuration completed.
512
Restoring device configuration files from media.
513
Restoring basic operating system installation files from media.
516
Contacting server during network boot.
517
Mounting client remote file system during network IPL.
518
Remote mount of the root (/) and /usr file systems failed during network boot.
520
Bus configuration running.
521
/etc/init invoked cfgmgr with invalid options; /etc/inithas been corrupted or incorrectly modified
(irrecoverable error).
522
The configuration manager has been invoked with conflicting options (irrecoverable error).
523
The configuration manager is unable to access the ODM database (irrecoverable error).
524
The configuration manager is unable to access the config.rules object in the ODM database
(irrecoverable error).
525
The configuration manager is unable to get data from a customized device object in the ODM
database (irrecoverable error).
526
The configuration manager is unable to get data from a customized device driver object in the
ODM database ( irrecoverable error).
527
The configuration manager was invoked with the phase 1 flag; running phase 1 at this point is not
permitted (irrecoverable error).
528
The configuration manager cannot find sequence rule, or no program name was specified in the
ODM database (irrecoverable error).
529
The configuration manager is unable to update ODM data (irrecoverable error).
530
The program savebase returned an error.
531
The configuration manager is unable to access the PdAt object class (irrecoverable error).
532
There is not enough memory to continue (malloc failure); irrecoverable error.
533
The configuration manager could not find a configuration method for a device.
534
The configuration manager is unable to acquire database lock (irrecoverable error).
535
HIPPI diagnostics interface driver being configured.
536
The configuration manager encountered more than one sequence rule specified in the same phase
(irrecoverable error).
537
The configuration manager encountered an error when invoking the program in the sequence rule.
538
The configuration manager is going to invoke a configuration method.
539
The configuration method has terminated, and control has returned to the configuration manager.
541
A DLT tape device is being configured.
549
Console could not be configured for the Copy a System Dump Menu.
551
IPL vary-on is running.
552
IPL vary-on failed.
553
IPL phase 1 is complete.
554
The boot device could not be opened or read, or unable to define NFS swap device during network
boot.
555
An ODM error occurred when trying to vary-on the rootvg, or unable to create an NFS swap
device during network boot.
556
Logical Volume Manager encountered error during IPL vary-on.
557
The root filesystem does not mount.
558
There is not enough memory to continue the system IPL.
559
Less than 2 M bytes of good memory are available to load the AIX kernel.
569
FCS SCSI protocol device is being configured (32 bits).
570
Virtual SCSI devices being configured.
571
HIPPI common function device driver being configured.
572
HIPPI IPI-3 master transport driver being configured.
573
HIPPI IPI-3 slave transport driver being configured.
574
HIPPI IPI-3 transport services user interface device driver being configured.
575
A 9570 disk-array driver being configured.
576
Generic async device driver being configured.
577
Generic SCSI device driver being configured.
578
Generic commo device driver being configured.
579
Device driver being configured for a generic device.
580
HIPPI TCPIP network interface driver being configured.
581
Configuring TCP/IP.
582
Configuring Token-Ring data link control.
583
Configuring an Ethernet data link control.
584
Configuring an IEEE Ethernet data link control.
585
Configuring an SDLC MPQP data link control.
586
Configuring a QLLC X.25 data link control.
587
Configuring a NETBIOS.
588
Configuring a Bisync Read-Write (BSCRW).
589
SCSI target mode device being configured.
590
Diskless remote paging device being configured.
591
Configuring an LVM device driver.
592
Configuring an HFT device driver.
593
Configuring SNA device drivers.
594
Asynchronous I/O being defined or configured.
595
X.31 pseudo-device being configured.
596
SNA DLC/LAPE pseudo-device being configured.
597
OCS software being configured.
598
OCS hosts being configured during system reboot.
599
Configuring FDDI data link control.
59B
FCS SCSI protocol device being configured (64 bits).
5C0
Streams-based hardware drive being configured.
5C1
Streams-based X.25 protocol being configured.
5C2
Streams-based X.25 COMIO emulator driver being configured
5C3
Streams-based X.25 TCP/IP interface driver being configured.
5C4
FCS adapter device driver being configured.
5C5
SCB network device driver for FCS being configured.
5C6
AIX SNA channel being configured.
600
Starting network boot portion of /sbin/rc.boot.
602
Configuring network parent devices.
603
/usr/lib/methods/defsys, /usr/lib/methods/cfgsys, or /usr/lib/methods/cfgbus failed.
604
Configuring physical network boot device.
605
Configuration of physical network boot device failed.
606
Running /usr/sbin/ifconfig on logical network boot device.
607
/usr/sbin/ifconfig failed.
608
Attempting to retrieve the client.info file with tftp.Note that a flashing 608 indicates multiple
attempt(s) to retrieve the client_info file are occurring.
609
The client.info file does not exist or it is zero length.
60B
18.2GB 68-pin LVD SCSI Disk Drive being configured.
610
Attempting remote mount of NFS file system.
611
Remote mount of the NFS file system failed.
612
Accessing remote files; unconfiguring network boot device.
614
Configuring local paging devices.
615
Configuration of a local paging device failed.
616
Converting from diskless to dataless configuration.
617
Diskless to dataless configuration failed.
618
Configuring remote (NFS) paging devices.
619
Configuration of a remote (NFS) paging device failed.
61B
36.4GB 80-pin LVD SCSI Disk Drive being configured.
61D
36.4GB 80-pin LVD SCSI Disk Drive being configured.
61E
18.2GB 68-pin LVD SCSI Disk Drive being configured.
620
Updating special device files and ODM in permanent filesystem with data from boot RAM
filesystem.
621
9.1 GB LVD 80-pin SCSI Drive being configured.
622
Boot process configuring for operating system installation.
62D
9.1GB 68-pin LVD SCSI Disk Drive being configured.
62E
9.1GB 68-pin LVD SCSI Disk Drive being configured.
636
TURBROWAYS 622 Mbps PCI MMF ATM Adapter.
637
Dual Channel PCI-2 Ultra2 SCSI Adapter being configured.
638
4.5GB Ultra SCSI Single Ended Disk Drive being configured.
639
9.1GB 10K RPM Ultra SCSI Disk Drive (68-pin).
63A
See 62D.
63B
9.1GB 80-pin LVD SCSI Disk Drive being configured.
63C
See 60B.
63D
18.2GB 80-pin LVD SCSI Disk Drive being configured.
63E
63F
See 61B.
640
9.1GB 10K RPM Ultra SCSI Disk Drive (80-pin).
646
High-Speed Token-Ring PCI Adapter being configured.
64A
See 62E.
64B
9.1GB 80-pin LVD SCSI Disk Drive being configured.
64C
See 61E.
64D
18.2 GB LVD 80-pin Drive/Carrier being configured.
64E
36.4GB 68-pin LVD SCSI Disk Drive being configured.
64F
See 61D.
650
IBM SCSD disk drive being configured.
653
18.2GB Ultra-SCSI 16-bit Disk Drive being configured.
655
GXT130P Graphics adapter being configured.
657
GXT2000P graphics adapter being configured.
658
659
2102 Fibre Channel Disk Subsystem Controller Drawer being identified or configured.
660
2102 Fibre Channel Disk Array being identified or configured.
662
Ultra2 Integrated SCSI controller.
663
The ARTIC960RxD Digital Trunk Quad PCI Adapter or the ARTIC960RxF Digital Trunk
Resource Adapter being configured.
664
32x (MAX) SCSI-2 CD-ROM drive being configured.
667
PCI 3-Channel Ultra2 SCSI RAID Adapter being configured.
669
PCI Gigabit Ethernet Adapter being configured.
66C
10/100/1000 Base-T EthernetPCI Adapter.
66D
PCI 4-Channel Ultra-3 SCSI RAID Adapter.
66E
4.7 GB DVD-RAM drive.
674
ESCON(R) Channel PCI Adapter being configured.
677
PCI 32-bit Fibre Channel Arbitrated Loop Adapter being configured.
67B
PCI Cryptographic Coprocessor being configured.
682
20x (MAX) SCSI-2 CD-ROM Drive being configured.
689
4.5GB Ultra SCSI Single Ended Disk Drive being configured.
68C
20 GB 4-mm Tape Drive being configured.
68E
POWER GXT6000P PCI Graphics Adapter.
690
9.1GB Ultra SCSI Single Ended Disk Drive being configured.
69b
64-bit/66MHz PCI ATM 155 MMF PCI adapter being configured.
69d
64-bit/66MHz PCI ATM 155 UTP PCI adapter being configured.
6CC
SSA disk drive being configured.
700
A 1.1 GB 8-bit SCSI disk drive being identified or configured.
701
A 1.1 GB 16-bit SCSI disk drive being identified or configured.
702
A 1.1 GB 16-bit differential SCSI disk drive being identified or configured.
703
A 2.2 GB 8-bit SCSI disk drive being identified or configured.
704
A 2.2 GB 16-bit SCSI disk drive being identified or configured.
705
The configuration method for the 2.2 GB 16-bit differential SCSI disk drive is being run. If an
irrecoverable error occurs, the system halts.
706
A 4.5 GB 16-bit SCSI disk drive being identified or configured.
707
A 4.5 GB 16-bit differential SCSI disk drive being identified or configured.
708
A L2 cache being identified or configured.
710
POWER GXT150M graphics adapter being identified or configured.
711
Unknown adapter being identified or configured.
712
Graphics slot bus configuration is executing.
713
The IBM ARTIC960 device being configured.
714
A video capture adapter being configured.
715
The Ultramedia Services audio adapter being configured. (this number displays briefly on the
panel).
717
TP Ethernet Adapter being configured.
718
GXT500 Graphics Adapter being configured.
720
Unknown read/write optical drive type being configured.
721
Unknown disk or SCSI device being identified or configured.
722
723
Unknown CD-ROM being identified or configured.
724
Unknown tape drive being identified or configured.
725
Unknown display adapter being identified or configured.
726
Unknown input device being identified or configured.
727
Unknown async device being identified or configured.
728
Parallel printer being identified or configured.
729
Unknown parallel device being identified or configured.
730
Unknown diskette drive being identified or configured.
731
PTY being identified or configured.
732
Unknown SCSI initiator type being configured.
733
7GB 8 mm tape drive being configured.
734
4x SCSI-2 640 MB CD-ROM Drive being configured.
736
Quiet Touch keyboard and speaker cable being configured.
741
745
16GB 4 mm Tape Auto Loader being configured.
746
SCSI-2 Fast/Wide PCI Adapter being configured.
747
SCSI-2 Differential Fast/Wide PCI Adapter being configured.
749
7331 Model 205 Tape Library being configured.
751
SCSI 32-bit SE F/W RAID Adapter being configured.
754
1.1GB 16-bit SCSI disk drive being configured.
755
2.2GB 16-bit SCSI disk drive being configured.
756
4.5GB 16-bit SCSI disk drive being configured.
757
External 13GB 1.5M/s 1/4 inch tape being configured.
763
SP Switch MX Adapter being configured.
764
SP System Attachment Adapter being configured.
772
4.5GB SCSI F/W Disk Drive being configured.
773
9.1GB SCSI F/W Disk Drive being configured.
774
776
PCI Token-Ring Adapter being identified or configured.
777
10/100 Ethernet Tx PCI Adapter being identified or configured.
778
POWER GXT3000P 3D PCI Graphics adapter being configured.
77B
4-Port 10/100 Ethernet Tx PCI Adapter being identified or configured.
77c
A 1.0 GB 16-bit SCSI disk drive being identified or configured.
783
4 mm DDS-2 Tape Autoloader being configured.
789
2.6 GB External Optical Drive being configured.
78B
POWER GXT4000P PCI Graphics Adapter.
78C
PCI bus configuration executing.
78D
GXT300P 2D Graphics adapter being configured.
790
Multi-bus Integrated Ethernet Adapter being identified or configured.
797
TURBOWAYS(R) 155 UTP/STP ATM Adapter being identified or configured.
798
Video streamer adapter being identified or configured.
799
79c
ISA bus configuration executing.
7C0
CPU/System Interface being configured.
7C1
Business Audio Subsystem being identified or configured.
7cc
PCMCIA bus configuration executing.
800
TURBOWAYS 155 MMF ATM Adapter being identified or configured.
803
7336 Tape Library robotics being configured.
804
8x Speed SCSI-2 CD-ROM Drive being configured.
806
POWER GXT800 PCI Graphics adapter being configured.
807
SCSI Device Enclosure being configured.
80c
SSA 4-Port Adapter being identified or configured.
811
Processor complex being identified or configured.
812
Memory being identified or configured.
813
Battery for time-of-day, NVRAM, and so on being identified or configured, or system I/O control
logic being identified or configured.
814
NVRAM being identified or configured.
815
Floating-point processor test.
816
Operator panel logic being identified or configured.
817
Time-of-day logic being identified or configured.
819
Graphics input device adapter being identified or configured.
821
Standard keyboard adapter being identified or configured.
823
Standard mouse adapter being identified or configured.
824
Standard tablet adapter being identified or configured.
825
Standard speaker adapter being identified or configured.
826
Serial Port 1 adapter being identified or configured.
827
Parallel port adapter being identified or configured.
828
Standard diskette adapter being identified or configured.
831
3151 adapter being identified or configured, or Serial Port 2 being identified or configured.
834
64-port async controller being identified or configured.
835
16-port async concentrator being identified or configured.
836
128-port async controller being identified or configured.
837
16-port remote async node being identified or configured.
838
Network Terminal Accelerator Adapter being identified or configured.
839
7318 Serial Communications Server being configured.
840
PCI Single-Ended Ultra SCSI Adapter being configured.
841
8-port async adapter (EIA-232) being identified or configured.
842
8-port async adapter (EIA-422A) being identified or configured.
843
8-port async adapter (MIL-STD 188) being identified or configured.
844
7135 RAIDiant Array disk drive subsystem controller being identified or configured.
845
7135 RAIDiant Array disk drive subsystem drawer being identified or configured.
846
RAIDiant Array SCSI 1.3GB Disk Drive being configured.
847
16-port serial adapter (EIA-232) being identified or configured.
848
16-port serial adapter (EIA-422) being identified or configured.
849
X.25 Interface Coprocessor/2 adapter being identified or configured.
850
Token-Ring network adapter being identified or configured.
851
T1/J1 Portmaster(R) adapter being identified or configured.
852
Ethernet adapter being identified or configured.
854
3270 Host Connection Program/6000 connection being identified or configured.
855
Portmaster Adapter/A being identified or configured.
857
FSLA adapter being identified or configured.
858
5085/5086/5088 adapter being identified or configured.
859
FDDI adapter being identified or configured.
85c
Token-Ring High-Performance LAN adapter being identified or configured.
861
Optical adapter being identified or configured.
862
Block Multiplexer Channel Adapter being identified or configured.
865
ESCON Channel Adapter or emulator being identified or configured.
866
SCSI adapter being identified or configured.
867
Async expansion adapter being identified or configured.
868
SCSI adapter being identified or configured.
869
SCSI adapter being identified or configured.
870
Serial disk drive adapter being identified or configured.
871
Graphics subsystem adapter being identified or configured.
872
Grayscale graphics adapter being identified or configured.
874
Color graphics adapter being identified or configured.
875
Vendor generic communication adapter being configured.
876
8-bit color graphics processor being identified or configured.
877
POWER Gt3(TM)/POWER Gt4(TM) being identified or configured.
878
POWER Gt4 graphics processor card being configured.
879
24-bit color graphics card, MEV2 being configured.
880
POWER Gt1(TM) adapter being identified or configured.
887
Integrated Ethernet adapter being identified or configured.
889
SCSI adapter being identified or configured.
890
SCSI-2 Differential Fast/Wide and Single-Ended Fast/Wide Adapter/A being configured.
891
Vendor SCSI adapter being identified or configured.
892
Vendor display adapter being identified or configured.
893
Vendor LAN adapter being identified or configured.
894
Vendor async/communications adapter being identified or configured.
895
Vendor IEEE 488 adapter being identified or configured.
896
Vendor VME bus adapter being identified or configured.
897
S/370(TM) Channel Emulator adapter being identified or configured.
898
POWER Gt1x(TM) graphics adapter being identified or configured.
899
3490 attached tape drive being identified or configured.
89c
A multimedia SCSI CD-ROM being identified or configured.
900
GXT110P Graphics Adapter being identified or configured.
901
Vendor SCSI device being identified or configured.
902
Vendor display device being identified or configured.
903
Vendor async device being identified or configured.
904
Vendor parallel device being identified or configured.
905
Vendor other device being identified or configured.
908
POWER GXT1000 Graphics subsystem being identified or configured.
910
1/4GB Fiber Channel/266 Standard Adapter being identified or configured.
911
Fiber Channel/1063 Adapter Short Wave being configured.
912
2.0GB SCSI-2 differential disk drive being identified or configured.
913
1.0GB differential disk drive being identified or configured.
914
5GB 8 mm differential tape drive being identified or configured.
915
4GB 4 mm tape drive being identified or configured.
916
Non-SCSI vendor tape adapter being identified or configured.
917
A 2.0 GB 16-bit differential SCSI disk drive being identified or configured.
918
A 2 GB 16-bit single-ended SCSI disk drive being identified or configured.
920
Bridge Box being identified or configured.
921
101 keyboard being identified or configured.
922
102 keyboard being identified or configured.
923
Kanji keyboard being identified or configured.
924
Two-button mouse being identified or configured.
925
Three-button mouse being identified or configured.
926
5083 tablet being identified or configured.
927
5083 tablet being identified or configured.
928
Standard speaker being identified or configured.
929
Dials being identified or configured.
930
Lighted program function keys (LPFK) being identified or configured.
931
IP router being identified or configured.
933
Async planar being identified or configured.
934
Async expansion drawer being identified or configured.
935
3.5-inch diskette drive being identified or configured.
936
5.25-inch diskette drive being identified or configured.
937
An HIPPI adapter being configured.
938
Serial HIPPI PCI adapter being configured.
942
POWER GXT 100 graphics adapter being identified or configured.
943
A 3480 or 3490 control unit attached to a System/370 Channel Emulator/A adapter are being
identified or configured.
944
100MB ATM adapter being identified or configured.
945
1.0GB SCSI differential disk drive being identified or configured.
946
Serial port 3 adapter being identified or configured.
947
A 730MB SCSI disk drive being configured.
948
Portable disk drive being identified or configured.
949
Unknown direct bus-attach device being identified or configured.
950
Missing SCSI device being identified or configured.
951
670MB SCSI disk drive being identified or configured.
952
355MB SCSI disk drive being identified or configured.
953
320MB SCSI disk drive being identified or configured.
954
400MB SCSI disk drive being identified or configured.
955
857MB SCSI disk drive being identified or configured.
956
670MB SCSI disk drive electronics card being identified or configured.
957
120 MB DBA disk drive being identified or configured.
958
160 MB DBA disk drive being identified or configured.
959
160 MB SCSI disk drive being identified or configured.
960
1.37GB SCSI disk drive being identified or configured.
964
Internal 20 GB 8 mm tape drive identified or configured.
968
1.0 GB SCSI disk drive being identified or configured.
970
Half-inch, 9-track tape drive being identified or configured.
971
150 MB 1/4-inch tape drive being identified or configured.
972
2.3 GB 8 mm SCSI tape drive being identified or configured.
973
Other SCSI tape drive being identified or configured.
974
CD-ROM drive being identified or configured.
975
An optical disk drive being identified or configured.
977
M-Audio Capture and Playback Adapter being identified or configured.
981
540MB SCSI-2 single-ended disk drive being identified or configured.
984
1GB 8-bit disk drive being identified or configured.
985
M-Video Capture Adapter being identified or configured.
986
2.4GB SCSI disk drive being identified or configured.
987
An Enhanced SCSI CD-ROM drive being identified or configured.
989
200MB SCSI disk drive being identified or configured.
990
2.0GB SCSI-2 single-ended disk drive being identified or configured.
991
525MB 1/4-inch cartridge tape drive being identified or configured.
994
5 GB 8 mm tape drive being identified or configured.
995
1.2GB 1/4 inch cartridge tape drive being identified or configured.
996
A single-port, multiprotocol communications adapter being identified or configured.
997
FDDI adapter being identified or configured.
998
2.0 GB 4 mm tape drive being identified or configured.
999
7137 or 3514 Disk Array Subsystem being configured.
D46
Token-Ring cable
D81
T2 Ethernet Adapter being configured.
2530
10/100 Mbps Ethernet PCI Adapter II being configured.
The P2/E1 is the physical location code indicating an Ethernet port built into the P2 planar.
The 10-60 is an AIX location code indicating a PCI parent bus of 10, and a devfunc number of 60
(for more information , see AIX Location Codes).
These physical location codes can appear in many places while running diagnostics; for instance,
Physical location codes provide a mapping of logical functions in a platform (or expansion sites for
logical functions, such as connectors or ports) to their specific locations within the physical structure of
the platform.
The format for the location code is a string of alphanumeric characters separated by a dash (-), slash (/),
pound sign (#), or period (.). The base location is all of the information before the slash (/) or pound sign
(#). It identifies a device that is connected or plugged into the parent. Extended location information
follows the slash (/). It identifies a device that is part of the parent, a connector, or a cable. Cable
information follows the pound sign (#). It identifies a cable that is connected to the parent. The following
are examples:
The period (.) is used to identify sub-locations such as memory DIMMs on a base memory card or a
specific SCSI address. The following are examples:
Depending on the AIX and firmware levels, AIX Diagnostics may include extended location information
when identifying a planar or card. The extended location information or cable information is always
included when identifying a cable or connector. Location codes with extended location information that
display without a description identifying the devices, always identify the cable attached to the port.
The following table lists the assigned values for the location type prefixes. In most cases, the prefix value
assignments were chosen to provide some mnemonic characteristic, so that they would be easier to
remember. The underlined characters in the description field are intended to illustrate this mnemonic
relationship.
Prefix Value
Description (n=instance #)
Rack or drawer unit Un
Drawer unit mounted in a rack Un.n (U0.n if rack
cannot be sensed by
firmware)
Single enclosure platform (No enclosure location
code)
Planar (backplane, system, I/O) Pn
Planar riser card, extender Pn.n
Power/voltage supply, voltage regulator, backup battery Vn
Fan/sensor Fn
LED/LCD operator panel Ln
or
Logical device address n relative to adapter port
CPU/cache card (or pluggable module if on planar) Cn
CPU/cache module on CPU card (if pluggable) Cn.n
Memory card or SIMM/DIMM on planar Mn
Memory SIMM/DIMM on memory card Mn.n
Other extra-function base system cards (for example, service processor) Xn
I/O adapter In
Pluggable modules or daughter cards on I/O adapter In.n
Device in Bay n Dn
Ports/Connectors:
Graphics/video connector Gn
Keyboard/keyboard connector Kn
Mouse/mouse connector On
Serial port Sn
Parallel port Rn
Ethernet connector En
Token Ring connector Tn
SCSI (pronounced scuzzy) connector Zn
Other I/O ports or connectors Qn
SCSI device addresses (including SSA (Serial Storage Architecture))
Primary address (SCSI control unit ID) An
Primary and secondary address (SCSI ID and LUN (Logical Unit Number)) An.n
SCSI device location in SCSI Enclosure Services (SES)
SCSI bank Bn
SCSI bank and bay Bn.n
Undefined prefixes (reserved) H, J, N, Y
Unique device address, this address remains constant independent of which Wn
port the device is attached to.
Location Codes for RSPC Model Architecture System Units Top of page
Notes:
1. RSPC systems are only supported with AIX or Diagnostic versions below 5.2.0
2. You need to know which system architecture the system unit on which you are working uses. If
you are working with a CHRP model, use the Location Codes for CHRP Model Architecture
System Units. If you do not know which model you have, refer to Determining System
Architecture in Diagnostic Information for Multiple Bus Systems before proceeding.
Because the same diagnostic programs are used on all system units, a location code is used to physically
locate a failing device or unit. The location code is displayed along with the service request number
(SRN) when the diagnostic programs isolate a failure. If the location code is not known, you can run the
Display Previous Diagnostic Results service aid to display the results of the last time the diagnostic
programs were run.
AB-CD-EF-GH non-SCSI
AB-CD-EF-G,H SCSI
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| Slot or Adapter Number
Bus Type
For systems that have memory modules that plug directly into the system planar, the
location code is 00-00-00-GH where GH is the memory module slot. For systems that have
memory cards with memory modules, the location code is 00-CD-EF-GH, where CD is the
card slot and GH is the memory module slot.
❍ For L2 caches, GH defines the cache. Values for GH are 1 through 16.
❍ For PCMCIA devices, GH defines the PCMCIA. Values for GH are 1 through 16.
❍ For async devices, GH defines the port on the fanout box. Values are 00 to 15.
❍ For a diskette drive, H defines which diskette drive 1 or 2. G is always 0.
❍ For all other devices, GH is equal to 00.
For integrated adapters, EF-GH is the same as the definition for a pluggable adapter. For example, the
location code for a diskette drive is 01-A0-00-00. A second diskette drive is 01-A0-00-01.
AB-CD-EF-G,H
| | | | |
| | | | Logical Unit Address of SCSI Device
| | | Control Unit Address of SCSI Device
| | Connector ID
| Slot or Adapter Number
Bus Type
Where:
Adapters and cards are identified with only AB-CD. The possible values for AB are as follows:
For pluggable adapters or cards, this is a two-digit slot number in the range from 01 to 99. However, in
the case of ISA cards these numbers do not actually correspond to the physical slot numbers. They simply
are based on the order in which the ISA cards are defined or configured, either by SMIT or the ISA
Adapter Configuration Service Aid.
For integrated adapters, the first character (C) is a letter in the range from A to Z. This letter is based on
the order in which the integrated adapters are defined in residual data. This ensures unique location codes
for the integrated adapters. The second character (D) is set to 0.
Processor-PCI bus
00-00 PCI bus
Memory module in system planar
00-00-00-01
Memory module in card
00-0A-00-01
Integrated PCI adapters
04-A0 ISA bus (Integrated PCI-ISA bridge)
04-B0 Secondary PCI bus (Integrated PCI-PCI bridge)
04-C0 Integrated PCI SCSI controller
Non-integrated PCI adapters
04-01 Any PCI card in slot 1
04-02 Any PCI card in slot 2
Integrated ISA adapters
01-A0 Diskette adapter
01-B0 Parallel port adapter
01-C0 Serial port 1 adapter
01-D0 Serial port 2 adapter
SA38-0509-17
Eighteenth Edition (December 2002)
Before using this information and the product it supports, read the information in Appendix F, “Notices” on page 697.
A reader’s comment form is provided at the back of this publication. If the form has been removed, address comments
to Information Development, Department H6DS-905-6C006, 11501 Burnet Road, Austin, Texas 78758-3493. To send
comments electronically, use this commercial internet address: [email protected]. Any information that you
supply may be used without incurring any obligation to you.
© International Business Machines Corporation 1995, 2002. All rights reserved. Note to U.S. Government Users,
Restricted Rights -- Use, duplication, or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
Contents
About This Book . . . . . . . . . . . . . . . . . . . . . . . ix
Who Should Use This Book . . . . . . . . . . . . . . . . . . . . ix
How to Use This Book . . . . . . . . . . . . . . . . . . . . . ix
ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Related Publications . . . . . . . . . . . . . . . . . . . . . . x
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . x
CSU/SSR Feature Installation . . . . . . . . . . . . . . . . . . . xi
iii
Purpose of This MAP. . . . . . . . . . . . . . . . . . . . . . 63
Chapter 18. MAP 0270: SCSI RAID Problem Resolution and Verification . . . 103
Purpose of This MAP . . . . . . . . . . . . . . . . . . . . . 103
Chapter 20. MAP 0285: Multipath I/O (MPIO) Problem Resolution . . . . . 113
Purpose of This MAP . . . . . . . . . . . . . . . . . . . . . 113
Chapter 22. MAP 0291: Missing Device or Bus Problem Resolution . . . . 121
Purpose of This MAP . . . . . . . . . . . . . . . . . . . . . 121
Contents v
Display Machine Check Error Log . . . . . . . . . . . . . . . . . 194
Display Microcode Level . . . . . . . . . . . . . . . . . . . . 194
Display or Change Bootlist . . . . . . . . . . . . . . . . . . . 194
Display or Change Diagnostic Run-Time Options . . . . . . . . . . . . 195
Display Previous Diagnostic Results . . . . . . . . . . . . . . . . 197
Display Resource Attributes . . . . . . . . . . . . . . . . . . . 197
Display Service Hints . . . . . . . . . . . . . . . . . . . . . 197
Display Software Product Data . . . . . . . . . . . . . . . . . . 198
Display System Environmental Sensors . . . . . . . . . . . . . . . 198
Display Test Patterns . . . . . . . . . . . . . . . . . . . . . 200
Display USB Devices . . . . . . . . . . . . . . . . . . . . . 200
Download Microcode . . . . . . . . . . . . . . . . . . . . . 200
Fault Indicators . . . . . . . . . . . . . . . . . . . . . . . 203
Fibre Channel RAID Service Aids . . . . . . . . . . . . . . . . . 204
Flash SK-NET FDDI Firmware . . . . . . . . . . . . . . . . . . 204
Format Media . . . . . . . . . . . . . . . . . . . . . . . . 205
Gather System Information . . . . . . . . . . . . . . . . . . . 207
Generic Microcode Download . . . . . . . . . . . . . . . . . . 207
Hot Plug Task . . . . . . . . . . . . . . . . . . . . . . . . 207
Identify Indicators . . . . . . . . . . . . . . . . . . . . . . 211
Identify and System Attention Indicators . . . . . . . . . . . . . . . 211
Local Area Network Analyzer . . . . . . . . . . . . . . . . . . . 212
Log Repair Action . . . . . . . . . . . . . . . . . . . . . . 212
Periodic Diagnostics . . . . . . . . . . . . . . . . . . . . . 213
PCI RAID Physical Disk Identify . . . . . . . . . . . . . . . . . . 213
Process Supplemental Media . . . . . . . . . . . . . . . . . . 213
Run Diagnostics . . . . . . . . . . . . . . . . . . . . . . . 214
Run Error Log Analysis . . . . . . . . . . . . . . . . . . . . 214
Run Exercisers . . . . . . . . . . . . . . . . . . . . . . . 214
Save or Restore Hardware Management Policies . . . . . . . . . . . . 217
Save or Restore Service Processor Configuration . . . . . . . . . . . . 217
SCSI Bus Analyzer . . . . . . . . . . . . . . . . . . . . . . 218
SCSD Tape Drive Service Aid . . . . . . . . . . . . . . . . . . 219
Spare Sector Availability . . . . . . . . . . . . . . . . . . . . 220
SSA Service Aid . . . . . . . . . . . . . . . . . . . . . . . 220
System Fault Indicator . . . . . . . . . . . . . . . . . . . . . 220
System Identify Indicator . . . . . . . . . . . . . . . . . . . . 220
Update Disk-Based Diagnostics . . . . . . . . . . . . . . . . . . 220
Update System or Service Processor Flash . . . . . . . . . . . . . . 221
Update System Flash . . . . . . . . . . . . . . . . . . . . . 222
7135 RAIDiant Array Service Aid . . . . . . . . . . . . . . . . . 223
7318 Serial Communications Network Server Service Aid . . . . . . . . . 224
Contents vii
Index . . . . . . . . . . . . . . . . . . . . . . . . . . 699
This book combines operating instructions for hardware diagnostic programs with
Maintenance Analysis Procedures (MAPs), corresponding Service Request Numbers
(SRNs) and three-digit display codes. The SRNs are, in turn, keyed to failing function
codes (FFCs) and corresponding part numbers for specific machine types.
If you are analyzing a system problem, start with Chapter 2, “Start of Call MAP” on
page 15.
The SRN lists in Chapters 28-32 direct you to the correct failing function. A
cross-reference by name, (such as adapter or device) to Failing Function Code can be
found in Chapter 38, “FRU Cross-References” on page 651.
Refer to Chapter 25, “Installation Checkout” on page 135 if you are checking the system
unit or device after installation.
ISO 9000
ISO 9000 registered quality systems were used in the development and manufacturing
of this product.
A reader’s comment form is provided at the back of this publication. If the form has
been removed, address comments to Information Development, Department
H6DS-905-6C006, 11501 Burnet Road, Austin, Texas 78758-3493. To send comments
electronically, use this commercial internet address: [email protected]. Any
information that you supply may be used without incurring any obligation to you.
ix
Related Publications
The following books are used to isolate a problem in the system.
v Adapters, Devices, and Cable Information for Multiple Bus Systems contains
information about common device, adapter, and cabling of the system unit.
v Site and Hardware Planning Information is a planning and site preparation guide.
v Installation Guide contains the instructions for installing the operating system.
v RS/6000 SP System Service Guide contains the service procedures that are specific
to RS/6000 SP systems.
v If you are servicing a clustered Eserver, go to the Start of Call MAP 100 in the
Clustered Eserver Installation and Service Guide.
v The service documentation for the system units or devices contains the service
procedures that are specific to that device or system unit. The service documentation
contains the following:
– Reference information such as data flow, cable diagrams, and specifications
– System unit or device-specific Maintenance Analysis Procedures (MAPs), such as
the power MAP
– Removal and replacement procedures
– The system unit or device installation procedures
– Parts diagrams and parts lists.
Trademarks
The following terms are trademarks of International Business Machines Corporation in
the United States, other countries, or both:
v AIX
v Eserver
v ESCON
v IBM
v LANStreamer
v Portmaster
v POWER Gt1
v POWER Gt3
v POWER Gt4
v pSeries
v RETAIN
v RS/6000
v S/370
v SP
v TURBOWAYS
Other company, product, and service names may be trademarks or service marks of
others.
Preface xi
1 2
Machine Type Model System CSU Features/Options
SSR Installation Customer Installation
7046 (All) Yes FC 2856 & 6309 All Other Features
7236 (All) No All Features None
7248 (All) Yes FC 2856 All Other Features
7316 (All) No All Features None
7317 (All) No All Features None
7318 (All) No All Features None
7319 (All) No All Features None
On a partitioned system, some service aids may only be run in a partition that is set up
with service authority. To set up a partition with service authority, ask the system
administrator to refer to the Hardware Management Console Installation and Operations
Guide.
Some of the devices that are supported by the diagnostic programs also have their own
maintenance documentation. You may want to use the maintenance documentation for
that device before running the diagnostics for the system. Sometimes the maintenance
package for an attached device allows the customer to continue operating the system
while that device is being diagnosed. You can use the diagnostic programs to check the
adapter to which that device is attached.
Begin all problem analysis with Chapter 2, “Start of Call MAP” on page 15.
MAPs may direct you to run diagnostics. Information on how to run the diagnostics, in
various modes, is in Chapter 26, “General Diagnostic Information” on page 139. These
MAPs may direct you to other MAPs or to other service information. Be prepared to
record code numbers and other data while using these MAPs.
Some adapters may show a different SRN than expected if the adapter cannot be
configured. If the SRN or FFC are listed by onscreen diagnostics, use that information
first. Otherwise, use the part number that is on the adapter.
1
About Slow Boot
Some systems support a service mode fast or slow boot. If you suspect a problem in
the base system or can’t otherwise localize the defect, do a slow-mode boot in service
mode.
This can be specified using the System Power Control Menu on the service processor
main menu. A fast-mode boot skips much of the built-in diagnostic testing. A slow-mode
boot may yield a new 8-character error code on the operator panel and new errors in
the service processor error log. If a new error code is reported, use it in subsequent
steps to continue problem analysis.
When you run the diagnostics, more than one SRN or error code may be reported. This
may occur when there are multiple entries in the error log or when the diagnostic tests
detect multiple hardware problems. If additional SRNs are reported from a Previous
Diagnostics Results screen, they should be processed before processing any SRNs
from a problem reporting screen.
The SRNs are normally displayed in the order in which the devices are tested and the
error log entries analyzed. Error log entries logged against a single device type are
displayed in descending order with the newest entry first.
SRNs with a source code of F do not provide maximum isolation. Online diagnostics
must run in Advanced and Problem Determination Mode to obtain maximum isolation.
SRNs with a source code of G are the result of an error log entry.
Note: The priority for multiple SRNs with a source of G is determined by the time
stamp of the failure. Follow the action for the SRN with the earliest time
stamp first.
6. Device SRNs and error codes (5-digit SRNs).
These procedures determine if the PTC resistor is still tripped and then determine if
there is a short somewhere on the SCSI bus.
Note: Some multi-function meters label the leads specifically for voltage
measurements. When using this type of meter to measure
resistance, the plus lead and negative lead my not be labeled
correctly. If you are not sure that your meter leads accurately reflect
the polarity for measuring resistance, repeat this step with the leads
reversed. If the short circuit is not indicated with the leads reversed,
the SCSI bus is not faulted (shorted).
v If the resistance measured was high, proceed to the next step.
5. Reattach the external cable to the adapter, then do the following:
a. Measure across C1 as previously described.
b. If the resistance is still high, in this case above 10 Ohms, then there is no
apparent cause for a PTC failure from this bus. If there are internal
cables attached continue to the “Internal SCSI-2 Single-Ended Bus PTC
Isolation Procedure” on page 7.
Note: The SCSI-2 Fast/Wide and Ultra PCI Adapters use an onboard electronic
terminator on the external SCSI bus. When power is removed from the
adapter, as in the case of this procedure, the terminator goes to a high
impedance state and the resistance measured cannot be verified, other than it
is high. Some external terminators use an electronic terminator, which also
goes to a high impedance state when power is removed. Therefore, this
procedure is designed to find a short or low resistance fault as opposed to the
presence of a terminator or a missing terminator.
Note: Only the probe tips are touching the solder joints. Do not allow the probes to
touch any other part of the component.
Note: Some multi-function meters label the leads specifically for voltage
measurements. When using this type of meter to measure resistance, the
plus lead and negative lead my not be labeled correctly. If you are not
sure that your meter leads accurately reflect the polarity for measuring
resistance, repeat this step with the leads reversed. Polarity is important in
this measurement to prevent forward-biasing diodes which lead to a false
low resistance reading. If the short circuit is not indicated with the leads
reversed, the SCSI bus is not faulted (shorted).
v If the resistance is high and there is no internal cable to reattach, there is no
apparent cause for the PTC resistor diagnostic failure.
v If the resistance is high and there is an internal cable to reattach, proceed to the
next step.
5. Reattach the internal cable to the adapter, then do the following:
a. Measure across C1 as described above.
b. If the resistance is still high, above 25 Ohms, there is no apparent cause for a
PTC failure.
c. If the resistance is less than 10 Ohms, a fault on the internal SCSI bus is
possible. Troubleshoot the internal SCSI bus by disconnecting devices and
terminators. Measure across C1 to determine if the fault has been removed.
Note: The SCSI-2 Fast/Wide and Ultra PCI adapters use an onboard electronic
terminator on the internal SCSI bus. When power is removed from the
adapter, as in the case of this procedure, the terminator goes to a high
impedance state and the resistance measured cannot be verified, other than it
is high. Some internal terminators use an electronic terminator, which also
goes to a high impedance state when power is removed. Therefore, this
procedure is designed to find a short or low resistance fault as opposed to the
presence of a terminator or a missing terminator.
The differential adapter can be identified by the 4-B or 4-L on the external bracket plate.
These procedures determine if the PTC resistor is still tripped and then determine if
there is a short somewhere on the SCSI bus.
This procedure is used for SRNs 637-240 and 637-800 on the Dual-Channel Ultra SCSI
Adapter. If the TERMPWR Shorted LED is lit, use this procedure to help isolate the
source of the problem on the failing channel.
1. Identify the adapter by its label of 4-R on the external bracket. Then, determine if
the failure is on channel A or channel B.
2. The same PTC is used for both the internal and external buses. The PTC protects
the SCSI bus from high currents due to shorts on the cable, terminator, or device. It
is unlikely that the PTC can be tripped by a defective adapter. A fault (short-circuit)
causes an increase in PTC resistance and temperature. The increase in resistance
causes the PTC to halt current flow. The PTC returns to a low resistive and low
temperature state when the fault is removed from the SCSI bus or when the system
is turned off.
Wait 5 minutes for the PTC resistor to fully cool, then retest.
3. If this same error persists, or the TERMPWR Shorted LED is lit, replace the
components of the failing channel in the following order (wait five minutes between
steps):
a. If the failure is on the external cable, replace the following:
1) Cable
2) Device
3) Attached subsystem
4) Adapter
b. If the failure is on the internal cable, replace the following:
1) Cable
2) Device
3) Backplane
4) Adapter
c. If the failure persists, verify that the parts exchanged are in the correct channel
(internal or external, A or B).
If the errors are still occurring, continue isolating the problem by going to “Step
0050-9” on page 53.
64-bit PCI-X Dual Channel SCSI Adapter PTC Failure Isolation Procedure
Use the following procedures if diagnostics testing indicates a potential self-resetting
thermal fuse problem. This procedure is used for SRN 2524-702 on the integrated
dual-channel SCSI adapter in a 7039/651 system.
1. Identify the adapter as the one embedded in the system board. Then, determine if
the failure is on channel 0 or channel 1.
2. The thermal fuse protects the SCSI bus from high currents due to shorts on the
terminator, cable, or device. It is unlikely that the thermal fuse can be tripped by a
If you are servicing an SP system, go to the Start of Call MAP 100 in the RS/6000 SP
System Service Guide.
If you are servicing a clustered Eserver, go to the Start of Call MAP 100 in the
Clustered Eserver Installation and Service Guide.
Note: Do not run the diagnostics until you are instructed to do so.
1. If this system has a Hardware Management Console (HMC) with Service Focal
Point (SFP) or has one or more partitions running operating systems other than AIX,
go to the ″Quick Entry MAP for Systems with Service Focal Point″ in the system
service guide. Otherwise, go to step 2.
2. The Fast Path MAP in Chapter 3 is provided to help you quickly resolve a problem.
Use the Fast Path MAP when you know or have been provided with a symptom.
Use the following table to help determine your next step.
Symptom Action
You do not have a problem or symptom, but Go to Chapter 4, “MAP 0020: Problem
you want to generate one. Determination Procedure” on page 29 in this
manual.
You have been provided with a problem or Go to Chapter 3, “Fast Path MAP” on page 17
symptom. in this manual.
Note: Some of the devices that are supported by the diagnostic programs also have
their own maintenance documentation. You may want to use the maintenance
documentation for that device before running the diagnostics for the system.
Sometimes the maintenance package for the attached device allows the
customer to continue operating the system while that device is being diagnosed.
You can use the diagnostic programs to check the adapter to which that device
is attached.
15
16 Diagnostic Information for Multiple Bus Systems
Chapter 3. Fast Path MAP
In most cases, these procedures direct you to run the Online Diagnostics. If the Online
Diagnostics are not installed, the Standalone Diagnostics should be used.
Notes:
1. If the actions listed for a specific symptom do not resolve the problem, go to
Chapter 4, “MAP 0020: Problem Determination Procedure” on page 29.
2. If you replace a part, use Chapter 23, “MAP 0410: Repair Checkout” on page 125 to
verify the fix.
3. If you are directed to the system unit’s service guide, use the MAPs in the service
guide to complete the repair. If none of the MAPs in the service guide have you
verify the fix, return to this book and use Chapter 23, “MAP 0410: Repair Checkout”
on page 125.
4. If you are servicing an SP system, go to the Start of Call MAP 100 in the SP
System Service Guide.
5. If you are servicing a clustered Eserver, go to the Start of Call MAP 100 in the
Clustered Eserver Installation and Service Guide.
6. If the system you are servicing has a Hardware Management Console (HMC) with
Service Focal Point (SFP), the various codes that may display are all listed as error
codes by the SFP. The table below should be used to interpret the field listed as
″Error code″ by the SFP:
Number of Digits in
SFP Name Characteristic Interpretation
error code
error code any contains a ″-″ SRN
error code 5 does not contain ″-″ SRN
error code 6 does not contain ″-″ error code
error code 8 does not contain ″-″ error code
When possible, run the Online Diagnostics in Service Mode unless you are directed
otherwise.
17
Fast Path MAP Table of Contents
Problem Description Page No.
Eight-Digit Error Codes 19
Six-Digit Error Codes Containing No Dash (-) 19
888 Sequence in Operator Panel Display 19
The System Stops or Hangs With a Value Displayed in 19
the Operator Panel Display
Diagnostic SRNs 20
System Automatically Reboots 20
System does not Reboot When Reset Button Pushed 20
ASYNC Communication Problems 20
SCSI Adapter Problems 21
SCSI Bus Problems 21
Tape Drive Problems 21
CD-ROM Drive Problems 22
SCSI Disk Drive Problems 22
Diskette Drive Problems 23
Token-Ring Problems 23
Ethernet Problems 24
Display Problems 24
Keyboard or Mouse 24
Printer and TTY Problems 25
Other Adapter Problems 25
System Messages 25
Processor and Memory Problems 25
Service Processor Problems 25
Degraded Performance or Installed Memory Mismatch 26
Missing Resources 26
Missing Path on MPIO Resource 26
System Hangs or Loops When Running the Operating 27
System or Diagnostics
Exchanged FRUs Did Not Fix the Problem 27
RAID Problems 27
System Date and Time Problems 27
SSA Problems 28
Power Indicator Light is Not On 28
System Power Problems 28
You Cannot Find the Symptoms in This Table 27
v Modem does not dial out or answer calls Remove the Service Processor and then run
using Service Processor functions. advanced diagnostics on the built-in serial
ports. If an SRN is reported, lookup the SRN
v Service Processor menus do not display or
and do the indicated action. If no SRN is
display incorrectly on a TTY terminal. reported, replace the Service Processor. If the
v Cannot input to the Service Processor problem remains after replacing the Service
menus using a TTY keyboard. Processor, replace the system planar.
Degraded Performance or Installed Memory Mismatch
Degraded Performance or Installed Memory Degraded Performance can be caused by
Mismatch memory problems that cause a reduction in the
size of available memory. To verify that the
system detected the full complement of
installed memory use one of the following
methods based on the level of AIX being used.
v AIX 4.2.1 and Higher
From the Task Selection Menu select the
’Display Resource Attribute’. From the
Resource Selection menu select one of the
listed memory resources. Verify the amount
of memory listed matches the amount
actually installed.
v All Other AIX Versions
Use the Display or Change Configuration
Service Aid to verify that all installed memory
is being detected. The Display Vital Product
Data (VPD) option shows the amount of
memory that is installed.
If an installed memory module or card does
not appear or appears as the wrong size,
replace it. If the problem is not corrected,
replace the card or board that contains the
missing memory.
Missing Resources
Missing Resources Note: ISA resources must be configured before
they appear in the configuration. The ISA
Adapter Configuration Service Aid is used to
configure ISA adapter for Standalone
Diagnostics. SMIT can be used to configure
during Online Diagnostics.
v The system does not retain the calendar 1. Run the sysplanar0 option under the
date after the system has been booted. Advanced Diagnostics in Problem
v The system does not retain the time of day Determination mode. If an SRN is reported,
after the system has been booted. record the SRN and location code
Note: It is normal for the system time of day information and do the indicated action for
to gain or lose a few seconds each month. the SRN.
2. Replace the TOD (NVRAM) battery. If this
does not fix the problem, replace the
system planar.
SSA Problems
You suspect an SSA problem. A potential problem with an SSA adapter exists.
Run diagnostics on the SSA adapter. If the
system has external SSA drives, refer to the
SSA Adapters User’s Guide and Maintenance
Information or the service guide for your disk
subsystem. If the system has internal SSA
drives, go to the SSA MAP in either the system
unit’s service guide or user’s guide.
Power Indicator Light is Not On
A drawer power indicator is not on. Refer to the Entry MAP section of the
Maintenance Analysis Procedures (MAPs) in
the system service manual.
System Power Problem
The system does not power on. Go to the Entry MAP in the service guide.
The system powers on when it should not. Go to the Entry MAP in the service guide.
If you are unable to power the system on, refer to the MAP 1520 in the system’s
service guide.
Note: If another system connects to this system, refer to Chapter 27, “Using
Standalone and Online Diagnostics” on page 153 for pertinent information before
proceeding further.
Step 0020-1
Visually check the system for obvious problems such as unplugged power cables or
external devices that are powered off.
Step 0020-2
Are the Online Diagnostics installed?
Note: If you are uncertain how to answer the above question, answer it yes.
NO If the operating system is running, perform its shutdown procedure. Get help if
needed. Go to “Step 0020-4” on page 30.
YES Go to “Step 0020-3” on page 30.
29
Step 0020-3
Note: When possible, run Online Diagnostics in Service Mode. Online diagnostics
perform additional functions, compared to Standalone Diagnostics.
Run Online Diagnostics in Concurrent Mode when the customer does not let you
power-off the system unit. To run Online Diagnostics in Service Mode, go to substep 5.
If the system unit is already running in the service mode and you want to run online
diagnostics, proceed to the question in this MAP, otherwise do substeps 1 through 4.
1. Log in with root authority or use CE login. If necessary, ask the customer for the
password.
2. Enter the diag -a command to check for missing resources.
a. If you see a command line prompt, proceed to substep 3 below.
b. If the DIAGNOSTIC SELECTION menu is displayed, with the letter M shown next
to any resource, select that resource, then press Commit (F7 key). Follow any
instructions displayed. If you are prompted with a message Do you want to
review the previously displayed error select Yes and press Enter. If an SRN
displays, record it, and go to “Step 0020-15” on page 39. If there is no SRN, go
to substep 3 below.
c. If MISSING RESOURCE menu is displayed, follow any instructions displayed. If
you are prompted with a message Do you want to review the previously
displayed error select Yes and press Enter. If an SRN displays, record it, and
go to “Step 0020-15” on page 39. If there is no SRN, go to substep 3 below.
3. Enter the diag command.
4. Go to “Step 0020-5” on page 32.
5. If the operating system is running, perform its shut down procedure (get help if
needed).
6. Turn off the system unit power and wait 45 seconds before proceeding.
7. Turn on the system unit power.
8. Load the Online Diagnostics in Service Mode (if needed, refer to the service guide).
9. Wait until the Diagnostic Operating Instructions display or the system appears to
have stopped.
Step 0020-4
1. Turn off the system unit power and wait 45 seconds before proceeding.
2. Turn on the system unit power. If the system supports Slow boot (See “About Slow
Boot” on page 2), do a slow boot on the system. If the system does not support
slow boot, do a normal boot
3. Load the Standalone Diagnostics in Service Mode. Refer to the system unit service
guide if needed.
Step 0020-6
Press the Enter key.
Step 0020-7
1. Select the ADVANCED DIAGNOSTICS ROUTINES option.
Notes:
a. If the terminal type is not defined, do so now. You cannot proceed until this is
complete.
b. If you have SRNs from a Previous Diagnostics Results screen, process these
Previous Diagnostics Results SRNs prior to processing any SRNs you may have
received from an SRN reporting screen.
2. If the DIAGNOSTIC MODE SELECTION menu (screen number 801003) displays,
select the PROBLEM DETERMINATION option.
3. Find your system response in the following table. Follow the instructions in the
Action column.
Notes:
1. If the SCSI Enclosure Services device
appears on the Missing Resource list
along with the other resources, select it
first.
2. ISA adapters cannot be detected by the
system. The ISA Adapter Configuration
Service Aid in Standalone Diagnostics
allows the identification and configuration
of ISA adapters.
Step 0020-8
On the DIAGNOSTIC SELECTION or ADVANCED DIAGNOSTIC SELECTION menu,
look through the list of resources to make sure that all adapters and SCSI devices are
listed including any new resources.
Notes:
1. Resources attached to serial and parallel ports may not appear in the resource list.
2. ISA adapters cannot be detected by the system. The ISA Adapter Configuration
Service Aid in Standalone Diagnostics allows the identification and configuration of
ISA adapters.
Did you find the all the adapters or devices on the list?
NO Go to “Step 0020-9”.
YES Go to “Step 0020-11”.
Step 0020-9
Is the new device or adapter an exact replacement for a previous one installed at
same location?
NO Go to “Step 0020-10”.
YES The replacement device or adapter may be defective. If possible, try installing
it in an alternate location if one is available; if it works in that location, then
suspect that the location where it failed to appear has a defective slot;
schedule time to replace the hardware that supports that slot. If it does not
work in alternate location, suspect a bad replacement adapter or device. If you
are still unable to detect the device or adapter, contact your service support
structure.
Step 0020-10
Is the operating system software to support this new adapter or device installed?
NO Load the operating system software.
YES The replacement device or adapter may be defective. If possible, try installing
it in an alternate location if one is available; if it works in that location, then
suspect that the location where it failed to appear has a defective slot;
schedule time to replace the hardware that supports that slot. If it does not
work in alternate location, suspect a bad replacement adapter or device. If you
are still unable to detect the device or adapter, contact your service support
structure.
Step 0020-11
Select and run the diagnostic test problem determination or system verification on one
of the following:
v The resources with which the customer is having problems. If the resource is not
shown on the DIAGNOSTIC SELECTION menu, then run diagnostics on its parent
(the adapter or controller to which the resource is attached).
v The resources you suspect are causing a problem.
v All resources.
Note: When choosing All Resources, interactive tests are not done. If no problem is
found running All Resources you should choose each of the individual
resouces on the selection menu to run diagnostics tests on to do the
interactive tests
Step 0020-12
The following step analyzes a console display problem.
Find your type of console display in the following table. Follow the instructions given in
the Action column.
Step 0020-13
There is a problem with the keyboard.
Find the type of keyboard you are using in the following table. Follow the instructions
given in the Action column.
Step 0020-14
The diagnostics did not detect a problem.
If the problem is related to either the system unit or the I/O expansion box, refer to the
service guide for that unit.
If a problem occurs when running Online Diagnostics but not when running the
Standalone Diagnostics, suspect a software problem.
This is possibly a problem with software or intermittent hardware. If you think that you
have an intermittent hardware problem, go to Chapter 7, “MAP 0040: Intermittent
Problem Isolation” on page 47.
Note: The priority for multiple SRNs with a source of G is determined by the
time stamp of the failure. Follow the action for the SRN with the earliest
time stamp first.
f. Device SRNs and error codes (5-digit SRNs).
If a group has multiple SRNs, it does not matter which SRN is handled first.
2. Find the SRN in Chapter 30, “Using the SRN List” on page 251.
Step 0020-17
The physical location code, AIX location code, or device name displays on system units
with a multiple-line operator panel display if AIX 4.3.3 or later is installed.
Step 0025-1
Determining the Status of the SCSI RAID Subsystem.
Ask the customer to determine the status of the RAID subsystem. Below is an example
of this procedure:
1. Login as root (if not already root).
2. Type smit pdam.
3. Select List PCI SCSI RAID Arrays.
4. Select the adapter associated with the RAID upon which you want to do problem
determination.
5. Press Enter.
Step 0025-2
Is the status of the RAID Array listed as Degraded?
NO Go to “Step 0025-3”.
YES Make a note of the failing drive Channel and ID, and go to Chapter 18, “MAP
0270: SCSI RAID Problem Resolution and Verification” on page 103 to replace
the drive on that channel.
Step 0025-3
Is the status of the RAID Array listed as Reconstructing?
NO Go to “Step 0025-5” on page 42.
YES Go to “Step 0025-4”.
Step 0025-4
Is the RAID Array reconstructing but no percentage is shown?
NO Wait for the RAID Array to complete reconstructing, then go to “Step 0025-12”
on page 44.
YES Go to “Step 0025-5” on page 42.
41
Step 0025-5
Is the status of the RAID Array listed as Defined?
NO Go to “Step 0025-11” on page 43.
YES Go to “Step 0025-6”.
Step 0025-6
Ask the customer to determine the status of the RAID adapter. For example, type:
lsdev -C -l scraidX
Step 0025-7
Ask the customer to remove the adapter and reconfigure it. For example, to remove the
adapter type:
rmdev -l scraidX
Step 0025-8
Ask the customer to configure the defined disk array. An example of this procedure:
1. Type smitty pdam
2. Select Configure a Defined PCI SCSI Disk Array.
3. Select the defined array that you want to make available.
4. A message displays indicating the outcome of the procedure. Make a note of the
message.
5. Press PF10 to exit.
Step 0025-10
Run Diagnostics on the RAID system.
1. Type diag.
2. Select Advanced Diagnostic Routines.
3. Select System Verification.
4. If any menus other than the Advanced Diagnostics Selection menu are displayed,
follow their instructions.
5. When the Advanced Diagnostic Selection menu displays, select scraidX (where
X is the ID of the SCSI RAID adapter). Press F7. (an alternative method of running
steps 2-6 from AIX command line is to type diag -v -d scraidX where X is the ID
of the SCSI RAID adapter).
Step 0025-11
If the RAID array is not online, it is most likely due to multiple drive failures. If you
believe that a failed drive is actually good, ask the customer to revive a failing drive.
Below is an example of this procedure:
1. Login as root ( if not already root).
2. Type smit pdam.
3. Select Revive a failed Drive in a PCI SCSI Disk Array.
If the drive can be recovered, it will be put into Reconstructing or Degraded mode,
repeat “Step 0025-1” on page 41 using the indicated mode. If the drive cannot be
recovered, go to Chapter 18, “MAP 0270: SCSI RAID Problem Resolution and
Verification” on page 103 to replace the drive.
If the resource is in use and you do not wish to take it offline, select Testing Should
Stop. This will show information about the RAID subsystem including the address of the
failing disk drives used by that SCSI RAID channel and ID.
Note: Problem determination will not be performed unless the device is NOT in use.
Step 0025-13
1. Type diag
2. Select Task Selection
3. Select Analyze Adapter Internal Log
CAUTION:
Only for the 4-Channel PCI SCSI RAID adapter, drives will be decoded as
channels 0-3 rather than channels 1-4. So if an error is indicated on
channel 1 ID 4, the error is really on channel 2 ID 4.
Step 0025-14
Are there other RAID problems?
NO No RAID problems were found. Exit this MAP.
YES Contact your service support structure.
Step 0030-1
Some external devices (including rack drawers that contain devices) have their own
problem-determination procedures. If the problem is related to an external device that
has its own problem-determination procedure, run those procedures if not already run. If
they do not correct the problem, continue with this MAP.
Step 0030-2
The problem may have been caused by a resource that has not been tested. System
Checkout tests all resources. If the Online Diagnostics are installed and you are able to
load them, then All Resources under the Diagnostic Selection menu should be run. If
you get a different SRN, then look up the SRN in the SRN chapters and do the listed
action. If you are unable to run All Resources under the Diagnostic Selection menu or
you do not get another SRN when running it, continue with this MAP.
Step 0030-3
If the problem is related to a SCSI device, SCSI bus, or SCSI controller, go to
Chapter 8, “MAP 0050: SCSI Bus Problems” on page 51. If you are unable to isolate
the problem with MAP 0050, continue with “Step 0030-4” on page 46.
45
Step 0030-4
1. Find the resource(s) that are identified by the SRN or error code in the following
table.
2. Perform the first action listed for the resource.
3. If you exchange a FRU or change a switch setting, test the resource again.
4. If the action does not correct the problem, perform the next action until all actions
have been tried. If an action says to exchange a FRU that you have already
exchanged, go to the next action. If an action corrects the problem, go to
Chapter 23, “MAP 0410: Repair Checkout” on page 125.
5. If you perform all of the actions and do not correct the problem, check the Service
Hints service aid for information. If the service aid does not help, call your service
support structure.
Because software or hardware can cause intermittent problems, consider all symptoms
relevant to your problem.
When you exchange a FRU, go to Chapter 23, “MAP 0410: Repair Checkout” on
page 125 to check out the system.
Hardware Symptoms
Note: This table spans several pages.
47
Symptom of Hardware Things to Check For
Problem
System unit powers off a few v Fan speed. Some fans contain a speed-sensing circuit. If one
seconds after powering On. of these fans is slow, the power supply powers the system unit
off.
v Correct voltage at the outlet into which the system unit is
plugged.
v Loose power cables and fan connectors, both internal and
external.
System unit powers off after v Excessive temperature in the power supply area.
running for more than a few
seconds. v Loose cable connectors on the power distribution cables.
v Fans turning at full speed after the system power has been on
for more than a few seconds.
Only internally installed Check the following items that are common to more than one
devices are failing. device:
v Ground connections on all of the disk drives and other types of
drives installed.
v Loose connections on the power cables to the planars, drives,
fans, and battery.
v System unit cooling. Is the input air temperature within limits?
Are all the fans running at full speed? Are any of the vent
areas blocked?
v Signal cables to the diskette drives, and the power supply.
v SCSI device signal cables for loose connectors and
terminators.
v Loose SCSI device address jumpers.
v Possible contamination of any device that has a cleaning
procedure. See the operator guide for cleaning instructions.
v Excessive static electricity.
v Correct voltage at the system unit power outlet
Software Symptoms
Symptom of Software Things to Check For
Problem
Any symptom you suspect is Use the software documentation to analyze software problems.
related to software.
Be sure to check RETAIN® for known problems with your type of
system unit or software.
Software-caused system Check the following software items:
crashes v Is the problem only with one application program?
v Is the problem only with one device?
v Does the problem occur on a recently installed program?
v Was the program recently patched or modified in any way?
v Is the problem associated with any communication lines?
v Check for static discharge occurring at the time of the failure.
For additional information about this adapter, see the Adapters, Devices, and Cable
Information for Multiple Bus Systems.
Considerations
v Remove power from the system before connecting and disconnecting cables or
devices to prevent hardware damage or erroneous diagnostic results.
v Also, use this MAP for SCSI adapters that are built into system boards or I/O boards.
Replace the system board or I/O board when the procedure calls for replacing the
adapter.
v If the failure is a terminator power failure (SRNs xxx-226, xxx-240, xxx-800), always
allow five minutes for the PTC to cool.
v The differential version of the adapter has socket-type terminators to support
high-availability. If this is the adapter’s configuration, the terminators would have
been removed from the adapter. MAP steps requiring the removal of the cable from
the adapter are inapplicable, since an adapter that is not terminated always fails
diagnostics. Proper SCSI diagnostics require proper termination. If the configuration
involves a Y-cable, leave it, with the appropriate terminator, attached to the adapter.
Or, place an external differential terminator on the external port.
v If the system uses shared DASD or high-availability configuration, be sure that the
other system sharing the devices is not using those devices. For additional
information concerning high-availability configurations, see Chapter 1, “Service Hints”
on page 1.
v For intermittent problems that cannot be resolved with this MAP, refer to Chapter 1,
“Service Hints” on page 1.
v If the SCSI bus is attached to a RAID subsystem, refer to the RAID subsystem
documentation for any problem determination.
Step 0050-1
Have recent changes been made to the SCSI configuration?
NO Go to “Step 0050-2” on page 52.
YES Go to “Step 0050-5” on page 52.
51
Step 0050-2
Are there any hot-swap devices controlled by the adapter?
NO Go to “Step 0050-3”.
YES Go to “Step 0050-11” on page 54.
Step 0050-3
Are there any devices other than hot-swappable devices controlled by the
adapter?
NO Go to “Step 0050-4”.
YES Go to “Step 0050-13” on page 54.
Step 0050-4
Is an enclosure or drawer that supports hot-swap devices controlled by the
adapter?
NO Go to “Step 0050-22” on page 57.
YES Go to “Step 0050-15” on page 55.
Step 0050-5
This step handles cases where recent changes have been made to the SCSI
configuration.
Using the first three digits of the SRN, refer to the FFC listing and determine if the
adapter is single-ended or differential.
Step 0050-6
The adapter’s termination jumper settings may be incorrect. Power off the system, and
inspect Jumper J7. Refer to the ″SCSI Cabling″ section of the Adapters, Devices, and
Cable Information for Multiple Bus Systems for the correct jumper settings.
Go to “Step 0050-9”.
Step 0050-8
1. Correct the jumper settings and reinstall the adapter and all cables.
2. Power on the system, and run diagnostics in system verification mode on the
adapter.
Step 0050-9
Check for the following problems:
v Address conflicts between devices.
v Cabling problems such as; configurations that exceed the maximum cable lengths,
missing termination, or excessive termination.
Refer to the Adapters, Devices, and Cable Information for Multiple Bus Systems for
more details about supported SCSI cabling.
Step 0050-10
1. Correct the problem.
2. Power on the system, and run diagnostics in system verification mode on the
adapter.
Step 0050-12
Power off the system. Reconnect the hot-swap devices one at time. After reconnecting
each device, follow this procedure:
1. Power on the system.
2. Rerun the diagnostics on the adapter.
3. If the adapter fails, the problem may be with the last device reconnected. Perform
these substeps:
a. Follow repair procedures for that last device.
b. Rerun diagnostics on the adapter.
c. If diagnostics fail, replace the SES backplane corresponding to the slot for the
device.
d. Rerun diagnostics.
e. If diagnostics fail, replace the last device.
f. Rerun diagnostics on the adapter.
g. If diagnostics pass, go to Chapter 23, “MAP 0410: Repair Checkout” on
page 125. Otherwise, contact your support center.
Note: A device problem can cause other devices attached to the same SCSI
adapter to go into the Defined state. Ask the system administrator to
make sure that all devices attached to the same SCSI adapter as the
device that you replaced are in the Available state.
4. If no errors occur, the problem could be intermittent. Make a record of the problem.
Running the diagnostics for each device on the bus may provide additional
information.
Step 0050-13
This step determines if a device other than a hot-swappable device is causing the
failure. Follow these steps:
Step 0050-14
Reconnect the devices one at time. After reconnecting each device, follow this
procedure:
1. Rerun the diagnostics in system verification mode on the adapter.
2. If there is a failure, the problem should be with the last device reconnected. Follow
the repair procedures for that device, then go to Chapter 23, “MAP 0410: Repair
Checkout” on page 125.
3. If no errors occur, the problem could be intermittent. Make a record of the problem.
Running the diagnostics for each device on the bus may provide additional
information.
Step 0050-15
This step determines if the SCSI Enclosure Services (SES) is the problem. Note that
the SES is referred to as the DASD controller in some systems.
Refer to the system or enclosure service guide to determine if the SES (DASD
controller) is a FRU that plugs into the backplane or is integrated on the backplane.
Step 0050-16
Follow these steps:
1. Power off the system.
2. Remove the SES (DASD controller). Locate the SES (DASD controller) part number
under FFC 199.
3. Power on the system.
4. If the Missing Options menu displays, select the The resource has been turned
off, but should remain in the system configuration option for all the devices that
were disconnected.
Step 0050-17
Follow these steps:
1. Power off the system.
2. Replace the SES (DASD controller).
3. Go to Chapter 23, “MAP 0410: Repair Checkout” on page 125.
Step 0050-18
Follow these steps:
1. Power off the system.
2. Disconnect all cables attached to the adapter. For SCSI differential adapters in a
high-availability configuration, see “Considerations” on page 51.
3. Power on the system.
4. If the Missing Options menu displays, select the The resource has been turned
off, but should remain in the system configuration option for all the devices that
were disconnected.
5. Run the diagnostics in system verification mode on the adapter.
Step 0050-19
Follow these steps:
1. Power off the system.
2. Reconnect the cables to the adapter.
Step 0050-20
Follow these steps:
1. Replace the SES (DASD controller). Locate the SES (DASD controller) part number
under FFC 199.
2. Power on the system.
Step 0050-21
One of the cables remaining in the system is defective. Refer to FFC 190 for the cable
part numbers. Replace the parts one at time in the order listed. Follow these steps for
each FRU replaced:
1. Rerun the diagnostics for the adapter.
2. If there is any failure, continue with the next FRU.
3. If there is no failure, go to Chapter 23, “MAP 0410: Repair Checkout” on page 125.
Step 0050-22
Follow these steps:
1. Power off the system.
2. Disconnect all cables attached to the adapter.
3. Power on the system.
4. If the Missing Options menu displays, select the The resource has been turned
off, but should remain in the system configuration option for all the devices that
were disconnected.
5. Run the diagnostics on the adapter.
Step 0050-23
One of the cables remaining in the system is defective. Refer to FFC 190 for the cable
part numbers. Replace the parts one at time in the order listed. Follow these steps for
each FRU replaced:
1. Rerun the diagnostics for the adapter.
2. If there is any failure, continue with the next FRU.
3. If there is no failure, go to Chapter 23, “MAP 0410: Repair Checkout” on page 125.
Note: The 888 may or may not be flashing on the operator panel display.
Step 0070-1
Perform the following steps to record the information contained in the 888 sequence
message.
1. Wait until the 888 sequence displays.
2. Record, in sequence, every code displayed after the 888. On systems with a 3-digit
or a 4-digit operator panel, you may need to press the system’s ″reset″ button to
view the additional digits after the 888. Stop recording when the 888 digits reappear.
3. Go to “Step 0070-2”.
Step 0070-2
Using the first code that you recorded, use the following list to determine the next step
to use.
Type 102 Go to “Step 0070-3”.
Type 103 Go to “Step 0070-4” on page 60.
Step 0070-3
A Type 102 message generates when a software or hardware error occurs during
system execution of an application. Use the following information to determine the
content of the type 102 message. Descriptions of the crash codes and the dump status
codes are in Chapter 29, “Diagnostics Numbers and Location Codes” on page 225.
Record the Crash code and the Dump Status from the message you recorded in “Step
0070-1”. For an explanation of the Dump Status or the Crash Codes, see Chapter 29,
“Diagnostics Numbers and Location Codes” on page 225.
59
Are there additional codes following the Dump Status?
No Go to “Step 0070-5”.
YES The message also has a type 103 message included in it. Go to “Step 0070-4”
to decipher the SRN and field replaceable unit (FRU) information in the Type
103 message.
Step 0070-4
A Type 103 message generates upon hardware error detection. Use the following steps
and information you recorded in “Step 0070-1” on page 59 to determine the content of
the Type 103 message.
Note: The only way to recover from an 888 type of halt is to turn off the system unit.
Step 0070-5
Perform the following steps:
1. Turn off the system unit power.
2. Turn on the system unit power, and load the online diagnostics in service mode.
3. Wait until one of the following conditions occurs:
v You are able to load the diagnostics to the point where the Diagnostic Mode
Selection menu displays.
v The system stops with an 888 sequence.
v The system appears hung.
Note: Some devices installed in the system may require the loading of supplemental
diskettes for diagnostic support.
Step 0080-1
1. Perform a system shutdown and then, if necessary, turn off the system unit power.
2. Locate the diagnostic CD-ROM disc.
3. Turn on the system unit power, and then load the diagnostic CD-ROM disc into the
CD-ROM drive.
4. Load the Standalone Diagnostics.
5. Wait until the ″Please Define the System Console″ screen displays or all system
activity appears to have stopped.
Step 0080-2
Follow the displayed instructions until the Installed Resources menu displays.
Are all of the installed PCI adapters listed on the Installed Resources menu?
No Go to “Step 0080-3” on page 64 and make a note of all PCI adapters not listed
and their locations.
Yes You may have an intermittent problem. If you think that you have an
intermittent problem, go to Chapter 7, “MAP 0040: Intermittent Problem
Isolation” on page 47.
63
Step 0080-3
1. Perform a system shutdown, and then, if necessary, turn off the system unit power.
2. Remove all but one of the PCI adapters that was not listed on the Installed
Resources menu.
Step 0080-4
Follow the displayed instructions until the Installed Resources menu displays.
Is the adapter that you did not remove shown as an installed resource?
No Record SRN 111-78C and make a note of the adapter you just installed. Look
up the SRN in the SRN listings and perform the indicated action.
Yes Go to “Step 0080-5”.
Step 0080-5
Have you installed all of the removed adapters?
No Go to “Step 0080-6”.
Yes Call your service support structure.
Step 0080-6
1. Perform a system shutdown and then, if necessary, turn off the system unit power.
2. Install one of the remaining removed adapters into its original location.
3. Turn on the system unit power, and load Standalone Diagnostics from the CD-ROM.
4. Wait until the ″Please Define the System Console″ screen displays or all system
activity appears to stop.
Step 0210-1
Read the following information before proceeding.
v Version 4.3.3 and later diagnostics display the part number and the physical location
code, if available, on the Problem Report screen. The parts are listed in
probability-of-failure order.
v Part numbers display if they are available from vital product data (VPD). If the part
number displays, use it in place of the part number in the FFC List. In some cases,
the part number provided by the VPD may be incorrect. In that case, use the part
number in the FFC list.
v When identifying the location of a planar or a plug-in card, the physical location code
may contain the failing FRU’s extended location information. For additional
information, see “Location Codes for RSPC Model Architecture System Units” on
page 242, “Location Codes for CHRP Model Architecture System Units” on page 244,
or the system unit’s service guide.
Go to “Step 0210-2”.
Step 0210-2
Find the failing function codes in the “Failing Function Code List” on page 566, and if
not already recorded, record the FRU part number and description of each FRU.
Step 0210-3
1. Referring to “Service Request Number Lists” on page 252, record the SRN source
code and the failing function codes in the order of their listing, if not already
recorded.
2. Find the failing function codes in the ″Failing Function Code List,″ refer to
Chapter 37, “Failing Function Codes (FFCs)” on page 563, and if not already
recorded, record the part number and description for each FRU.
3. If the operating system is running, perform the operating system’s shutdown
procedure (get help if needed).
4. Turn off the system power.
67
Exchange one of the FRUs (normally the first one listed).
To verify the repair, find the SRN source code that you recorded in the following table.
Go to the step indicated in the Action column.
Step 0210-4
The following steps handle the problems when the system does not detect a resource.
Note: Use the following substeps for the SRNs having source code: C.
1. Turn on the system power.
2. Load Online Diagnostics in Service Mode (refer to the system’s service guide if
needed).
3. Wait until the diagnostics are loaded or the system appears to stop.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
3. When the DIAGNOSTIC MODE SELECTION menu displays, select System
Verification.
Step 0210-6
Look at the failing function codes and the FRU part numbers you recorded for this SRN.
Have you exchanged all the FRUs that correspond to the failing function codes?
NO Go to “Step 0210-7”.
YES The SRN did not identify the failing FRU. Go to Chapter 6, “MAP 0030:
Additional Problem Determination” on page 45.
Step 0210-7
1. After performing a shutdown of the operating system, turn off the system unit power.
2. Remove the new FRU and install the original FRU.
3. Exchange the next FRU in the list.
4. Turn on the system unit power.
5. Load the Online Diagnostics in Service Mode. Refer to the system unit service
guide if needed.
6. Wait until the diagnostics are loaded or the system appears to stop.
Step 0210-9
1. Press Enter.
2. Select the Advanced Diagnostics Routines option.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
3. If the MISSING RESOURCES menu displays, skip the next step and answer the
question below. If it does not display, proceed to the next substep.
4. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option.
Note: The RESOURCE REPAIR ACTION menu allows a repair action to be logged for
the resource being tested. This prevents Error Log Analysis from reporting
problems on FRUs that have been replaced. If the RESOURCE REPAIR
ACTION menu displays, perform the following:
1. Select the resource that has been replaced from the menu.
2. After all selections have been made, select Commit (F7 key).
Step 0210-11
Look at the failing function codes and FRU part numbers you recorded for this SRN.
Have you exchanged all the FRUs that correspond to the failing function codes?
NO Go to “Step 0210-12”.
YES The SRN did not identify the failing FRU. Go to Chapter 6, “MAP 0030:
Additional Problem Determination” on page 45.
Step 0210-12
1. After performing a shutdown of the operating system, turn off the system unit power.
2. Remove the new FRU and install the original FRU.
3. Exchange the next FRU in the list.
4. Turn on the system unit power.
5. Load either the Online or Standalone Diagnostics in Service Mode. Refer to the
system’s service guide if needed.
6. Wait until the diagnostics are loaded or the system appears to stop.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
Step 0210-14
Every time the Problem Determination option is selected from the Diagnostics Mode
Selection menu, the error log for the preceding time period is analyzed, and problems
are assigned the SRN source: G. Although a FRU may have already been replaced,
based on that error log analysis, repeated selection of the Problem Determination
option continues to reflect the same error for a period of time. The System Verification
option does not perform error analysis. Ensure that the indicated failing FRU has not
been replaced in the previous week.
Has the FRU called out by this SRN been replaced within the past week?
NO Go to “Step 0210-8” on page 70.
YES Disregard this SRN. If the Problem Determination option needs to be run,
use the operating system’s errclear command to delete the error log entry for
the replaced resource. If more information is needed, see the operating system
Commands Reference. Go to Chapter 4, “MAP 0020: Problem Determination
Procedure” on page 29.
Step 0210-16
Was on of the missing resources attached to SES, SAF-TE, or DASD backplane?
NO Go to “Step 0210-18”.
YES Go to “Step 0210-17”.
Step 0210-17
A defective backplane may cause phantom devices to appear at specific SCSI
addresses. Look through the configuration of SCSI devices attached to the backplane
you replaced and verify that the devices shown are actually present in the configuration.
If you find any extra SCSI devices that are not present in the system, remove them
from the configuration (check with the system administrator to verify the actual
configuration of SCSI devices that should be attached to the backplane).
Step 0210-18
Follow the displayed instructions. When you finish processing the missing resources,
answer the following:
Step 0210-19
The following steps handle the problem when the machine stops with an 888 sequence
displayed in the operator panel display while it is loading diagnostics.
1. Power on the system and load the Online Diagnostics in Service Mode (refer to the
system’s service guide if needed).
2. Wait until one of the following conditions occurs and answer the question.
v The machine stops for at least three minutes with a steady number in the
operator panel display.
Did the system stop with an 888 sequence in the operator panel display?
NO This completes the repair. Go to Chapter 23, “MAP 0410: Repair Checkout” on
page 125.
YES Go to “Step 0210-20”.
Step 0210-20
1. Read out the message in the operator panel display. If needed see Chapter 9, “MAP
0070: 888 Sequence in Operator Panel Display” on page 59.
2. Find and record the SRN in the message.
3. Find and record the location codes for the FRUs in the message.
Are the SRN and the location codes the same as the SRN you were analyzing?
NO Go to “Step 0210-1” on page 67 and analyze the new SRN.
YES Go to “Step 0210-21”.
Step 0210-21
Look at the failing function codes and FRU part numbers you recorded for this SRN.
Have you exchanged all the FRUs that correspond to the failing function codes?
NO Go to “Step 0210-22” on page 75.
YES The SRN did not identify the failing FRU. Go to Chapter 6, “MAP 0030:
Additional Problem Determination” on page 45.
Did the system stop with an 888 sequence in the operator panel display?
NO This completes the repair. Go to Chapter 23, “MAP 0410: Repair Checkout” on
page 125.
YES Go to “Step 0210-21” on page 74.
Step 0210-23
Is the SRN the Same as the Original SRN?
NO The symptom has changed. Check for loose cards, cables, and obvious
problems. If you do not find a problem, go to the Chapter 4, “MAP 0020:
Problem Determination Procedure” on page 29 and get a new SRN.
YES Go to “Step 0210-24”.
Step 0210-24
1. Exit the Missing Resources menu.
2. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option.
3. Go to “Step 0210-10” on page 71.
Note: The FRU you want to hot plug might have a defect on it that can cause the
hot-plug operation to fail. If, after following the hot- plug procedure, you continue
to get an error message that indicates that the hot-plug operation has failed,
schedule a time for deferred maintenance when the system containing the FRU
can be powered down. Then go to MAP 210, “Step 0210-2” on page 67 and
answer NO to the question Do you want to exchange this FRU as a hot-plug
FRU?.
Attention: If the FRU is a disk drive or an adapter, ask the system administrator to
perform any steps necessary to prepare the device for removal.
Step 0220-1
1. If the system displayed a FRU part number on the screen, use that part number to
exchange the FRU.
If there is no FRU part number displayed on the screen, refer to the SRN listing.
Record the SRN source code and the failing function codes in the order listed.
2. Find the failing function codes in the FFC listing, and record the FRU part number
and description of each FRU.
3. Refer to the removal and replacements procedures in your system’s service guide
or the PCI Adapter Placement Reference Guide to determine if the FRU is
hot-swappable.
Does this system unit support hot-swapping of the first FRU listed?
NO Go to MAP 0210.
YES Go to “Step 0220-2”.
Step 0220-2
Is the FRU a hot-swap power supply or fan?
NO Go to “Step 0220-4” on page 78.
YES Go to “Step 0220-3”.
Step 0220-3
Note: Refer to the system unit’s service guide for removal and replacement
procedures.
1. Remove the old FRU.
2. Install the new FRU.
3. Enter the diag command.
77
Go to “Step 0220-14” on page 81.
Step 0220-4
Is the FRU a hot-plug PCI adapter?
NO Go to “Step 0220-5”.
YES Go to “Step 0220-12” on page 80.
Step 0220-5
Is the FRU a SCSI hot-plug device?
NO Go to “Step 0220-11” on page 79.
YES Go to “Step 0220-6”.
Step 0220-6
Is the hot-plug drive located within a system unit?
NO Go to “Step 0220-8”.
YES Go to “Step 0220-7”.
Step 0220-7
Refer to the system’s service guide in the ″Removing Hot-Plug SCSI Disk Drives″
section under ″Removal and Replacement Procedures″ for information about replacing
hot-plug SCSI Disk Drives.
Step 0220-8
Does the hot-plug drive’s enclosure have a system guide with procedures for
Removing and Replacing SCSI Disk drives?
NO Go to “Step 0220-9”.
YES If a hot-plug procedure exists, use that procedure to remove the old hot-plug
SCSI disk drive and replace it with a new hot-plug SCSI disk drive. Otherwise,
if no hot-plug procedure exists, use the existing procedure to remove the old
SCSI disk drive and replace it with a new SCSI disk drive. Go to “Step
0220-13” on page 80.
Step 0220-9
1. Ask the customer to back up the data on the drive that you intend to replace onto
another drive.
2. Verify that the disk drive is in the Defined state. The amber LED on the hot-plug
disk drive should be off.
Step 0220-10
Using the Hot-Plug Task service aid described in Chapter 28, “Introduction to Tasks and
Service Aids” on page 175, replace the hot-plug drive using the following procedure:
1. Use the List the SES Devices option to show the configuration of the hot-plug
slots. Identify the slot number of the adapter for the FRU you want to replace.
2. Select the REPLACE/REMOVE a Device Attached to an SES Device option.
3. Select the slot which contains the SCSI hot-plug drive you wish to replace. Press
Enter. You will see a fast blinking green light on the front on the hot-plug drive
indicating that it is ready for removal.
Note: Refer to service guide for the system unit or enclosure that contains the
hot-plug drive for removal and replacement procedures.
4. Remove the old hot-plug drive.
5. Install the new hot-plug drive. Once the hot-plug drive is in place, press Enter.
6. Press the exit key. Wait while configuration is done on the drive, until you see the
″hot-plug task″ on the service aid menu.
Step 0220-11
Using the Hot Plug Task service aid described in Chapter 28, “Introduction to Tasks and
Service Aids” on page 175, replace the hot-plug drive using the hot plug RAID service
aid:
Note: The drive you wish to replace must be either a SPARE or FAILED drive. Otherwise,
the drive would not be listed as an IDENTIFY AND REMOVE RESOURCES
selection within the RAID HOT PLUG DEVICES screen. In that case you must
ask the customer to put the drive into FAILED state. Refer them to the System
Management Guide for more information. Ask the customer to back up the data
on the drive that you intend to replace.
1. Select the RAID HOT PLUG DEVICES option within the HOT PLUG TASK under
DIAGNOSTIC SERVICE AIDS.
2. Select the RAID adapter that is connected to the RAID array containing the RAID
drive you wish to remove, then select COMMIT.
3. Choose the IDENTIFY option in the IDENTIFY AND REMOVE RESOURCES menu.
4. Select the physical disk which you wish to remove from the RAID array and press
Enter.
5. The disk will go into the IDENTIFY state, indicated by a flashing light on the drive.
Verify that it is the physical drive you wish to remove, then press Enter.
6. At the IDENTIFY AND REMOVE RESOURCES menu, choose the REMOVE option
and press Enter.
7. A list of the physical disks in the system which may be removed will be displayed. If
the physical disk you wish to remove is listed, select it and press Enter. The
physical disk will go into the REMOVE state, as indicted by the LED on the drive. If the
Note: There are no elective tests to run on a RAID drive itself under diagnostics
(the drives are tested by the RAID adapter).
9. This completes the repair. Return the system to the customer. Ask the customer to
add the physical disk drive to the original configuration within the RAID. Refer them
to system management guide for more information.
Step 0220-12
1. Remove the old adapter FRU and install the new adapter FRU. Refer to the
″Replacing a Hot-Pluggable PCI Adapter″ section within the ″Removal and
Replacement Procedures″ section of the system service guide.
2. Enter the diag command.
3. Go to the FUNCTION SELECTION menu, and select the Advanced Diagnostics
Routines option.
4. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option.
5. Go to “Step 0220-14” on page 81.
Step 0220-13
1. If not already running diagnostics, enter the diag command.
Note: If you are already running service mode diagnostics and have just performed
the Configure Added/Replaced Devices task (under the SCSI Hot Swap
manager of the Hot Plug Task service aid), you must use the F3 key to
return to the DIAGNOSTIC OPERATING INSTRUCTIONS menu before
proceeding with the next step, or else the drive might not appear on the
resource list.
2. Go to the FUNCTION SELECTION menu, and select the Advanced Diagnostics
Routines option.
3. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option.
Does the hot-plug SCSI disk drive you just replaced appear on the resource list?
NO Verify that you have correctly followed the procedures for replacing hot-plug
SCSI disk drives in the system service guide. If the disk drive still does not
Step 0220-14
Run the diagnostic test on the FRU you just replaced.
Step 0220-15
1. Use the Log Repair Action option in the TASK SELECTION menu to update the
AIX error log. If the repair action was reseating a cable or adapter, select the
resource associated with your repair action. If it is not displayed on the resource list,
select sysplanar0.
Note: On systems with a Fault Indicator LED, this changes the Fault Indicator LED
from the ″Fault″ state to the ″Normal″ state.
2. While in diagnostics, go to the FUNCTION SELECTION menu. Select the
Advanced Diagnostics Routines option.
3. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option. Run the diagnostic test on the FRU you just replaced, or
sysplanar0.
Step 0220-16
Does the original problem persist?
NO If a FRU was replaced, run the Log Repair Action service aid under the Online
Diagnostics for the resource that was replaced. If the resource associated with
your action is not displayed on the Resource List, select sysplanar0. If steps
were taken to make the device ready for removal, inform the System
Administrator of the steps required to return the system to the original state.
Go to Chapter 23, “MAP 0410: Repair Checkout” on page 125.
YES Go to “Step 0220-17”.
Step 0220-17
Have you exchanged all the FRUs that correspond to the failing function codes?
NO Go to “Step 0220-18”.
YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in
service mode. If the same SRN is reported in service mode, go to Chapter 6,
“MAP 0030: Additional Problem Determination” on page 45.
Step 0220-18
Note: Before proceeding, remove the FRU you just replaced and install the original
FRU in its place.
Does the system unit support hot-swapping of the next FRU listed?
NO Go to Chapter 11, “MAP 0210: General Problem Resolution” on page 67.
YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in
service mode. If the same SRN is reported in service mode, go to “Step
0220-14” on page 81.
Step 0230-1
1. The last character of the SRN is bit-encoded as follows:
8 4 2 1
| | | |
| | | Replace all FRUs listed
| | Hot-swap is supported
| Software or Firmware could be the cause
Reserved
2. Refer to the last character in the SRN. A 4, 5, 6, or 7 indicates a possible software
or firmware problem.
Step 0230-2
Ask the customer if any software or firmware has been installed recently.
Step 0230-3
Suspect the new software or firmware.
Check with your support center for any known problems with the new software or
firmware.
83
Step 0230-4
Were any FRUs or location code reported with the SRN?
NO Go to “Step 0230-5”.
YES Go to “Step 0230-9”.
Step 0230-5
Run the diagnostics in problem determination mode on sysplanar0.
Step 0230-6
Did the system display: ″Previous Diagnostic Results - Do you want to review the
previously displayed error?″
NO Go to “Step 0230-7”.
YES You have a pending item in the error log for which there is no corresponding
Log Repair Action. To see this error, select YES at the prompt. Information
from the error log displays in order of last event first. Record the error code,
the FRU names and the location code of the FRUs. Go to “Step 0230-7”.
Step 0230-7
Were there any other SRNs that begin with an A00 to A1F reported?
NO Go to “Step 0230-8”.
YES Go to “Step 0230-1” on page 83 and use the new SRN.
Step 0230-8
Attempt to boot the system on slow boot mode.
If the system boots, run the diagnostics in problem determination mode on sysplanar0
Step 0230-9
1. Obtain the list of physical location codes and FRU numbers that were listed on the
Problem Report Screen. The list can be obtained by running the sysplanar0
diagnostics or using the Display Previous Diagnostic Results task.
2. Record the physical location codes and FRU numbers.
3. Refer to the last character in the SRN. A 2, 3, 6, or 7 indicates that hot-swap is
possible.
Step 0230-10
Note: If necessary, refer to the section under ″Reference Information″ in the system
service guide ″Powering the system on and off″ for information on system shut
down and powering the system on and off.
1. If the operating system is running, perform the operating system’s shutdown
procedure.
2. Turn off power to the system.
3. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed
on the Problem Report Screen need to be replaced. For SRNs ending with any
other character, exchange one FRU at a time, in the order listed.
Note: If the Diagnostics Operating Instructions do not display or you are unable to
select the Task Selection option, check for loose cards, cables, and
obvious problems. If you do not find a problem, go to Chapter 4, “MAP
0020: Problem Determination Procedure” on page 29 and get a new SRN.
6. Wait until the Diagnostics Operating Instructions are displayed or the system
appears to stop.
7. Press Enter.
8. Select Diagnostic Routines at the function selection menu.
9. Select System Verification.
10. If a missing options exist, particularly if it is related to the device that was replaced,
resolve the missing options before proceeding
11. Select the Task Selection option.
12. Select the Log Repair Action option.
Step 0230-11
Is the problem the same as the original problem?
NO The symptom has changed. Check for loose cards, cables, and obvious
problems. If you do not find a problem, go to Chapter 4, “MAP 0020: Problem
Determination Procedure” on page 29 and get a new SRN.
YES Go to “Step 0230-12”.
Step 0230-12
Look at the physical location codes and FRU part numbers you recorded.
Note: In substep 16, you will be asked to run exercisers. On a partitioned system,
if the FRU you replaced contains resources that may be allocated to
different partitions (for example, if the FRU is an MCM containing multiple
processors, where the processors may be assigned to different partitions)
you may wish to also run exercisers on those other partitions containing
these resources in addition to running them in the partition that reported the
SRN. If you run exercisers in those other partitions, you will need to repeat
substeps 5 through 18 of this MAP step for each partition in which you run
exercisers.
5. Load Online Diagnostics in service mode (see the system’s service guide if
needed).
Note: If the Diagnostics Operating Instructions do not display or you are unable to
select the Task Selection option, check for loose cards, cables, and
obvious problems. If you do not find a problem, go to Chapter 4, “MAP
0020: Problem Determination Procedure” on page 29 and get a new SRN.
6. Wait until the Diagnostics Operating Instructions are displayed or the system
appears to stop.
7. Press Enter.
8. Select Diagnostic Routines at the function selection menu.
9. Select System Verification.
10. If a missing options exist, particularly if it is related to the device that was replaced,
resolve the missing options before proceeding
11. Select the Task Selection option.
12. Select the Log Repair Action option.
13. Log a repair action for each replaced resource.
14. If the resource associated with your action does not appear on the Resource List,
select sysplanar0.
15. Return to the Task Selection Menu.
16. If the FRU that was replaced was memory, select Run Exercisers and run the
short exerciser on all the resources, otherwise proceed “Step 0230-15” on page 89.
17. After the exercisers are complete, return to the Task Selection Menu.
18. Select Run Error Log Analysis and run analysis on all the resources.
Note: See the hot-swap procedures in the ″Removal and Replacement″ section of your
system unit’s service guide.
1. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed
on the Problem Report Screen must be replaced. For SRNs ending with any other
character, exchange one FRU at a time, in the order listed.
Note: If CE Login is not available, have the system administrator enter superuser
mode and then enter the diag command.
3. After the Diagnostics Operating Instructions display, press Enter.
4. Select the Task Selection option.
5. Select the Log Repair Action option.
6. If the resource associated with your action is not displayed on the Resource List,
select sysplanar0.
7. Log a repair action for each replaced resource.
8. Return to the Task Selection menu.
9. Select Run Exercisers and run the short exerciser on all resources.
10. Use the Log Repair Action option in the Task Selection menu to update the AIX
error log. If the repair action was reseating a cable or adapter, select the resource
associated with your repair action. If it is not displayed on the resource list, select
sysplanar0.
Note: On systems with a Fault Indicator LED, this changes the Fault Indicator
LED from the ″fault″ state to the ″normal″ state.
Step 0230-16
Look at the physical location codes and FRU part numbers you recorded.
Step 0230-17
1. Remove the new FRU and install the original FRU.
2. Exchange the next FRU in the list.
Note: The following steps may require that the system be rebooted to invoke Array bit
steering, so you may wish to schedule deferred maintenance with the system
administrator to arrange a convenient time to reboot their system.
Step 0235-1
Was the SRN A11-560?
NO Go to “Step 0235-3”.
YES Go to “Step 0235-2”.
Step 0235-2
Logged in as root or using CE Login, at the AIX command line type diag then press
enter. Use the Log Repair Action option in the TASK SELECTION menu to update the
AIX error log. Select sysplanar0.
Note: On systems with fault indicator LED, this changes the fault indicator LED from
the FAULT state to the NORMAL state.
Were there any other errors on the resource reporting the array bit steering
problem?
NO Go to “Step 0235-4” on page 92.
YES Resolve those errors before proceeding.
Step 0235-3
Logged in as root or using CE Login, at the AIX command line type diag then press
enter. Use the Log Repair Action option in the TASK SELECTION menu to update the
AIX error log. Select procx, where x is the processor number of the processor number
of the processor that reported the error.
Note: On systems with fault indicator LED, this changes the fault indicator LED from
the FAULT state to the NORMAL state.
91
Step 0235-4
Schedule deferred Maintenance with the customer. When it is possible, reboot the
system to invoke Array Bit steering
Go to “Step 0235-5”.
Step 0235-5
After the system has been rebooted, log in as root or use CE Login. At the AIX
command line, run diagnostics in problem determination mode to determine if the array
bit steering was able to correct the problem.
If diagnostics are not run (for instance, if the system returns to Resource Selection
menu after running diagnostics in problem determination mode) or there is no problem
on the resource that originally reported the problem, then array bit steering was able to
correct the problem. Exit this MAP and go to Chapter 23, “MAP 0410: Repair Checkout”
on page 125.
If a problem is reported, use the SRN to resolve the problem. Go to the Chapter 3,
“Fast Path MAP” on page 17 with this symptom.
Step 0240-1
1. If the system displayed a FRU part number on the screen, use that part number to
exchange the FRU.
If there was no FRU part number displayed on the screen, find your SRN in the
“Service Request Number List” on page 255.
2. Record the SRN source code and the failing function codes in the order of their
listing.
3. Find the function codes in the “Failing Function Code List” on page 566 and record
the part number and description for each FRU.
4. If the operating system is running, perform the operating system’s shutdown
procedure (get help if needed).
5. Turn off the system power.
Go to “Step 0240-2”.
Step 0240-2
1. Turn off the system power.
2. Load either the Online or Standalone Diagnostics in Service Mode (refer to the
system’s service guide if needed).
3. Wait until the diagnostics are loaded or the system appears to stop.
93
Step 0240-3
1. Press Enter.
2. When the FUNCTION SELECTION menu is displayed, select the Advanced
Diagnostics option.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
3. When the DIAGNOSTIC MODE SELECTION menu is displayed, select the System
Verification option.
4. If the ADVANCED DIAGNOSTIC SELECTION menu is displayed, select the
sysplanar0 option. If the RESOURCE SELECTION MENU is displayed, select
Memory test.
Step 0240-4
Have you exchanged all the FRUs that correspond to the failing function codes?
NO Go to “Step 0240-5”.
YES Go to “Step 0240-7” on page 95.
Step 0240-5
1. After performing a system shutdown, turn off the system power.
2. Remove the new FRU and install the original FRU.
3. Exchange the next FRU in the list. If more than one memory module is listed,
replace all of the memory modules simultaneously.
4. Turn on the system power.
5. Load either the Online or Standalone Diagnostics in Service Mode (refer to the
system unit’s service guide if needed).
6. Wait until the diagnostics are loaded or the system appears to stop.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
3. Select the Advanced Diagnostics option.
4. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option.
5. If the ADVANCED DIAGNOSTIC SELECTION menu is displayed, select the
sysplanar0 option. If the RESOURCE SELECTION MENU is displayed, select
Memory test.
Step 0240-7
Look at the FRU descriptions you recorded for this SRN.
Step 0240-8
1. After performing a shutdown, turn off the system power.
2. Exchange the planar or memory card that contains the memory module.
3. Turn on the system power.
4. Load either the Online or Standalone Diagnostics in Service Mode (refer to the
system unit’s service guide if needed).
5. Wait until the diagnostics are loaded or the system appears to stop.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
3. When the DIAGNOSTIC MODE SELECTION menu is displayed, select the System
Verification option.
4. If the ADVANCED DIAGNOSTIC SELECTION menu is displayed, select the
sysplanar0 option. If the RESOURCE SELECTION MENU is displayed, select
Memory test.
Step 0250-1
The last three or four digits of the SRN following the dash (-) match a failing function
code number. Refer to “Failing Function Code List” on page 566 and find the failing
function code that matches the last three digits of your SRN. Record the FRU part
number and description.
Does this system unit contain only one of this kind of FRU?
NO Go to “Step 0250-2”.
YES Go to “Step 0250-3”.
Step 0250-2
One of the multiple FRUs of this kind is defective.
Remove this kind of FRU one at a time. Test the system unit after each FRU is
removed. When the test is successful or all FRUs of this kind have been removed.
Step 0250-3
1. Turn off the system unit power.
2. Exchange the FRU identified in “Step 0250-2” or the FRU identified by the last three
or four digits of the SRN following the dash (-).
3. Turn on the system power.
4. Load either the Online or Standalone Diagnostics in Service Mode (refer to the
system unit’s service guide if needed).
5. Wait until the diagnostics are loaded or the system appears to stop.
97
Were you able to load the diagnostics?
NO The symptom changed. Check for loose cards, cables, and obvious problems.
If you do not find a problem, go to the system unit’s service guide.
YES Go to “Step 0250-4”.
Step 0250-4
1. Press Enter.
2. When the FUNCTION SELECTION menu is displayed, select the Advanced
Diagnostics option.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
3. When the DIAGNOSTIC MODE SELECTION menu is displayed, select the System
Verification option.
Step 0250-5
Run diagnostics on the FRU that you exchanged.
Step 0260-1
The last three or four digits of the SRN following the dash (-) match a failing function
code number.
Look at the “Failing Function Code List” on page 566 and find the failing function code
that matches the last three or four digits of your SRN, following the dash. Record the
FRU part number and description (use the first FRU part listed when multiple FRUs are
listed).
The physical location code, AIX location code, or device name displays on system units
with multiple-line LCD operator panel display if AIX 4.3.3 or higher operating system is
installed.
Do you have either a physical location code or AIX location code displayed?
NO Go to “Step 0260-4” on page 100.
YES Go to “Step 0260-2”.
Step 0260-2
Are there any FRUs attached to the device described by the physical location
code or AIX location code?
No Go to “Step 0260-6” on page 100
Yes Go to “Step 0260-3”
Step 0260-3
Remove this kind of FRU attached to the device described in the location code one at a
time. Note whether the system still hangs after each device is removed. Do this until
you no longer get a hang, or all attached FRUS have been removed from the adapter
or device.
99
Step 0260-4
Does your system unit contain only one of this kind of FRU?
NO Go to “Step 0260-5”.
YES Go to “Step 0260-6”.
Step 0260-5
One of the FRUs of this kind is defective.
Remove this kind of FRU one at a time. Test the system unit after each FRU is
removed. When the test completes successfully or when you have removed all of the
FRUs of this kind.
Step 0260-6
1. Turn off the system unit.
2. Exchange the FRU identified by the location code or “Step 0260-5”.
Step 0260-7
1. Turn on the system unit.
2. Load the Standalone diagnostics (if needed, refer to the Service Guide).
3. Wait until the Diagnostic Operating Instructions display or the system appears to
have stopped.
Step 0260-8
1. Turn on the system unit.
2. Load the Online Diagnostics in Service Mode (if needed, refer to the Service
Guide).
3. Wait until the Diagnostic Operating Instructions display or the system appears to
have stopped.
Step 0260-9
Look at the operator panel display.
Is the number displayed the same as the last three or four digits after the dash (-)
of your SRN?
NO The symptom changed. Check for loose cards, cables, and obvious problems.
If you do not find a problem, go to Chapter 4, “MAP 0020: Problem
Determination Procedure” on page 29 and get a new SRN.
YES Go to “Step 0260-10”.
Step 0260-10
Was the FRU you exchanged an adapter or a planar?
NO Go to “Step 0260-11”.
YES Go to MAP 1540 in the service guide for this system unit.
Step 0260-11
Was the FRU you exchanged a device?
NO Go to MAP 1540 in the service guide for this system unit.
YES Go to “Step 0260-12”.
Step 0260-12
The adapter for the device may be causing the problem.
1. Turn off the system unit.
2. Exchange the adapter for the device.
Note: If the AIX operating system is not used on the system, start diagnostics from
an alternate source.
3. Turn on the system unit. If c31 is displayed, follow the displayed instructions to
select a console display.
4. Load the Online Diagnostics in Service Mode (if needed, refer to the service guide).
5. Wait until the DIAGNOSTIC OPERATING INSTRUCTIONS display or the system
appears to have stopped.
Chapter 17. MAP 0260: System Hangs During Resource Configuration 101
102 Diagnostic Information for Multiple Bus Systems
Chapter 18. MAP 0270: SCSI RAID Problem Resolution and
Verification
Note: This MAP assumes that the RAID adapter and drive microcode is at the correct
level. To check microcode level, see “SCSI RAID Descriptions and Diagnostic
Procedures” on page 169.
Attention: If the FRU is a disk drive or an adapter, ask the system administrator to
perform any steps necessary to prepare the device for removal.
Step 0270-1
1. If the system displayed a FRU part number on the screen, use that part number. If
there is no FRU part number displayed on the screen, refer to the SRN listing.
Record the SRN source code and the failing function codes in the order listed.
2. Find the failing function codes in the FFC listing, and record the FRU part number
and description of each FRU.
Go to “Step 0270-2”.
Step 0270-2
Is the FRU a RAID drive?
NO Go to “Step 0270-6” on page 106.
YES Go to “Step 0270-3” on page 104.
103
Step 0270-3
If the RAID drive you want to replace is not already in the failed state, then ask the
customer to run the PCI SCSI Disk Array Manager using smit to fail the drive that you
wish to replace. An example of this procedure is:
1. Login as root.
2. Type smit pdam.
3. Select Fail a Drive in a PCI SCSI Disk Array.
4. Select the appropriate disk array by placing the cursor over that array and press
Enter.
5. Select the appropriate drive to fail based on the Channel and ID called out in
diagnostics.
6. The Fail a Drive screen will appear. Verify that you are failing the correct drive by
looking at the Channel ID row. Press Enter when verified correct. Press Enter again.
7. Press F10 and type smit pdam
8. Select ″Change/Show PCI SCSI RAID Drive Status -> Remove a Failed Drive
9. Select the drive that just failed.
Go to “Step 0270-4”.
Step 0270-4
Using the Hot Plug Task service aid described in “Hot Plug Task” on page 207, replace
the RAID drive using the RAID HOT PLUG DEVICES service aid:
Note: The drive you wish to replace must be either a SPARE or FAILED drive.
Otherwise, the drive would not be listed as an IDENTIFY AND REMOVE
RESOURCES selection within the RAID HOT PLUG DEVICES screen. In that
case you must ask the customer to put the drive into FAILED state. Refer them
to the System Management Guide for more information.
1. Select the RAID HOT PLUG DEVICES option within the HOT PLUG TASK under
DIAGNOSTIC SERVICE AIDS.
2. Select the RAID adapter that is connected to the RAID array containing the RAID
drive you wish to remove, then select COMMIT.
3. Choose the IDENTIFY option in the IDENTIFY AND REMOVE RESOURCES menu.
4. Select the physical disk which you wish to remove from the RAID array and press
Enter.
5. The disk will go into the IDENTIFY state, indicated by a flashing light on the drive.
Verify that it is the physical drive you wish to remove, then press Enter.
6. At the IDENTIFY AND REMOVE RESOURCES menu, choose the REMOVE option
and press Enter.
7. A list of the physical disks in the system which may be removed will be displayed. If
the physical disk you wish to remove is listed, select it and press Enter. The
physical disk will go into the REMOVE state, as indicted by the LED on the drive. If
the physical disk you wish to remove is not listed, it is not a SPARE or FAILED
Note: There are no elective tests to run on a RAID drive itself under diagnostics
(the drives are tested by the RAID adapter).
Go to “Step 0270-5”.
Step 0270-5
If the RAID did not begin reconstructing automatically, then perform the steps that
follow.
Ask the customer to run the PCI SCSI Disk Array Manager using smit. An example of
this procedure is:
1. Login as root.
2. Type smit pdam.
3. Select Change/Show PCI SCSI RAID Drive Status.
4. Select Add a Spare Drive.
5. Select the appropriate adapter.
6. Select the Channel and ID of the drive which was replaced.
7. Press Enter when verified.
8. Press F3 until you are back at the Change/Show PCI SCSI RAID Drive Status
screen.
9. Select Add a Hot Spare.
10. Select the drive you just added as a spare.
11. If there was no hot spare previously installed in the array, then the array will begin
reconstructing immediately. Reconstruction time will vary based on the size of the
RAID array. Allow 1-2 hours for completion.
Chapter 18. MAP 0270: SCSI RAID Problem Resolution and Verification 105
If the state of the RAID Array is reconstructing then it is in process of
reconstructing. If it is optimal, then reconstruction has completed.
5. Press F10 to exit.
Step 0270-6
Is the FRU a RAID adapter base card, RAID adapter cache card, or RAID adapter
battery?
NO Go to “Step 0270-15” on page 108.
YES Go to “Step 0270-7”.
Step 0270-7
Do you want to change the FRU using a hotplug operation?
No Power down the system, and remove the RAID adapter, if necessary refer to
the Removal and Replacement Procedures section of the system Service
Guide. Go to “Step 0270-8”.
Yes Remove the RAID adapter, if necessary refer to the Replacing a
Hot-Pluggable PCI Adapter within the Removal and Replacement Procedures
section of the system Service Guide. Go to “Step 0270-8”.
Step 0270-8
Is the FRU you want to replace a RAID adapter cache card or RAID adapter
battery?
NO Go to “Step 0270-10”.
YES Go to “Step 0270-9”.
Step 0270-9
Replace the FRU onto the existing base card.
Step 0270-10
After physically removing the base card from the system, remove any other good FRUs
(RAID cache card or cache battery) from the RAID base card adapter. Plug these FRUs
on to the replacement RAID base card adapter FRU.
Step 0270-12
Was the replacement FRU a RAID base card?
No NO Go to “Step 0270-14” on page 108.
Yes Go to “Step 0270-13”.
Step 0270-13
Attention: Prior to cabling the SCSI RAID adapter to the subsystem check for
pre-existing configurations on the replacement SCSI RAID Base Card. The replacement
base card can overwrite your system’s configuration data if it already has a
configuration written to it! Check it before cabling up the SCSI RAID Subsystem Array.
Ask to customer to check for pre-existing configuration on the SCSI RAID Base Card.
Below is an example of this procedure:
1. Login as root (if not already root).
2. Type smit pdam.
3. Select List PCI SCSI RAID Arrays.
4. If no RAID Arrays are listed, then there are no preexisting configurations on the
base card.
5. Press F10 key to exit
If a pre-existing configuration exists on the base card, ask the customer to run the PCI
SCSI Disk Array Manager using smitty.
1. Login as root (if not already root)
2. Type smit pdam from the AIX Command prompt (if not already in the RAID
manager)
3. Select Recovery Options
4. Select Clear PCI SCSI RAID Adapter Configuration. Select the adapter which you
just installed. Press Enter″ to confirm.
5. Return to the Recovery Options menu (if not already there). Select Resolve PCI
SCSI RAID Adapter Configuration. Select Accept Configuration on Drives.
Select the adapter which you just installed. Press Enter to confirm. The
configuration on the new adapter should now match the configuration existent on
the drives.
6. Press F10 to exit
Chapter 18. MAP 0270: SCSI RAID Problem Resolution and Verification 107
You may now proceed to cable up the RAID system array.
Go to “Step 0270-16”.
Step 0270-14
Ask the customer to resynchronize the RAID Array configuration. Below is an example
of this procedure:
1. Log in as root (if not already root).
2. Type smit pdam.
3. Select Recovery Options.
4. Select Resolve PCI SCSI RAID Adapter Configuration.
5. Select Retry Current Configuration.
6. Select the appropriate scraid adapter.
7. A message will be displayed as to the success of the operation.
8. Press F10 to exit.
Go to “Step 0270-16”.
Step 0270-15
Other RAID FRUs require that the system be shut down prior to replacement.
1. If the operating system is running, perform the operating system shutdown
procedure (get help if needed).
2. Turn off the system power.
3. Replace the FRU indicated by the FFC.
Go to “Step 0270-16”.
Step 0270-16
Run the diagnostics in system verification mode on the RAID subsystem.
Note: On systems with Fault Indicator LED, this changes the Fault Indicator LED
from the Fault state to the Normal state.
2. While in diagnostics, go to the FUNCTION SELECTION menu. Select the
Advanced Diagnostics Routines option.
3. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option. Run the diagnostic test on scraidX (where X is the RAID adapter
number).
Step 0270-18
Have you exchanged all the FRUs that correspond to the failing function codes?
NO Go to “Step 0270-19”.
YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in
service mode. If the same SRN is reported in service mode, go to Chapter 6,
“MAP 0030: Additional Problem Determination” on page 45.
Step 0270-19
Note: Note: Before proceeding, remove the FRU you just replaced and install the
original FRU in it’s place.
Use the next FRU on the list and go to “Step 0270-2” on page 103.
Chapter 18. MAP 0270: SCSI RAID Problem Resolution and Verification 109
110 Diagnostic Information for Multiple Bus Systems
Chapter 19. MAP 0280: Boot Problem Resolution
Entry Table
Step 0280-1
The system fails to respond to keyboard entries.
This problem is most likely caused by a faulty keyboard, keyboard adapter, or keyboard
cable.
Try the FRUs in the order listed below: (Test each FRU by retrying the failing
operation.)
1. Keyboard
2. Keyboard adapter (normally located on the system board)
3. Keyboard cable (if not included with the keyboard)
Step 0280-2
1. Some systems have a graphic adapter POST. Check your system guide for
information about graphic adapter POSTs. If a graphic adapter POST is supported
and it indicates a failure, follow the procedures in the system guide to resolve the
problem.
2. If a graphic adapter POST is supported and it does not indicate a failure, suspect
the display or display cable.
3. If the system does not have a graphic adapter POST, go to the display problem
determination procedures. If you do not find a problem, replace the graphics
adapter.
111
Step 0280-3
Go to the problem determination procedure for the terminal. If you do not find a
problem, suspect the serial port adapter or terminal cable.
Note: Not all devices support MPIO. Before proceeding with this MAP, make sure that
the devices on both ends of the missing path support MPIO.
Step 0285-1
Look at the problem report screen for the missing path. After the resource name and
FRU, the next column identifies the missing path between resources (for example,
scsi0 -> hdisk1). This indicates the missing path between the two resources, scsi0
(the parent resource) and hdisk1 (the child resource).
Step 0285-2
1. Power off the system.
2. Connect the proper cable between the two resources.
3. Power on the system, rebooting AIX.
4. At the AIX command line, type smitty mpio.
5. Choose MPIO Path Management.
6. Select Enable Paths.
7. Select Enable Paths for a Device.
8. In the Device Name selection, choose the name of the child device (the device at
the end of the path).
9. In the Paths to Enable selection, select the missing path (between the parent and
child device).
10. Press Enter.
113
Step 0285-3
To verify that the device path is present, rerun diag -a.
When the resource list displays, select the child resource that had the missing path. If
the letter P is shown in front of the resource, you still have a path problem. If no letter P
is shown in front of the resource or if the system returns to the command prompt or you
get a message without the resource list being displayed, then the problem has been
resolved.
Step 0285-4
1. Power off the system.
2. Reseat the cables between the two devices that have the missing path.
3. Power on the system, rebooting AIX.
4. At the AIX command line, type smitty mpio.
5. Choose MPIO Path Management.
6. Select Enable Paths.
7. Select Enable Paths for a Device.
8. In the Device Name selection, choose the name of the child device (the device at
the end of the path).
9. In the Paths to Enable selection, select the missing path (between the parent and
child device).
10. Press enter.
Step 0285-5
To verify that the device path is present, rerun diag -a..
When the resource list displays, select the child resource that had the missing path. If
the letter P is shown in front of the resource, you still have a path problem. If no letter P
is shown in front of the resource or if the system returns to the command prompt or you
get a message without the resource list being displayed, then the problem has been
resolved.
Step 0285-7
To verify that the device path is present, rerun diag -a.
When the resource list displays, select the child resource that had the missing path. If
the letter P is shown in front of the resource, you still have a path problem. If no letter P
is shown in front of the resource or if the system returns to the command prompt or you
get a message without the resource list being displayed, then the problem has been
resolved.
Step 0285-8
You may have a problem with the driver interface or connector on the child device.
1. Power off the system.
2. Remove the cables to the child device.
3. Replace the child device.
4. Reattach the cables to the child device.
5. Power on the system, rebooting AIX.
Chapter 20. MAP 0285: Multipath I/O (MPIO) Problem Resolution 115
6. At the AIX command line, type smitty mpio.
7. Choose MPIO Path Management.
8. Select Enable Paths.
9. Select Enable Paths for a Device.
10. In the Device Name selection, choose the name of the child device (the device at
the end of the path).
11. In the Paths to Enable selection, select the missing path (between the parent and
child device).
12. Press enter.
Step 0285-9
Rerun diag -a to verify that the device path is present.
When the resource list displays, select the child resource that had the missing path. If
the letter P is shown in front of the resource, you still have a path problem. If no letter P
is shown in front of the resource or if the system returns to the command prompt or you
get a message without the resource list being displayed, then the problem has been
resolved.
Step 0290-1
Use the “Display Configuration and Resource List” on page 193 to display the resources
that were sensed by the configuration program when the diagnostic programs were
loaded. Go to “Step 0290-2”.
Notes:
1. Supplemental diskettes may be required for specific adapters and devices if service
aids are run from Standalone Diagnostics.
2. ISA adapters cannot be detected by the system. The ISA Adapter Configuration
Service Aid in Standalone Diagnostics allows the identification and configuration of
ISA adapters, based on user input.
3. A resource’s software must be installed on the AIX operating system before a
resource can be detected by the Online Diagnostics.
4. If a parent device and all of its children are the missing resources at the same time,
first use the parent device as the missing resource when going through this MAP. If
the problem with the parent device is resolved so that it is no longer missing, but
any of its children are still missing, use the children when going through this MAP.
Step 0290-2
Is the undetected resource a SCSI device installed in an externally attached
enclosure for a SCSI device(s)?
NO Go to “Step 0290-4”.
YES Go to “Step 0290-3”.
Step 0290-3
Go to the documentation for SCSI devices installed in an externally attached enclosure
for a SCSI device(s), and check the device(s) for proper power, cabling, fans running,
and any other checks available. Return to this step after you check the device.
Step 0290-4
Are you running Standalone Diagnostics?
NO Go to “Step 0290-7” on page 118.
YES Go to “Step 0290-5” on page 118.
117
Step 0290-5
Are multiple devices missing that are connected to the same adapter? (If the
adapter itself is also missing, answer ″No″ to this question.)
NO Go to “Step 0290-7”.
YES Suspect a problem with the device adapter. Run diagnostics on the device
adapter then go to “Step 0290-6”.
Step 0290-6
Did the diagnostics detect a problem with the adapter?
NO Go to “Step 0290-8”.
YES Record the SRN then find the SRN in the SRN List and do the listed action.
Step 0290-7
Take the following steps:
1. Exchange the undetected resource.
2. Use the “Display Configuration and Resource List” on page 193 to display the
resources sensed by the configuration program.
Step 0290-8
Are the missing devices attached to a backplane?
NO Go to “Step 0290-7”.
YES Exchange the backplane then go to “Step 0290-9”.
Note: Before exchanging the backplane check that all cables connected to the
backplane are properly seated and that all cables and connectors are in
good working condition. If a problem is found, correct it, and then go to
Chapter 23, “MAP 0410: Repair Checkout” on page 125.
Step 0290-9
Load the standalone diagnostics; then use the list of resources in the DIAGNOSTIC
SELECTION to determine if devices that were previously missing now appear on the
resource list.
Are the previously missing devices now listed on the resource list?
NO Go to “Step 0290-10” on page 119.
YES Go to Chapter 23, “MAP 0410: Repair Checkout” on page 125.
Step 0291-1
The device may be missing because of a power problem.
Go to “Step 0291-2”.
Step 0291-2
Inspect the cables (signal and power) of the missing device. Be sure all connections are
in place and power is present. Refer to the system or enclosure documentation
containing the power wiring diagram or system cable diagram to locate specific cables,
determine the cable numbering, and check for a problem-determination procedure. Look
for obvious power cabling problems, such as missing or loose cable connectors.
Power problems can sometimes be identified by checking other devices that use the
same power source (such as a diskette drive and a SCSI tape drive, even though they
have different controllers). If other devices that share a power source are reported as
missing devices, suspect the power source as the problem.
121
Step 0291-3
Is the missing device a SCSI device installed in a SCSI Enclosure Services (AIX
resource SESx) or SAF-TE (AIX resource SAFTEx device?
NO Go to “Step 0210-1” on page 67.
YES Go to “Step 0291-4”.
Step 0291-4
Run the Advanced Diagnostics in Problem Determination mode on the SCSI Enclosure
Services or SAF-TE device.
Note: A no trouble found result from the diagnostics indicates that you did not get a
different SRN.
Did you get a different SRN than when you ran the diagnostics previously?
NO Go to “Step 0291-5”.
YES Take the following action:
1. Find the SRN in Chapter 30, “Using the SRN List” on page 251.
Note: If the SRN is not listed a Service Request Number Lists, look for
additional information in the following:
v Any supplemental service manual for the device.
v The diagnostic Problem Report screen.
v The Service Hints service aid in Chapter 27, “Using Standalone and
Online Diagnostics” on page 153.
2. Perform the action listed.
Step 0291-5
Power off the system. Disconnect all hot-swap devices attached to the adapter.
Reconnect the hot-swap devices one at time. After reconnecting each device, do the
following:
1. Power on the system and boot the system in the same mode that you were in when
you received the symptom that led you to this MAP.
2. At AIX command prompt, run missing options (diag -a).
3. Verify whether the device you just added to the system is missing from the system
configuration, or if additional devices have been made missing, the problem may be
with the last device reconnected. Perform these substeps:
a. If the device you just added to the system shows as missing, or if additional
devices were made missing, replace the last device.
b. Rerun missing options (diag -a)
c. If devices are no longer missing, go to Chapter 23, “MAP 0410: Repair
Checkout” on page 125. Otherwise, contact your support center.
Note: A device problem can cause other devices attached to the same SCSI
adapter to go into the Defined state. Ask the system administrator to
Did you get a different SRN than when you ran the diagnostics previously?
NO Go to “Step 0291-5” on page 122.
YES Take the following action:
1. Find the SRN in Chapter 30, “Using the SRN List” on page 251.
Note: If the SRN is not listed a Service Request Number Lists, look for
additional information in the following:
v Any supplemental service manual for the device.
v The diagnostic Problem Report screen.
v The Service Hints service aid in Chapter 27, “Using Standalone and
Online Diagnostics” on page 153.
2. Perform the action listed.
Chapter 22. MAP 0291: Missing Device or Bus Problem Resolution 123
124 Diagnostic Information for Multiple Bus Systems
Chapter 23. MAP 0410: Repair Checkout
Note: Only use Standalone Diagnostics for repair checkout when no other diagnostics
are available on the system. Standalone Diagnostics do not log repair actions.
If you are servicing an SP system, go to the End of Call MAP in the SP System Service
Guide.
If you are servicing a clustered Eserver, go to the End of Call MAP in the Clustered
Eserver Installation and Service Guide.
Step 0410-1
Did you use a hot-swap operation to change the FRU?
NO Go to “Step 0410-2”.
YES Go to “Step 0410-6” on page 126.
Step 0410-2
Note: If the system planar or battery has been replaced and you are loading
diagnostics from a server over a network, it may be necessary for the customer
to set the network boot information for this system before diagnostics can be
loaded. The system time and date information should also be set when the
repair is completed.
Do you have cards, adapters, cables, devices, or any other FRUs that were
removed during problem analysis that you want to put back into the system?
NO Go to “Step 0410-4” on page 126.
YES Go to “Step 0410-3”.
Step 0410-3
1. After performing a system shutdown, turn off the system power and unplug all
power cords (cables) from electrical outlets.
2. Install all of the cards, adapters, cables, devices, and any other FRUs that were
removed during problem analysis.
125
Step 0410-4
Do you want to run the concurrent diagnostics?
NO Go to “Step 0410-9” on page 127.
YES Go to “Step 0410-5”.
Step 0410-5
1. If the system supports Slow boot (See “About Slow Boot” on page 2) do a slow boot
on the system. If the system does not support slow boot, do a normal boot.
2. Power on the system.
3. Wait until the AIX operating system login prompt displays or until apparent system
activity on the operator panel or display has stopped.
Step 0410-6
Is the system or partition containing the FRU running service mode diagnostics?
NO Go to “Step 0410-8”.
YES Go to “Step 0410-7”.
Step 0410-7
Exit service mode diagnostics, and reboot the system or partition that contains the FRU.
Wait for the operating system login prompt or until the activity on the operator panel or
display stops.
Step 0410-8
If the Resource Repair Action menu is already displayed, go to “Step 0410-15” on
page 129. Otherwise, do the following:
1. Log into the operating system either with root authority (if needed, ask the customer
to enter the password) or use the CE login.
2. Enter the diag -a command and check for missing resources. Follow any
instructions that display. If an SRN displays, suspect a loose card or connection. If
no instructions display, no resources were detected as missing.
3. Enter the diag command.
4. Press Enter.
5. Select the Advanced Diagnostics option.
Step 0410-9
Note: If you are uncertain, answer Yes to the following question.
Step 0410-10
1. Load the Standalone diagnostics.
2. Wait until the diagnostics are loaded or the system appears to stop.
3. Press Enter.
4. Select the Advanced Diagnostics option.
Note: If the terminal type is not defined. You are prompted to define it. You cannot
continue until this is done.
5. When the DIAGNOSTIC MODE SELECTION menu displays, choose System
Verification. On the DIAGNOSTIC SELECTION menu, test the FRUs you
exchanged by selecting the diagnostics for the individual FRUs. If the resource
associated with your action is not displayed on the Resource List, select planar0.
Did the TESTING COMPLETE, no trouble was found menu (801010) display?
NO There is still a problem. Go to Chapter 4, “MAP 0020: Problem Determination
Procedure” on page 29.
YES Go to “Step 0410-17” on page 130.
Step 0410-11
Note: If you are already in Service Mode, go to “Step 0410-12” on page 128.
1. Power on the system (if it is not already powered on).
2. Load the Online Diagnostics in Service Mode (refer to the system unit’s service
guide if needed).
Step 0410-12
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined. You are prompted to define it. You cannot
continue until this is done.
3. Select the Log Repair Action option.
4. Select the resource associated with the repair action. If the repair action was
reseating a cable or adapter, select the resource associated with that repair action.
If the resource associated with your repair action is not displayed on the Resource
List, select sysplanar0.
5. Press commit after you have made your selection.
6. Press the key for the previous menu two times to get to the Function Selection
menu.
7. Select the Advanced Diagnostics Routines option.
8. If the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option.
Step 0410-13
Test the FRUs you exchanged, and any devices that are attached to the FRU(s), by
selecting the individual FRUs.
Note: If the resource associated with your action is not displayed on the Resource List,
select sysplanar0.
Note: On systems with a Fault Indicator LED, this changes the Fault Indicator
LED from the Fault state to the Normal state.
Step 0410-15
When a test is run on a resource in System Verification mode, and that resource has an
entry in the AIX error log, if the test on the resource was successful, the RESOURCE
REPAIR ACTION menu displays.
After replacing a FRU, you must select the resource for that FRU from the RESOURCE
REPAIR ACTION menu. This updates the AIX error log to indicate that a
system-detectable FRU has been replaced.
Note: On systems with a Fault Indicator LED, this changes the Fault Indicator LED
from the Fault state to the Normal state.
Do the following:
1. Select the resource that has been replaced from the RESOURCE REPAIR ACTION
menu. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action.
If the resource associated with your action is not displayed on the Resource List,
select sysplanar0.
2. Press Commit after you make your selections.
Step 0410-16
The parent or child of the resource you just replaced may also require that you run the
RESOURCE REPAIR ACTION service aid on it.
After replacing that FRU, you must select the resource for that FRU from the
RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a
system-detectable FRU has been replaced.
Note: On systems with a Fault Indicator LED, this changes the Fault Indicator LED
from the FAULT state to the NORMAL state.
Do the following:
1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the
resource that has been replaced . If the repair action was reseating a cable or
adapter, select the resource associated with that repair action. If the resource
associated with your action is not displayed on the Resource List, select
sysplanar0.
2. Press COMMIT after you make your selections.
3. If the No Trouble Found menu displays, go to “Step 0410-17”.
Step 0410-17
If you changed the Service Processor or Network settings, as instructed in previous
MAPs, restore the settings to the value they had prior to servicing the system. If you
performed service on a RAID subsystem involving changing of the RAID adapter cache
card or changing the configuration on RAID disks, ask the customer to run the PCI
SCSI Disk Array Manager using smitty. Use the Recovery Options selection to resolve
the RAID configuration. To do this, perform the following:
1. At the AIX command line, type smitty pdam.
2. On the PCI SCSI Disk Array Manager screen, select Recovery Options.
3. If a previous configuration exists on the replacement adapter, this must be cleared.
Select Clear PCI SCSI RAID Adapter Configuration. Press F3.
4. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter
Configuration.
5. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept
Configuration on Drives.
6. On the PCI SCSI RAID Adapter selection menu, select the adapter that you
changed.
7. On the next screen, press Enter.
8. When you get the Are You Sure selection menu, press Enter to continue.
9. You should get an OK status message when the recovery is complete. If you get a
Failed status message, verify that you selected the correct adapter, then repeat this
procedure again. When recovery is complete, exit smitty to return to the AIX
command line.
Step 0420-1
1. If the operating system is running, perform the operating system’s shutdown
procedure (get help if needed).
2. Power off the system.
3. Power on the system.
4. Load either the Online or Standalone Diagnostics in Service Mode (refer to the
system unit’s service guide if necessary).
5. Wait until the diagnostics are loaded or the system appears to stop.
Step 0420-2
1. Press Enter.
2. When the FUNCTION SELECTION menu displays, select Advanced Diagnostics.
3. When the DIAGNOSTIC MODE SELECTION menu displays, select the System
Verification option.
Note: If the terminal type is not defined, you are prompted to define it. You cannot
continue until this is done.
4. On the DIAGNOSTIC SELECTION or ADVANCED DIAGNOSTIC SELECTION
menu, look through the list of resources to make sure that all adapters and SCSI
devices are listed including any new resources.
Notes:
a. Resources attached to serial and parallel ports may not appear in the resource
list.
b. ISA adapters cannot be detected by the system. The ISA Adapter Configuration
Service Aid in Standalone Diagnostics allows the identification and configuration
of ISA adapters.
c. If running diagnostics in a partition within a partitioned system, resources
assigned to other partitions are displayed on the resource list.
133
Step 0420-3
Is the new device or adapter an exact replacement for a previous one installed at
same location?
No Go to “Step 0420-4”.
Yes The replacement device or adapter may be defective. If possible, try installing
it in an alternate location if one is available. If it works in that location, suspect
that the location where it failed to appear has a defective slot; schedule time to
replace the hardware that supports that slot. If it does not work in an alternate
location, suspect a bad replacement adapter or device. If you are still unable to
detect the device or adapter, contact your service support structure.
Step 0420-4
Is the operating system software to support this new adapter or device installed?
No Load the operating system software.
Yes The replacement device or adapter may be defective. If possible, try installing
it in an alternate location if one is available. If it works in that location, suspect
that the location where it failed to appear has a defective slot; schedule time to
replace the hardware that supports that slot. If it does not work in an alternate
location, suspect a bad replacement adapter or device. If you are still unable to
detect the device or adapter, contact your service support structure.
Step 0420-5
1. The MISSING RESOURCE menu only displays when a resource was removed or
moved. If the MISSING RESOURCE menu is displayed, follow the instructions.
2. If the ADVANCED DIAGNOSTIC SELECTION menu displays, select the System
Verification option, then use the All Resources option to test the system or select
the individual tests you want to run. If the RESOURCE SELECTION menu is
displayed, select the All Resources option to test the system or select each test
you want to run.
If you have trouble selecting the console display and you are using an attached
terminal, check the TTY Terminal Attributes again.
When you run the diagnostic programs from an attached TTY terminal, the attributes for
the terminal must be set to match the defaults of the diagnostic programs. The TTY
terminal must be attached to the S1 serial port on the system unit.
135
Step 3. Loading the Diagnostics
The diagnostics can be run from a CD-ROM disc, from a locally attached disk, or from
a server if the AIX operating system is installed on the system. If you are not sure
whether the AIX operating system is installed, you can check by turning the system unit
on. If the System Management Service menu displays, the AIX operating system is not
installed.
If the AIX operating system is installed, the diagnostic programs load from a locally
attached disk or from a server. If the AIX operating system is not installed, diagnostics
can be loaded from the diagnostic CD-ROM disc.
The following procedure attempts to load the diagnostics from a disk or from a server. If
they cannot be loaded from a disk or server, the diagnostic CD-ROM disc is used to
load and run the checkout.
1. Set the power switches on all of the attached devices to On.
2. Set the power switch on the system unit to On.
Note: After the first icon displays on the system console, press F6 if you are using
a directly attached console, or press 6 on a TTY console.
3. If the System Management Services menu displays, the AIX operating system is not
installed. Do the following:
a. Insert the diagnostic CD-ROM disc into the CD-ROM drive.
b. Power off the system unit, wait 45 seconds and then power on the system unit.
Note: After the first icon displays on the system console, press F6 if you are
using a directly attached console, or press 6 on a TTY console.
c. If the system stops with an eight-digit error code displayed or stops with an icon
or icons displayed, a problem was detected.
Check for loose cables or cards. If you do not find a problem, go to Chapter 4,
“MAP 0020: Problem Determination Procedure” on page 29.
4. When the diagnostic programs load correctly, the DIAGNOSTIC OPERATING
INSTRUCTIONS display.
Microcode must be installed during system installation or after the AIX operating system
is installed. If the system is using the AIX operating system, all microcode is preinstalled
on the boot disk for all adapters and devices that were shipped with the system.
Microcode is shipped on microcode diskettes, option diskettes and on the boot disk. For
the AIX operating system, runtime microcode maintenance can be selected from the
SMIT INSTALLATION AND MAINTENANCE MENU or from the Diagnostic Service Aid.
The adfutil -m (command and flag) is normally used to install microcode shipped on
option diskettes.
If the system is using another type of operating system, that operating system should
include microcode installation instructions.
If you have the X.25 Interface Coprocessor, the microcode for them is normally not
shipped with the AIX operating system. The microcode for these adapters must be
installed before the adapters can be used. The X.25 Interface Coprocessor microcode
is shipped on an option diskette.
Contact the person that is going to install the software or turn the system over to the
customer.
If needed, go to the AIX operating system Installation Kit to install and configure the AIX
operating system.
Microcode
There are several types of microcode used by the system:
v System firmware microcode
v Service processor microcode
v Adapter microcode
v Device microcode
AIX diagnostic tasks provide the capability to display microcode levels as well as install
microcode for system firmware and service processor. These same functions are also
available for many adapters and devices.
To determine the level of microcode for system firmware, service processor, adapter, or
device, use the Display Microcode Level task. This task presents a list of resources
that are currently installed and supported by this task. The AIX lsmcode command can
also be used to display microcode levels. For additional information, refer to “Display
Microcode Level” on page 194. For adapters and devices not supported by this task,
refer to the manufacturer’s instructions to determine the microcode level.
Use the Update System or Service Processor Flash task to update a system’s
firmware or service processor for CHRP platform systems. Use the Update System
Flash task to update the system’s firmware for RSPC platform systems. Following the
microcode installation, the system automatically reboots to complete the update.
Microcode images can be installed from disk, diskette, or NIM server. For additional
information, refer to “Update System or Service Processor Flash” on page 221 or the
“Update System Flash” on page 222.
Use the Microcode Download task to install microcode for adapters or devices. This
task presents a list of resources that are currently installed and supported by this task.
Microcode images can be installed from disk, diskette, or NIM server. Refer to the
“Download Microcode” on page 200 for additional information. For adapters and devices
with microcode that can be updated but are not supported by this task, refer to the
manufacturer’s instructions.
139
For systems not using AIX, these tasks can be used via the standalone CD or NIM
server. Otherwise refer to the corresponding documentation for the operating system on
installing microcode.
CEREADME File
A CEREADME (CE readme) file is available on all diagnostic media. This file may
contain information such as:
v Errata information for the service guides
v Service hints for problems
v Diagnostic information that may not be included in service guides
v Other pertinent (usually release-specific) information
You can view the CEREADME file by using the Service Hints service aid after the
diagnostics are loaded. Also, you can read the file directly from the disk using the AIX
pg command to display /usr/lpp/diagnostics/CEREADME. The CEREADME file can
be copied or printed using the normal commands. For information about using the
service hints, refer to “Display Service Hints” on page 197.
A copy of this file should be printed and stored with the Service Information. lp0 is
normally the printer attached to the parallel port. If a printer is attached to the parallel
port and is considered as lp0, the command for printing the file is as follows:
cat /usr/lpp/diagnostics/CEREADME > /dev/lp0
If the directory does not exist, the message /tmp/diag: not found displays. Do not
attempt to print the CEREADME file if this message is not displayed. To print the
CEREADME file, choose the appropriate section below and follow the steps listed.
The CEREADME file prints on lp0, which is the printer normally attached to the parallel
port. If this file is not the same as the CEREADME file on the disk, a copy of this file
should be printed and stored with the Service Information.
CE Login
CE login enables a user to perform operating system commands that are required to
service the system without being logged in as a root user. CE login must have a role of
RunDiagnostics and a primary group of system. This enables the user to:
v Run the diagnostics including the service aids, such as hot plug tasks, certify, format,
and so forth.
v Run all the operating system commands run by system group users.
v Configure and unconfigure devices that are not busy.
To use CE login, ask the customer to create a unique user name and configure these
characteristics for that name. (Refer to the Users and Groups section of the AIX 5L
Version 5.1 System Management Guide: Operating System and Devices.) After this is
set up, you will need to obtain the user name and password from the customer to log in
with these capabilities. The recommended CE login user name is qserv.
The automatic diagnostic tests display event indicators to track test progress. If a test
stops, the indicator for that test remains displayed to identify the unsuccessful test.
Built-In Self-Test
Note: This set of programs is not supported on all system units.
The Built-In Self-Test (BIST) programs run first after the system unit is turned on. These
programs test the central electronics complex.
The configuration program displays numbers between 500 and 999 in the operator panel
display. Refer to Chapter 29, “Diagnostics Numbers and Location Codes” on page 225
for a listing of program actions associated with displayed numbers.
Devices attached to serial and parallel ports are not configured. The Dials and Lighted
Program Function Keys (LPFKs) can be tested from Online Diagnostics after they are
manually configured. No other device attached to the serial and parallel ports is
supported by the diagnostics.
If any memory errors were logged, they are reported by the Base System or Memory
Diagnostics, which must be run to analyze the POST results. Normally, most memory
problems that are detected by the POST are isolated to a single FRU.
The CPU and memory cannot be tested after the AIX based diagnostics are loaded;
however, they are monitored for correct operation by various checkers such as
Checkstop or Machine Check. If one of these checks intermittently occurs, it is logged
into the error log.
Single-bit memory errors are corrected by ECC (Error Checking and Correction) on
systems equipped with ECC memory.
Diagnostic Controller
The diagnostic controller runs as an application program on the AIX operating system.
The diagnostic controller carries out the following functions:
v Displays diagnostic menus
v Checks availability of needed resources
v Checks error log entries under certain conditions
v Loads diagnostic application programs
v Loads task and service aid programs
v Displays test results
To test an adapter or device, select the device or adapter from the Diagnostic Selection
menu. The diagnostic controller then loads the diagnostic application program for the
selected device or adapter.
The diagnostic application program loads and runs test units to check the functions of
the device or adapter.
The diagnostic controller checks the results of the tests done by the diagnostic
application and determines the action needed to continue the testing.
The amount of testing that the diagnostic application does depends on the mode
(service, maintenance, or concurrent) under which the diagnostic programs are running.
When you select the Problem Determination option, the diagnostic programs read and
analyze the contents of the error log.
Note: Most hardware errors in the operating system error log contain sysplanar0 as the
resource name. The resource name identifies the resource that detected the
error, it does not indicate that the resource is faulty or should be replaced. Use
the resource name to determine the appropriate diagnostic to analyze the error.
If the error log contains recent errors (approximately the last 7 days), the diagnostic
programs automatically select the diagnostic application program to test the logged
function.
If there are no recent errors logged or the diagnostic application program runs without
detecting an error, the Diagnostic Selection menu is displayed. This menu allows you to
select a resource for testing.
Note: After a FRU is replaced based on an error log analysis program, the error log
entries for the problem device must be removed or the program may continue to
indicate a problem with the device. To accomplish this task, run the errclear
command from the command line, or use System Management Interface Tool
(SMIT) to select Problem Determination / Error Log / Clear the Error Log. Fill
out the appropriate menu items.
If the diagnostic application for the selected resource detects a problem with that
resource, the diagnostic controller checks for an associated resource. For example, if
the test of a disk drive detects a problem, the diagnostic controller tests a sibling device
on the same controller to determine if the drive or the controller is failing. This extra
FRU isolation is apparent when you test a resource and notice that the diagnostic
controller continues to test another resource that you did not select.
The advanced diagnostics run in the same modes as the diagnostics used for normal
hardware problem determination. The advanced diagnostics provide additional testing
by allowing the service representative to do the following:
v Use wrap plugs for testing.
v Loop on a test (not available in concurrent mode) and display the results of the
testing.
Tasks and service aids provide a means to display data, check media, and check
functions without being directed by the hardware problem determination procedure.
Refer to Chapter 28, “Introduction to Tasks and Service Aids” on page 175 for
information and procedures about tasks and service aids.
In diagnostics version 5.2.0 and later, missing devices are identified on the Diagnostic
Selection screen by an uppercase M preceding the name of the device that is missing.
The Diagnostic Selection menu is displayed anytime you run the Diagnostic Routines or
the Advanced Diagnostics Routines. The Diagnostic Selection menu can also be
entered by running diag -a when there are missing devices or missing paths to a
device.
When a missing device is selected for processing, the Missing Resource menu will ask
whether the device has been turned off, removed from the system, moved to a different
physical location, or if it is still present.
When a single device is missing, the fault is probably with that device. When multiple
devices with a common parent are missing, the fault is most likely related to a problem
with the parent device.
The diagnostic procedure may include testing the device’s parent, analyzing which
devices are missing, and any manual procedures that are required to isolate the
problem.
When a device with missing paths is selected from the Diagnostic Selection menu, the
Missing Path Selection menu displays showing the missing paths for the device. The
menu requests the user to select a missing path for processing. If the device has only
one missing path, then the selection menu is bypassed. In either case, a menu is
displayed showing the selected missing path and other available paths to the device
(which may be missing or available). The menu asks if the missing path has been
removed, has not been removed, or should be ignored. The procedures are as follows:
v If the Path Has Been Removed option is selected, diagnostics removes the path
from the data base.
v If the Path Has Not Been Removed option is selected, diagnostics determines why
the path is missing.
Periodic Diagnostics
Periodic testing of the processors is enabled by default. Problems are reported by a
message to the system console, and either a serviceable event sent to the Service
Focal Point for a system with an HMC (hardware management console) or to an e-mail
message sent to all users in the system group.
Periodic diagnostics are controlled by the periodic diagnostic service aid. The periodic
diagnostic service aid allows a hardware resource to be tested once a day. If the
resource cannot be tested because it is busy, error log analysis is performed if
supported on the resource. In diagnostics version 5.1.0.15 and later, periodic
diagnostics only supports testing of processors. Disk drives and system planars are no
longer supported by periodic diagnostics because testing is not supported in concurrent
diagnostics and coverage is provided by automatic error log analysis.
Processors that are dynamically removed from the system are also removed from the
periodic test list. Processors that are dynamically added are automatically added to the
periodic test list.
In diagnostics version 5.1.0.15 and later, periodic diagnostic reminder mode is only run
for processors. Periodic diagnostic reminder mode for processors also performs testing
on the processors. Problems are reported by a message to the system console, and
either a serviceable event sent to the Service Focal Point for a system with an HMC or
an e-mail message sent to all users in the system group.
The diagela program determines if the error should be analyzed by the diagnostics. If
the error should be analyzed, a diagnostic application is invoked and the error is
analyzed. No testing is done if the diagnostics determine that the error requires a
service action. Instead it sends a message to your console, and either the Service
Focal Point for systems with a HMC, or to all system groups. The message contains the
SRN.
Running diagnostics in this mode is similar to using the diag -c -e -d Device command.
Notification can also be customized by adding a stanza to the PDiagAtt object class.
The following example illustrates how a customer’s program can be invoked in place of
the normal mail message, or in addition to sending the message to the Service Focal
Point when there is a HMC:
PDiagAtt:
DClass = " "
DSClass = " "
DType = " "
attribute = "diag_notify"
value = "/usr/bin/customer_notify_ program $1 $2 $3 $4 $5"
rep = "s"
If DClass, DSClass, and DType are blank, then the customer_notify_program applies
for all devices. Filling in the DClass, DSClass, and DType with specifics causes the
customer_notify_program to be invoked only for that device type.
After the above stanza is added to the ODM data base, problems are displayed on the
system console and the program specified in the value field of the diag_notify
predefined attribute is invoked. The following keyword is expanded automatically as
arguments to the notify program:
v $1 the keyword diag_notify
v $2 the resource name that reported the problem
v $3 the Service Request Number
v $4 the device type
v $5 the error label from the error log entry
If DClass, DSClass, and DType are blank, then the customer_analyzer_program applies
for all devices. Filling in the DClass, DSClass, and DType with specifics causes the
customer_analyzer_program to be invoked only for that device type.
After the above stanza is added to the ODM data base, the program specified is
invoked if there is no diagnostic program specified for the error, or if analysis was done,
but no error found. The following keywords expand automatically as arguments to the
analyzer program:
v $1 the keyword diag_analyze
v $2 the resource name that reported the problem
v $3 the error label from the error log entry if from ELA, the keyword PERIODIC if from
Periodic Diagnostics, or the keyword REMINDER if from a Diagnostic Reminder.
v $4 the device type
v $5 the keywords:
– no_trouble_found if the analyzer was run, but no trouble was found.
– no_analyzer if the analyzer is not available.
To activate the Automatic Error log Analysis feature, log in as root user (or use CE
login) and type the following command:
/usr/lpp/diagnostics/bin/diagela ENABLE
To disable the Automatic Error Log Analysis feature, log in as root user (or use CE
login) and type the following command:
/usr/lpp/diagnostics/bin/diagela DISABLE
The diagela program can also be enabled and disabled using the Periodic Diagnostic
Service Aid.
The Log Repair Action task lists all resources. Replaced resources can be selected
from the list, and when commit (F7 key) is selected, a repair action is logged for each
selected resource.
The System Identify Indicator is used to help physically identify a particular system in a
room. The System Fault Indicator is used to help physically identify a particular system
that has a fault condition.
On a system that supports System Fault Indicator, the indicator is set to Fault condition
when a fault is detected. After the problem with the system is fixed, the system fault
indicator should be set back to Normal. This is done by using the Log Repair Action
task. For additional information, see “Log Repair Action”.
Note: This action keeps the System Fault Indicator from being set to the Fault state
due to a previous error, that has already been serviced, in the error log.
Both of these indicator functions can be managed by using the System Identify Indicator
and System Fault Indicator Tasks. See “System Fault Indicator” on page 220 or “System
Identify Indicator” on page 220 for additional information.
This command displays a list of printers currently defined on the system. Only those
printers that are in the available state can be used (for example, those printers marked
as defined cannot be used). Ensure that a printer is actually connected at the location
that was specified in the output of the command.
Note: In the above step, you must substitute for x the value obtained from the lsdev
command.
In the above example, the contents of the CEREADME file will print.
This reconfiguration can be used to correct arrays for faults detected at IPL or run time.
In the case of a fault detected during run time, the recoverable fault is reported with a
″Repair Disposition Pending Reboot″ indicator set. This allows diagnostics to call out a
Service Request Number that identifies the array and directs the Service
Representative to a MAP for problem resolution that uses array bit steering. If the array
bit steering cannot be used for the reported fault, then the FRU with that array is
replaced.
Running Diagnostics on the Native Serial Port on a Full Machine Partition with a
HMC Attached
For a Full Machine Partition, the output of the S1 serial port is redirected, or wrapped to
the virtual console window. When the S1 serial port is wrapped, the output of any
command is directed from the S1 serial port to the virtual console terminal. If you close
the virtual console window on the managed system, normal function is restored to the
S1 serial port.
The implication of this is that the serial port wrap diagnostics may fail with a false SRN
if the virtual console window on the managed system is not closed. Another method to
prevent the false SRN is to unplug the HMC from its port before running the wrap tests.
Online Diagnostics, when installed, are resident with AIX on the disk or server. They
can be booted in single user mode (called service mode), run in maintenance mode
(called maintenance mode), or run concurrently (called concurrent mode) with other
applications. They have access to the AIX Error Log and the AIX Configuration Data.
Attention: If this system unit is attached to another system, be sure you isolate this
system unit before stopping the operating system or running diagnostic programs.
Some system-cabling changes (such as installing wrap plugs or removing a device from
the configuration) may require action by the operator of the attached system before
making the cabling changes on this system.
Attention: The AIX operating system must be installed in order to run Online
Diagnostics. If the AIX operating system is not installed, use the standalone diagnostic
procedures.
153
v When running Online Diagnostics, device support for some devices may not have
been installed. If this is the case, that device does not appear in the resource list.
v When running Standalone Diagnostics, device support for some devices may be
contained on supplemental diagnostic media. If this is the case, the device does not
appear in the resource list when running diagnostics unless the supplemental media
has been processed.
v Support for some TTY terminals is optionally installed. If you attach a TTY terminal to
a system to run diagnostics, it might not work correctly because the AIX support for
the terminal might not be installed.
On systems with an HMC, diagnostics displays on the virtual terminal on the HMC. On
an LPAR system, diagnostics displays on the virtual terminal associated with a given
partition.
When you run diagnostics, you must identify what type of terminal you are using. If the
terminal type is not known when the FUNCTION SELECTION menu is displayed, the
diagnostics do not allow you to continue until a terminal is selected from the DEFINE
TERMINAL option menu. Select LFT for adapter-attached displays.
If running diagnostics from a virtual terminal on a system with an HMC, select VT100
when prompted for the type of terminal emulation.
(Where SPOTNAME is the name of the SPOT from which you want to do a NIM
boot (for example, SPOT433).
Use one of the following methods to determine the amount of available system
memory:
v Run the Display Resource Attributes task for resource.
v Use the Config option under System Management Services (see the system unit
service guide).
v Use the following AIX command:
lsattr -E -l mem0
3. All operations to configure the NIM server require root authority.
To verify that the client system is registered on the NIM server and diagnostic boot is
enabled; from the command line on the NIM server, run the following command:
lsnim -a Cstate -Z ClientName
Note: The ClientName is the name of the system on which you are wanting to run the
Standalone Diagnostics.
0042-053 lsnim: there is no NIM object The client is not registered on the NIM server.
named "ClientName"
Note: RIPL is located under the Utility menu in System Management Services
(SMS) and should be referred to for information on setting these
parameters.
11. If the NIM server is setup to allow the pinging of the client system, use the ping
option in the RIPL utility to verify that the client system can ping the NIM server.
Under the Ping utility, choose the network adapter that provides the attachment to
the NIM server to do the ping operation. If the ping comes back with an OK
prompt, the client is prepared to boot from the NIM server. If ping returns with a
FAILED prompt, the client does not proceed with the boot.
Note: If the ping fails, refer to ″Boot Problems/Concerns″ in the service guide for
the system unit. Then follow the steps for network boot problems.
Use the following procedure to temporarily changes the system bootlist so that the
network adapter attached to the NIM server network, is first in the bootlist:
1. Exit to the SMS Main screen.
2. Depending on the console type, select Multiboot or Select Boot Devices.
3. Depending on the console type, select Boot Sequence or Select Boot Devices.
4. Record the current bootlist settings. (You will have to set the bootlist back to the
original settings after running diagnostics from the NIM server.)
5. Change the bootlist so the network adapter attached to the NIM server is first in the
bootlist.
Verify the settings and the status of the network. If you continue to have problems, refer
to ″Boot Problems/Concerns″ in the service guide for the system unit. Then follow the
steps for network boot problems.
After running diagnostics, reboot the system and use SMS to change the bootlist
sequence back to the original settings.
If a file named servdir.lck is listed, the Electronic Service Agent is already locked out. If
it is not listed, use the following command to create the file:
/usr/lpp/servdir/servdir.lck
Use the following command to check that the servdir.lck file was created (if the file is
present the Electronic Service Agent is locked out):
/usr/lpp/servdir/servdir.lck
Use the following command to check that the servdir.lck file was deleted (the file
should not be present):
ls /usr/lpp/servdir/servdir.lck
If the file is listed, the Electronic Service Agent is still locked out.
Note: The system does not report problems while the service agent is locked out.
When you complete your service action, be sure to unlock the service agent to
allow it to report problems.
Error log analysis is done in service mode when you select the Problem
Determination option on the DIAGNOSTIC MODE SELECTION menu.
To run the Online diagnostics in service mode, use the following steps:
1. Stop all programs including the AIX operating system (get help if needed).
2. Remove all tapes, diskettes, and CD-ROM discs.
3. If the system is running in a full machine partition, turn off the system unit power.
4. If the system is running in a full machine partition, turn on the system unit power.
If the system is running on a logically partitioned system, restart the AIX operating
system in the partition you wish to run online diagnostics.
5. After the first POST indicator appears on the system unit’s console, press F6 on
the directly-attached keyboard or 6 on the TTY keyboard to indicate that
diagnostics are to load.
Note: Pressing the F3 key (from a defined terminal) produces a Confirm Exit
menu which offers two options: continuing with the shut down by pressing
F3; or returning to diagnostics by pressing Enter.
Because the system is running in normal operation, the following resources cannot be
tested in concurrent mode:
v SCSI adapters connected to paging devices
v Disk drive used for paging
v Any graphics-related device running X, CDE, or windowing environment
v Memory
v Processor.
To run the Online diagnostics in concurrent mode you must be logged onto the AIX
operating system and have proper authority to issue the commands (if needed, get
help).
The diag command loads the diagnostic controller and displays the Online diagnostic
menus.
Error log analysis is done in maintenance mode when you select the Problem
Determination option on the DIAGNOSTIC MODE SELECTION menu.
The shutdown -m command is used to stop all activity on the AIX operating system
and put the operating system into maintenance mode. The diag command is then used
to invoke the diagnostic controller so you can run the diagnostics. After the diagnostic
controller is loaded, follow the normal diagnostic instructions.
To run the Online diagnostics in maintenance mode, you must be logged in to the
customer’s version of the AIX operating system as root or superuser and use the
shutdown -m and diag commands. Use the following steps to run the Online
diagnostics in maintenance mode:
1. Stop all programs except the AIX operating system (get help if needed).
2. Log in to the AIX operating system as root or superuser.
3. Enter the shutdown -m command.
4. When a message indicates the system is in maintenance mode, enter the diag
command.
Note: Error Log Analysis is not done by the Standalone Diagnostics. If running from the
diagnostic CD-ROM disc, the CD-ROM drive and the SCSI controller that
controls it cannot be tested by the Standalone Diagnostics.
These considerations and actions are not detailed step-by-step instructions, but are
used to ensure that you have considered the attached system before you run
diagnostics on this system unit.
You are directed to the detailed procedures for the various activities as needed.
When this system unit is attached to another system, be sure you isolate this system
unit before stopping the operating system or running diagnostic programs. Some
system-cabling changes (such as installing wrap plugs or removing a device from the
configuration) may require action by the operator of the attached system before making
the cabling changes on this system.
To determine the type of platform on which you are working, enter one of the following
commands.
v If you are running AIX 4.2.1 or later; from an AIX shell enter: lscfg | pg. A screen
displays a message similar to one of the following:
Model Architecture: RS6k
Model Implementation: Uniprocessor, MCA bus
The actions needed to isolate a particular system unit or device from the configuration
depends on the software controlling the systems and devices. Therefore, be sure you
use the software documentation to prepare the configuration before turning off a system
unit or device.
High-Availability Cabling
For additional cabling information, refer to Adapters, Devices, and Cable Information for
Multiple Bus Systems.
Diagnostic Summary
The following topics provide summary type information about some features of the
diagnostics.
Resources are devices used by the system unit. Diskette drive and CD-ROM drive are
examples of resources.
The FUNCTION SELECTION menu contains two selections allowing either all
resources, or all tasks to be displayed. When task selection is made and a task has
been selected, a list of supporting resources displays. Alternatively, when resource
selection is made, and a resource or group of resources are selected, a list of common
tasks displays. Also, to aid with backward compatibility, the FUNCTION SELECTION
menu contains Diagnostic routines and Advanced Diagnostic routines selection.
The Display or Change Diagnostic Run Time Options task can be used to set advanced
mode diagnostics, looping capability, and error log analysis (ELA) mode when running
diagnostics from the Task Selection menu.
Flags
Note: Tasks are platform and device dependent. Some tasks may not be
available on the system.
-v Runs diagnostics in System Verification Mode, no error log analysis performed.
The default is Problem Determination mode that tests the device and runs
error log analysis. If used with the -e flag, the -v flag takes precedence and the
-e flag is ignored. Must be used with the -d flag to specify a device to run
diagnostics on.
Attention: Do not power off the target server at any time before the flash process
completes. If power is removed or the system is reset while flash_update is being run,
irrecoverable failures may occur. This could require return of the adapter for exchange.
Use one of the two procedures listed, depending on which level of microcode you want
to install relative to the existing level it has:
Note: You must be logged in as root’ or use CE login to perform these commands.
v If the current microcode level on the adapter is below the level you wish to install
use the following command:
diag -c -d scraidX -T"download"
Where X is the ID of the desired adapter, for example, 4.20.18 is currently on the
adapter and you wish to install 4.70.11.
v If the current microcode level on the adapter is greater than the level you wish to
install use the following command:
diag -c -d scraidX -T"download -P"
Where X is the ID of the desired adapter, for example 3.70.01 is currently on the
adapter and you wish to install 3.12.01.
After following either the Diagnostic Menu or the Command Line procedure, you
should receive a confirmation that microcode has been updated. If you do not, then
make sure the adapter is not in use and repeat the appropriate diagnostic step shown
above.
To confirm the new installed level of firmware on the adapter, use the “Viewing the
Existing Firmware Level Installed on a SCSI RAID Adapter” on page 169 procedure.
The first way is through diagnostics. However, this cannot be performed concurrently
with normal RAID operation. The RAID volume group must be varied off to run from
diagnostic menus or diagnostics from the command line. A command the customer
Note: The underlying filesystems may need to be unmounted before varying off the
volume group can complete successfully.
Where X is replaced by the number of the desired adapter. A list containing the VPD
information of all the drives in the RAID subsystem will be displayed on the screen.
Look at the ROS and RAM code levels. This is drive microcode level. Compare this to
the desired level.
When you are done, ask customer (logged on as root) to vary on the volume group
using command:
varyonvg RAID_volumegroup
where RAID_volumegroup is the name of the volumegroup you previously varied off. Any
filesystem unmounted will have to be remounted.
Note: For either of these two methods, if you see the error message: An adapter
configuration error was detected.... Run the Disk Array Manager to
resolve the adapter configuration error.
1. Log in as root (if not already root).
2. Type smit pdam.
3. Select Recovery Options -> Resolve PCI SCSI RAID Adapter Configuration ->
Display/Accept Configuration Changes .
4. Select the appropriate SCSI RAID adapter containing the drive.
5. Select YES for the Display only option.
The third way to display VPD is through the scraidmgr. This can be performed while
the RAID array is operational.
Note: The underlying filesystems may need to be unmounted before varying off the
volume group can complete successfully. Then, the volume group containing the
disk must be varied off. A command the customer (logged in as root) might use
to vary off the volume group is:
varyoffvg RAID_volumegroup
Where RAID_volumegroup is the volume group containing the RAID drive you
wish to test. When you are done, ask customer (logged on as root) to vary on
the volume group using command:
varyonvg RAID_volumegroup
Where RAID_volumegroup is the name of the volume group you previously varied
off. Any filesystem unmounted will have to be remounted.
Note: Microcode levels are consecutive going from 0-9 then A-Z. So, level S9HA
would be a later level than S96F.
7. Select the location where the microcode image is stored (from above).
v On system’s /etc/microcode directory
v On a diskette
8. A message will be displayed if download was successful or failed.
where:
-A all disk drives
-D Microcode is on diskette (default to /etc/microcode directory)
-I RAID disk drive-physical disk channel/ID (example 27)
-P Download the previous level of microcode (default to latest level)
A command the customer (logged in as root) might use to vary off the volume group is:
varyoffvg RAID_volumegroup
where RAID_volumegroup is the volume group containing the RAID drive you wish to
test.
Note: The underlying filesystems may need to be unmounted before varying off the
volume group can complete successfully.
Diagnostic Menus
When you are done, ask customer (logged in as root) to vary on the volume group
using the command: ″
varyonvg [RAID_volumegroup]
where RAID_volumegroup is the name of the volume group you previously varied off.
Note: Many of these programs work on all system model architectures. Some
programs are only accessible from Online Diagnostics in Service or Concurrent
mode, while others might be accessible only from Standalone Diagnostics. Still
other programs might only be supported on a particular system architecture,
such as CHRP (Common Hardware Reference Platform) or RSPC (PowerPC
Reference Platform). Refer to “Determining System Architecture” on page 165 to
identify the platform type of your system unit.
Note: If the system is running on a logically partitioned system, the following tasks may
only be executed in a partition with service authority:
v Configure Reboot Policy
v Configure Remote Maintenance Policy
v Configure Ring Indicate Power On
v Configure Ring Indicate Power-On Policy
v Update System or Service Processor Flash
v Save or Restore Hardware Management Policies
v Configure Scan Dump Policy
To perform one of these tasks, use the Task Selection option from the FUNCTION
SELECTION menu.
After a task is selected, a resource menu may be presented showing all resources
supported by the task.
A fast path method is also available to perform a task by using the diag command and
the -T flag. By using the fast path, the user can bypass most of the introductory menus
to access a particular task. The user is presented with a list of resources available to
support the specified task. The fast path tasks are as follows:
v Certify - Certifies media
v Chkspares - Checks for the availability of spare sectors
v Download - Downloads microcode to an adapter or device
v Disp_mcode - Displays current level of microcode
v Format - Formats media
v Identify - Identifies the PCI RAID physical disks
v IdentifyRemove - Identifies and removes devices (Hot Plug)
To run these tasks directly from the command line, specify the resource and other
task-unique flags. Use the descriptions in this chapter to understand which flags are
needed for a given task.
175
Tasks
The following tasks are described in this chapter:
v Add Resource to Resource List
v AIX Shell Prompt
v Analyze Adapter Internal Log
v Backup and Restore Media
v Certify Media
v Change Hardware Vital Product Data
v Configure Dials and LPF Keys
v Configure ISA Adapters
v Configure Reboot Policy
v Configure Remote Maintenance Policy
v Configure Ring Indicate Power On
v Configure Ring Indicate Power-On Policy
v Configure Service Processor (RSPC)
v Configure Scan Dump Policy
v Configure Surveillance Policy
v Create Customized Configuration Diskette
v Delete Resource from Resource List
v Disk Maintenance
v Display Configuration and Resource List
v Display Firmware Device Node Information
v Display Hardware Error Report
v Display Hardware Vital Product Data
v Display Machine Check Error Log
v Display Microcode Level
v Display or Change Bootlist
v Display or Change Diagnostic Run Time Options
v Display Previous Diagnostic Results
v Display Resource Attributes
v Display Service Hints
v Display Software Product Data
v Display System Environmental Sensors
v Display Test Patterns
v Display USB Devices
v Download Microcode
v Fibre Channel RAID Service Aids
v Flash SK-NET FDDI Firmware
v Format Media
v Gather System Information
v Generic Microcode Download
v Hot Plug Task
v Local Area Network Analyzer
v Log Repair Action
v Periodic Diagnostics
v PCI RAID Physical Disk Identify
v Process Supplemental Media
v Run Diagnostics
v Run Error Log Analysis
Note: Only resources that were previously detected by the diagnostics and deleted
from the Diagnostic Test List are listed. If no resources are available to be
added, then none are listed.
This service aid allows access to the AIX command line. To use this service aid, the
user must know the root password (when a root password has been established).
Note: Do not use this task to install code or to change the configuration of the system.
This task is intended to view files, configuration records, and data. Using this
service aid to change the system configuration or install code can produce
unexplained system problems after exiting the diagnostics.
The Analyze Adapter Internal Log service aid analyzes these entries in the AIX system
error log. The service aid displays the errors and the associated service actions. Entries
that do not require any service actions are ignored.
When running this service aid, a menu is presented to enter the start time, the end
time, and the file name. The start time and end time have the following format:
[mmddHHMMyy]. (where mm is the month (1-12), dd is the date (1-31) HH is the hour
To invoke the service aid task from the command line, type:
Flag Description
-c Specifies not console mode.
-d devicename Specifies the device whose internal log you
want to analyze (for example, SCRAID0)
-s start date Specifies all errors after this date are analyzed.
-e end date Specifies all errors before this date are
analyzed.
-T Specifies the Analyze Adapter Internal Log task
Note: To specify a file name from the command line, use the redirection operator at the
end of the command to specify where the output of the command is to be sent,
for example > filename (where filename is the name and location where the
user wants to store the output data (for example, /tmp/adaptlog).
Normally, one would run certify if after running diagnostics on a drive and its media, no
problem is found, but one still suspects that a problem still does exist.
Hard files can be connected either to a SCSI adapter (non-RAID) or a PCI SCSI RAID
adapter. The usage and criteria for a hard file connected to a non-RAID SCSI adapter
are different from those for a hard file connected to a PCI SCSI RAID adapter.
This task can be run directly from the AIX command line. See the following command
syntax: diag -c -d deviceName -T "certify"
Flag Description
-c No console mode
-d Specifies a device
-T Specifies the certify task
v Format and/or Erase Hardfile Attached to a Non-RAID SCSI Adapter
Attention: The following commands WILL change the content of the hardfile. Be
sure to backup data prior to running the command, and take care in choosing the
hardfile upon which you run this task.
– Hardfile Format
Writes all of the disk. The pattern written on the disk is device-dependant; for
example, some drives may write all 0s, while some may write the hexadecimal
number 5F. No bad block reassignment occurs.
– Hardfile Format and Certify
Performs the same function as Hardfile Format. After the format is completed,
Certify is run. Certify then reassigns all bad blocks encountered.
– Hardfile Erase Disk
This option can be used to overwrite (remove) all data currently stored in
user-accessible blocks of the disk. The Erase Disk option writes one or more
patterns to the disk. An additional option allows data in a selectable block to be
read and displayed on the system console.
To use the Erase Disk option, specify the number (0-3) of patterns to be written.
The patterns are written serially; that is, the first pattern is written to all blocks.
Then the next pattern is written to all blocks, overlaying the previous pattern. A
random pattern is written by selecting the Write Random Pattern option.
Note: The Erase Disk service aid has not been certified as meeting the
Department of Defense or any other organization’s security guidelines.
For a newly installed drive, you can ensure that all blocks on the drive are
overwritten with your pattern if you use the following procedure:
1. Format the drive.
2. Check the defect MAP by running the Erase Disk selection.
Note: If you use Format and Certify option, there may be some blocks which
get placed into the grown defect MAP.
3. If there are bad blocks in the defect MAP, record the information presented
and ensure that this information is kept with the drive. This data is used later
when the drive is to be overwritten.
4. Use you drive as you would normally.
5. When the drive is no longer needed and is to be erased, run the same version
of the Erase Disk selection which was used in step 2.
Note: Using the same version of the service aid is only critical if any bad
blocks were found in step 3.
6. Compare the bad blocks which were recorded for the drive in step 3 with
those that now appear in the grown defect MAP.
Note: If there are differences between the saved data and the newly obtained
data, then all sectors on this drive cannot be overwritten. The new bad
blocks are not overwritten.
7. If the bad block list is the same, continue running the service aid to overwrite
the disk with the chosen pattern(s).
This task can be run directly from the AIX command line. See the command
syntax: diag -c -d deviceName -T "format [-s* fmtcert | erase -a {read |
write}] [-F]*
Flag Description
fmtcert Formats and certifies the disk.
erase Overwrites the data on the disk.
* Available in no-console mode only.
-F Force the disk erasure even if all blocks cannot be erased due to errors
accessing the grown defect map.
Note: The Erase Disk option in command line mode uses default values. To
selectively read or write, use the diag command in console mode.
v Certify Hardfile attached to a PCI SCSI RAID Adapter
This selection is used to certify physical disks attached to a PCI SCSI RAID adapter.
Certify reads the entire disk and checks for recovered errors, unrecovered errors,
and reassigned errors. If these errors exceed the threshold values, the user is
prompted to replace the physical disk.
Note: The user cannot alter the VPD for a specific resource unless it is not
machine-readable.
Note: The Dials and LPF Keys service aid is not supported in standalone mode
(CD-ROM and NIM) on systems with 32 MB or less memory. If you have
problems in standalone mode, use the hardfile-based diagnostics.
This service aid provides a tool for configuring and removing dials and LPF keys to the
asynchronous serial ports.
This selection invokes the SMIT utility to allow Dials and LPF keys configuration. A TTY
must be in the available state on the async port before the Dials and LPF keys can be
configured on the port. The task allows an async adapter to be configured, then a TTY
port defined on the adapter. Dials and LPF keys can then be defined on the port.
Before configuring Dials or LPF keys on a serial port, you must remove all defined
TTYs. To determine if there are any defined TTYs, select List All Defined TTYs. Once
all defined TTYs have been removed, then add a new TTY and configure the Dials or
LPF keys.
Diagnostic support for ISA adapters not shown in the list may be supported from a
supplemental diskette. You can use the Process Supplemental Media task to add ISA
adapter support from a supplemental diskette.
If diagnostics are run on an ISA adapter that has been removed from the system, the
diagnostics fail because the system cannot detect the ISA adapter.
This service aid controls how the system tries to recover from a system crash.
Use this service aid to display and change the following settings for the Reboot Policy.
Note: Because of system capability, some of the following settings might not be
displayed by this service aid.
v Maximum Number of Reboot Attempts
Enter a number that is 0 or greater.
This number is the maximum number of consecutive attempts to reboot the system.
The term reboot, in the context of this service aid, is used to describe bringing
system hardware back up from scratch; for example, from a system reset or
power-on.
When the reboot process completes successfully, the reboot attempts count is reset
to 0, and a restart begins. The term restart, in the context of this service aid, is used
to describe the operating system activation process. Restart always follows a
successful reboot.
When a restart fails, and a restart policy is enabled, the system attempts to reboot
for the maximum number of attempts.
v Use the O/S Defined Restart Policy (1=Yes, 0=No)
When ’Use the O/S Defined Restart Policy’ is set to Yes, the system attempts to
reboot from a crash if the operating system has an enabled Defined Restart or
Reboot Policy.
When ’Use the O/S Defined Restart Policy’ is set to No, or the operating system
restart policy is undefined, then the restart policy is determined by the ’Supplemental
Restart Policy’.
v Enable Supplemental Restart Policy (1=Yes, 0=No)
The ’Supplemental Restart Policy’, if enabled, is used when the O/S Defined Restart
Policy is undefined, or is set to False.
You can access this service aid directly from the AIX command line, by typing:
/usr/lpp/diagnostics/bin/uspchrp -b
The Remote Maintenance Policy includes modem configurations and phone numbers to
use for remote maintenance support.
Use this service aid to display and change the following settings for the Remote
Maintenance Policy.
Note: Because of system capability, some of the following settings might not be
displayed by this service aid.
v Configuration File for Modem on serial port 1 (S1) Configuration File for Modem on
serial port 2 (S2).
Enter the name of a modem configuration file to load on either S1 or S2. The modem
configuration files are located in the directory /usr/share/modems. If a modem file is
already loaded, it is indicated by Modem file currently loaded.
v Modem file currently loaded on S1 Modem file currently loaded on S2
This is the name of the file that is currently loaded on serial port 1 or serial port 2.
Note: These settings are only shown when a modem file is loaded for a serial port.
v Call In Authorized on S1 (on/off) Call In Authorized on S2 (on/off)
Call In allows the Service Processor to receive a call from a remote terminal.
v Call Out Authorized on S1 (on/off) Call Out Authorized on S2 (on/off)
Call Out allows the Service Processor to place calls for maintenance.
v S1 Line Speed S2 Line Speed
A list of line speeds is available by using List on the screen.
Note: Knowing the system name aids the support team in quickly identifying the
location, configuration, history, and so on of your system.
You can access this service aid directly from the AIX command line, by typing:
/usr/lpp/diagnostics/bin/uspchrp -m
This service aid allows the user to power-on a system by telephone from a remote
location. If the system is powered off, and Ring Indicate Power On is enabled, the
system powers on at a predetermined number of rings. If the system is already on, no
action is taken. In either case, the telephone call is not answered, and the caller
receives no feedback that the system has powered on.
Use this service aid to display and change the following settings for the Ring Indicate
Power-On Policy:
Note: Because of system capability, some of the following settings might not be
displayed by this service aid.
v Power On Via Ring Indicate (on/off)
v Number of Rings Before Power On
You can access this service aid directly from the AIX command line, by typing:
/usr/lpp/diagnostics/bin/uspchrp -r
This service aid allows the user to display and change the NVRAM settings for the Ring
Indicate Power On capability of the service processor.
Note: This function works only on CHRP systems and only under diagnostics 5.1.0.35
or later.
Configure Scan Dump Policy allows the user to set or view the scan dump policy (scan
dump control and size) in NVRAM. Scan Dump data is a set of chip data that the
service processor gathers after a system malfunction. It consists of chip scan rings, chip
trace arrays, and Scan COM (SCOM) registers. This data is stored in the scan-log
partition of the system’s Nonvolatile Random Access Memory (NVRAM).
Use this service aid to display and change the following settings for the Scan Dump
Policy at run time:
v Scan Dump Control (how often the dump is taken)
and
v Scan Dump Size (size and content of the dump)
You can access this task directly from the AIX command line by typing:
/usr/lpp/diagnostics/bin/uspchrp -d
This service aid allows you to display and change the NVRAM settings for the service
processor.
Surveillance Setup
Note: Surveillance is only supported for systems running in full machine partition.
This selection allows you to display and change the NVRAM settings for the
surveillance capability of the service processor.
The current settings are read from NVRAM and displayed on the screen. Any changes
made to the data shown are written to NVRAM.
Modem Configuration
Use this selection when setting the NVRAM for a modem attached to any of the Service
Processor’s serial ports. The user inputs the file name of a modem configuration file
and the serial port number. The formatted modem configuration file is read, converted
for NVRAM, and then loaded into NVRAM. Refer to the Service Processor Installation
and User’s Guide for more information.
The current settings are read from NVRAM and displayed on the screen. Any changes
made to the data shown are written to NVRAM.
Use this service aid to display and change the following settings for the Reboot Policy
Setup.
v Maximum Number of Reboot Attempts
Enter a number that is 0 or greater.
This number is the maximum number of consecutive attempts allowed to reboot the
system. The term reboot, when used in context of this service aid, describes the
system hardware being brought back up from scratch. Examples would be a system
reset or turning on the power.
When the maximum number of reboot attempts is exceeded, the system calls out if
that function has been enabled.
When the reboot process completes successfully, the reboot-attempts count is reset
to 0, and a restart begins. The term restart, when used in context of this service aid,
describes the operating system activation process. Restart always follows a
successful reboot.
When a restart fails, and a restart policy is enabled, the system attempts to reboot
for the maximum number of reboot attempts.
v Enable Restart Policy (1=Yes, 0=No)
When the service processor detects operating system inactivity, an enabled ″Restart
Policy″ causes a system reset, and the reboot process begins.
v Call-Out Before Restart (on/off)
When enabled, ″Call-Out Before Restart″ allows the system to call out (on a serial
port that is enabled for call-out) when an operating system restart is initiated. Such
call-outs can be valuable if the number becomes excessive, thus signalling bigger
problems.
v Enable Unattended Start Mode (1=Yes, 0=No)
When enabled, ″Unattended Start Mode″ allows the system to recover from the loss
of ac power.
Note: This service aid runs on CHRP system units only. This service aid is only
supported for systems running in full machine partition.
This service aid monitors the system for hang conditions; that is, hardware or software
failures that cause operating system inactivity. When enabled, and surveillance detects
operating system inactivity, a call is placed to report the failure.
Use this service aid to display and change the following settings for the Surveillance
Policy:
Note: Because of system capability, some of the following settings might not be
displayed by this service aid:
v Surveillance (on/off)
v Surveillance Time Interval
This is the maximum time between heartbeats from the operating system.
v Surveillance Time Delay
This is the time to delay between when the operating system is in control and when
to begin operating system surveillance.
v Changes are to Take Effect Immediately
Set this to Yes if the changes made to the settings in this menu are to take place
immediately. Otherwise the changes take effect beginning with the next system boot.
You can access this service aid directly from the AIX command line, by typing:
/usr/lpp/diagnostics/bin/uspchrp -s
Note: Only resources that were previously detected by the diagnostics and have not
been deleted from the Diagnostic Test List are listed. If no resources are
available to be deleted, then none are listed.
Disk Maintenance
This service aid provides the following options for the fixed-disk maintenance:
v Disk to Disk Copy
v Display/Alter Sector
This selection allows you to recover data from an old drive when replacing it with a new
drive. The service aid recovers all logical volume manager (LVM) software reassigned
blocks. To prevent corrupted data from being copied to the new drive, the service aid
stops if an unrecoverable read error is detected. To help prevent possible problems with
the new drive, the service aid stops if the number of bad blocks being reassigned
reaches a threshold.
To use this service aid, both the old and new disks must be installed in or attached to
the system with unique SCSI addresses. This requires that the new disk drive SCSI
address must be set to an address that is not currently in use and the drive be installed
in an empty location. If there are no empty locations, then one of the other drives must
be removed. When the copy is complete, only one drive can remain installed. Either
remove the target drive to return to the original configuration, or perform the following
procedure to complete the replacement of the old drive with the new drive:
1. Remove both drives.
2. Set the SCSI address of the new drive to the SCSI address of the old drive.
3. Install the new drive in the old drive’s location.
4. Install any other drives (that were removed) into their original location.
Display/Alter Sector
Note: Use caution when you use this service aid because inappropriate modification to
some disk sectors can result in the total loss of all data on the disk.
This selection allows the user to display and alter information on a disk sector. Sectors
are addressed by their decimal sector number. Data is displayed both in hex and in
ASCII. To prevent corrupted data from being incorrectly corrected, the service aid does
not display information that cannot be read correctly.
This service aid displays the item header only for all installed resources. Use this
service aid when there is no need to see the vital product data (VPD). (No VPD is
displayed.)
This task displays the firmware device node information that appears on CHRP
platforms. This service aid is intended to gather more information about individual or
particular devices on the system. The format of the output data may differ depending on
which level of the AIX operating system is installed.
The Display Error Summary and Display Error Detail selection provide the same type of
report as the errpt command. The Display Error Analysis Summary and Display Error
Analysis Detail selection provide additional analysis.
Note: The Machine Check Error Log Service Aid is available only on Standalone
Diagnostics.
When a machine check occurs, information is collected and logged in an NVRAM error
log before the system unit shuts down. This information is logged in the AIX error log
and cleared from NVRAM when the system is rebooted from the hard disk, LAN, or
standalone media. When booting from Standalone Diagnostics, this service aid converts
the logged information into a readable format that can be used to isolate the problem.
When booting from the hard disk or LAN, the information can be viewed from the AIX
error log using the Hardware Error Report Service Aid. In either case, the information is
analyzed when the sysplanar0 diagnostics are running in Problem Determination
Mode.
You can display the current level of the microcode on an adapter, the system, or a
device by using the AIX diag command. See the following command syntax: diag -c
-d device -T "disp_mcode"
Flag Description
-c No console mode.
-d Used to specify a device.
-T Use the disp_mcode option to display microcode.
The AIX lsmcode command serves as a command line interface to the ″Display
Microcode Level″ task. For information on the lsmcode command, refer to the AIX
Commands Reference manual.
The system attempts to perform an IPL from the first device in the list. If the device is
not a valid IPL device or if the IPL fails, the system proceeds in turn to the other
devices in the list to attempt an IPL.
Note: The run-time options are used only when selecting the Run Diagnostic task.
Note: This option is only displayed when you run the Online Diagnostics in Service
Mode.
v Include Advanced Diagnostics
This option allows the user to turn on or off including the Advanced Diagnostics (the
default is off).
v Include Error Log Analysis (not available in diagnostics 5.2.0 or later)
This option allows the user to turn on or off including the Error Log Analysis (ELA)
(the default is off).
v Number of Days Used to Search Error Log
This option allows the user to select the number of days for which to search the AIX
error log for errors when running the Error Log Analysis. The default is seven days,
but it can be changed from one to sixty days.
v Display Progress Indicators
This option allows the user to turn on or off the progress indicators when running the
Diagnostic Applications. The progress indicators are in a popup box at the bottom of
the screen which indicate that the test being run (the default is on).
v Diagnostic Event Logging
This option allows the user to turn on or off logging information to the Diagnostic
Event Log (the default is on).
v Diagnostic Event Log File Size
This option allows the user to select the maximum size of the Diagnostic Event Log.
The default size for the Diagnostic Event Log is 100 KB. The size can be increased
in 100 KB increments to a maximum of 1 MB.
Use the diaggetrto command to display one or more diagnostic run-time options. Use
the following AIX command line syntax:
/usr/lpp/diagnostics/bin/diaggetrto [-a] [-d] [-l] [-m] [-n] [-p] [-s]
Use the diagsetrto command to change one or more diagnostic run-time options. Use
the following AIX command line syntax:
Note: This service aid is not available when you load the diagnostics from a source
other than a hard disk drive or a network.
This service aid allows a service representative to display results from a previous
diagnostic session. When the Display Previous Diagnostic Results option is selected,
the user can view up to 25 no trouble found (NTF) and service request number (SRN)
results.
This service aid displays Diagnostic Event Log information. You can display the
Diagnostic Event Log in a short version or a long version. The Diagnostic Event Log
contains information about events logged by a diagnostic session.
This service aid displays the information in reverse chronological order. If more
information is available than can be displayed on one screen, use the Page Down and
Page Up keys to scroll through the information.
This information is not from the AIX operating system error log. This information is
stored in the /var/adm/ras directory.
You can run the command from the AIX command line by typing:
/usr/lpp/diagnostics/bin/diagrpt [[-o] | [-s mmddyy] | [-a] | [-r]]
Flag Description
-o Displays the last diagnostic results file stored in the
/etc/lpp/diagnostics/data directory
-s mmddyy Displays all diagnostic result files logged since the date specified
-a Displays the long version of the Diagnostic Event Log
-r Displays the short version of the Diagnostic Event Log
Use the arrow keys to scroll through the information in the file.
This service aid displays the environmental sensors implemented on a CHRP system.
The information displayed is the sensor name, physical location code, literal value of
the sensor status, and the literal value of the sensor reading.
This service aid can also be run as a command. You can use the command to list the
sensors and their values in a text format, list the sensors and their values in numerical
format, or a specific sensor can be queried to return either the sensor status or sensor
value.
Examples
The following are examples from this command:
1. Display a list of the environmental sensors: /usr/lpp/diagnostics/bin/uesensor -l
3 0 11 87 P1
9001 0 11 2345 F1
9004 0 11 2 V1
9004 1 9 2 V2
3. Return the status of sensor 9004, index 1: /usr/lpp/diagnostics/bin/uesensor -t
9004 -i 1
9
4. Return the value of sensor 9004, index 1: /usr/lpp/diagnostics/bin/uesensor -t
9004 -i 1 -v
To run the USB devices service aid, go to the diagnostics ″TASKS SELECTION″ menu,
select ″Display USB Devices″. From the controller list that displayed on the screen,
select one of the items that begins with ″OHCDX″, where ″X″ is a number. A list of
devices attached to the controller displays.
Download Microcode
This service aid provides a way to copy microcode to an adapter or device. The service
aid presents a list of adapters and devices that use microcode. After the adapter or
device is selected, the service aid provides menus to guide you in checking the current
level and downloading the needed microcode.
This task can be run directly from the AIX command line. See the following sections for
instructions on downloading to a particular type adapter or device.
Fault Indicators
Note: Prior to Diagnostics version 5.1.0.35, this task was also known as the System
Fault Indicator task.
This task is used to display or reset the Fault Indicators on the systems that support
this function. This task may also be used to set the Fault Indicators for testing
purposes, but the Indicators are not set back to normal when you exit this task.
The Fault Indicators are used to identify a fault with the system. These indicators may
be set automatically by hardware, firmware, or diagnostics when a fault is detected in
the system.
The Fault Indicators are turned off when a Log Repair Action is performed. After a
serviceable event is complete, do a System Verification to verify the fix. Also do a Log
Repair Action if the test on the resource was good, and that resource had an entry in
the error log. If the serviceable event was not a result of an error log entry, use the Log
Repair Action Task to turn off the system fault indicator.
For additional information concerning the use of these indicators, refer to the system
guide.
Note: The AIX command does not allow you to set the fault indicators to the Fault
state.
Flag Description
-s normal Sets the Fault Indicator to the normal state.
-l location code Identifies the resource by physical location code.
When the command is used without the -s flag, the current state of the indicator is
displayed as normal or attention.
When teh command is used without the -l flag, the Primary Enclosure resource is used.
Note: See also the “Identify and System Attention Indicators” on page 211. Some
systems that do not support Fault Indicators have a similar System Attention
Indicator.
Note: The Erase Disk service aid has not been certified as meeting the Department
of Defense or any other organization’s security guidelines.
Note: If you use the Format and Certify option, there may be some blocks
which get placed into the grown defect MAP.
3. If there are bad blocks in the defect MAP, record the information presented and
ensure that this information is kept with the drive. This data is used later when
the drive is to be overwritten.
4. Use the drive as you would normally.
5. When the drive is no longer needed and is to be erased, run the same version of
the Erase Disk selection which was used in step 2.
Note: Using the same version of the service aid is only critical if any bad blocks
were found in step 3.
Note: If there are differences between the saved data and the newly obtained
data, then all sectors on this drive cannot be overwritten. The new bad
blocks are not overwritten.
7. If the bad block list is the same, continue running the service aid to overwrite the
disk with the chosen pattern(s).
This task can be run directly from the AIX command line. See the command syntax:
diag -c -d deviceName -T "format [-s* fmtcert | erase -a {read | write}] [-F]"
Flag Description
fmtcert Formats and certifies the disk.
* Available in no-console mode only.
-F Force the disk erasure even if all blocks cannot be erased due to errors
accessing grown defect map.
Note: The Erase Disk option in command line mode uses default values. To selectively
read or write, use the diag command in console mode.
Optical Media
Use the following functions to check and verify optical media:
v Optical Media Initialize
Formats the media without certifying. This function does not reassign the defective
blocks or erase the data on the media. This option provides a quick way of formatting
the media and cleaning the disk.
This task can be run directly from the AIX command line. See the following command
syntax: diag -c -d deviceName -T "format [-s {initialize | fmtcert} ]"
Option Description
Diskette Format
This selection formats a diskette by writing patterns to it.
The output of the SNAP service aid can be used by field service personnel or it can be
put on removable media and transferred to remote locations for more extensive
analysis.
To use the SNAP task, select Gather System Information from the task list. You can
select which components you want to collect information for, and where to store the
data (hard disk or removable media).
The genucode program should be downloaded onto diskette or tape in tar format while
the microcode image itself goes onto another one in restore format. Running the
Generic Microcode Download task will search for the genucode script on diskette or
tape and execute it. It will ask for a Genucode media to be inserted into the drive. The
service aid moves the genucode script file to the /tmp directory and runs the program
that downloads the microcode to the adapter or device.
This service aid is supported in both concurrent and standalone modes from disk, LAN,
or loadable media.
The Hot Plug Task has a restriction when running in Standalone or Online Service
mode; new devices may not be added to the system unless there is already a device
Depending on the environment and the software packages installed, selecting this task
displays the following three subtasks:
v PCI Hot Plug Manager
v SCSI Hot Swap Manager
v RAID Hot Plug Devices
To run the Hot Plug Task directly from the command line, type the following: diag
-T"identifyRemove"
If you are running the diagnostics in Online Concurrent mode, run the Missing Options
Resolution Procedure immediately after removing any device.
If the Missing Options Resolution Procedure runs with no menus or prompts, then
device configuration is complete. Otherwise, work through the missing options menu to
complete device configuration (in diagnostics prior to version 5.2.0) or (in diagnostics
version 5.2.0 and later) select the device that has an uppercase M shown in front of it in
the resource list for missing options processing to be done on that resource.
The List PCI Hot Plug Slots function lists all PCI hot-plug slots. Empty slots and
populated slots are listed. Populated slot information includes the connected logical
device. The slot name consists of the physical location code and the description of the
physical characteristics for the slot.
The Add a PCI Hot Plug Adapter function is used to prepare a slot for the addition of
a new adapter. The function lists all the empty slots that support hot plug. When a slot
is selected, the visual indicator for the slot blinks at the Identify rate. After the slot
location is confirmed, the visual indicator for the specified PCI slot is set to the Action
state. This means the power for the PCI slot is off and the new adapter can be plugged
in.
The Replace/Remove a PCI Hot Plug Adapter function is used to prepare a slot for
adapter exchange. The function lists all the PCI slots that support hot plug and are
occupied. The list includes the slot’s physical location code and the device name of the
The Identify a PCI Hot Plug Slot function is used to help identify the location of a PCI
hot-plug adapter. The function lists all the PCI slots that are occupied or empty and
support hot plug. When a slot is selected for identification, the visual indicator for the
slot is set to the Identify state.
The Unconfigure Devices function attempts to put the selected device, in the PCI
hot-plug slot, into the Defined state. This action must be done before any attempted
hot-plug function. If the unconfigure function fails, it is possible that the device is still in
use by another application. In this case, the customer or system administrator must be
notified to quiesce the device.
The Configure Devices function allows a newly added adapter to be configured into
the system for use. This function should also be done when a new adapter is added to
the system.
The Install/Configure Devices Added After IPL function attempts to install the
necessary software packages for any newly added devices. The software installation
media or packages are required for this function.
Standalone Diagnostics has restrictions on using the PCI Hot-Plug Manager. For
example:
v Adapters that are replaced must be exactly the same FRU part number as the
adapter being replaced.
v New adapters cannot be added unless a device of the same FRU part number
already exists in the system, because the configuration information for the new
adapter is not known after the Standalone Diagnostics are booted.
v The following functions are not available from the Standalone Diagnostics and will
not display in the list:
– Add a PCI Hot Plug Adapter
– Configure Devices
– Install/Configure Devices Added After IPL
You can run this task directly from the command line by typing the following command:
diag -d device -T"identifyRemove"
However, note that some devices support both the PCI Hot-Plug task and the RAID
Hot-Plug Devices task. If this is the case for the device specified, then the Hot Plug
Task displays instead of the PCI Hot Plug Manager menu.
More detailed information concerning PCI Hot-Plug Manager can be found in the AIX
Operating System, System Management Guide.
The List the SES Devices function lists all the SCSI hot-swap slots and their contents.
Status information about each slot is also available. The status information available
includes the slot number, device name, whether the slot is populated and configured,
and location.
The Identify a Device Attached to an SES Device function is used to help identify the
location of a device attached to a SES device. This function lists all the slots that
support hot swap that are occupied or empty. When a slot is selected for identification,
the visual indicator for the slot is set to the Identify state.
The Attach a Device to an SES Device function lists all empty hot-swap slots that are
available for the insertion of a new device. After a slot is selected, the power is
removed. If available, the visual indicator for the selected slot is set to the Remove
state. After the device is added, the visual indicator for the selected slot is set to the
Normal state, and power is restored.
Note: Be sure that no other host is using the device before you remove it.
Standalone Diagnostics has restrictions on using the SCSI Hot Plug Manager. For
example:
v Devices being used as replacement devices must be exactly the same type of device
as the device being replaced.
v New devices may not be added unless a device of the same FRU part number
already exists in the system, because the configuration information for the new
device is not known after the Standalone Diagnostics are booted.
You can run this task directly from the command line. See the following command
syntax:
diag -d device-T"identifyRemove"
OR
diag [-c] -d device -T"identifyRemove -a [identify|remove]"
Flag Description
-a Specifies the option under the task.
-c Run the task without displaying menus. Only command line prompts are used.
This flag is only applicable when running an option such as identify or remove.
-d Indicates the SCSI device.
-T Specifies the task to run.
Identify Indicators
See the “Identify and System Attention Indicators” for a description of the Identify
Indicators task.
Note: Prior to Diagnostics version 5.1.0.35, this task may also have been known as the
System Identify Indicator or Identify Indicators task.
This task is used to display or set the Identify Indicators and the single system attention
indicator on the systems that support this function.
Some systems may support the Identify Indicators or the Attention Indicators. The
Identify Indicator is used to help physically identify the system in a large equipment
room. The Attention Indicator is used to help physically identify a system with a fault in
a large equipment room.
Note: It is important to run the Log Repair Action function. This action keeps the Fault
Indicator from going back to the Fault state due to a previous error in the error
log that has already been serviced.
For additional information concerning the use of this indicator, refer to the system guide.
This task can also be run directly from the command line by typing
/usr/lpp/diagnostics/bin/usysident [-s {normal | identify}] [-l location code]
Flag Description
-s {normal | identify} Sets the state of the System Identify Indicator to
either normal or identify.
-l location code Identifies the resource by physical location code.
When this command is used without the -l flag, the Primary Enclosure resource is used.
Use the -l flag only in systems that have more than one Identify and System attention
indicator.
When this command is used without -s flag, the current state of the identify indicator is
displayed.
For more information on the tasks, see Chapter 28, “Introduction to Tasks and Service
Aids” on page 175.
You can run this task directly from the AIX command line. See the following command
syntax: diag -c -d pci RAID adapter -T identify
This task processes the Diagnostic Supplemental Media. Insert the supplemental media
when you are prompted; then press Enter. After processing has completed, go to the
Resource Selection list to find the resource to test.
Notes:
1. This task is supported in Standalone Diagnostics only.
2. Process and test one resource at a time. Run diagnostics after each supplemental
media is processed. (for example; If you need to process two supplemental media
you need to run diagnostic twice. Once after each supplement media is processed.)
The procedures for running the diagnostics depends on the state of the diagnostics
run-time options. See “Display or Change Diagnostic Run-Time Options” on page 195.
Run Exercisers
The Run Exercisers task provides a tool to troubleshoot intermittent system problems in
AIX 4.3.2 or later, to test hardware, and to verify replacement parts. When AIX error
logging is enabled, the Run Error Log Analysis task can be used to analyze errors after
the exerciser completes. Hardware errors are logged in the AIX Error Log. Miscompares
and recoverable errors are not reported, however, they may be logged in the AIX Error
Log when logging thresholds are exceeded.
The diagnostic supervisor typically sets up temporary work files in /tmp to log
messages and device statistics. These files are deleted before an exerciser session
begins. In addition to individual exerciser requirements, the following requirements
pertain to all exercisers:
v Only supported on CHRP platforms
v Only supported in concurrent or service modes
v Not supported from standalone diagnostics
v System performance will be degraded while running the exerciser, so it is
recommended that customer applications be shut down before it is run.
v At least 1 MB of free storage in /tmp is available
From the TASK SELECTION LIST menu select Run Exercisers then the RESOURCES
SELECTION LIST menu displays. From this menu choose the resources you want to
exercise and then select commit to start the Run Exerciser task. An intermediate
popup may appear stating system performance will be degraded. (The popup does not
appear if the task had previously been selected). Press Enter and the EXERCISER
OPTIONS menu prompts for the type of test to run.
The EXERCISER OPTIONS menu displays two options (1) Short Exercise and (2)
Extended Exercise. The Short Exercise option exercises the resources within a
relatively short time and exits. The Extended Exercise option allows greater flexibility
and control over resources and test duration.
After choosing the Short Exercise option, additional menus, pop-ups and prompts may
appear for each resource. Please read any text and carefully fill out any prompts before
committing. Next, the exercisers start, and the Device Status Screen displays. The
exercisers runs 5 to 10 minutes depending on the number of processors, processor
When the task completes any errors that were encountered are displayed for review.
Finally, an Exerciser Complete popup displays. To continue, press the Enter key.
Following, you will be taken back to the TASK SELECTION LIST menu. If miscompare
errors were encountered, run diagnostics on the resource. If the problem is not reported
then contact your service support structure. If any other errors were encountered, select
and run the Error Log Analysis task. If Error Log Analysis does not report a problem
then contact your service support structure.
Abbreviations
The following list describes abbreviations used in the exerciser reports.
Acronym Description
COE Continue on error (use number to select).
CP Device has run the specified number of cycles and is not running.
DD The exerciser has been terminated by a signal.
ER Device has stopped with an error.
HG The device is hung.
HOE Halt on error (use number to select).
RN Device is running.
ST Device is stopped.
On systems with multiple processors, a process is started for each processor. The free
memory space is split evenly between the available processors, thus reducing the time
required to exercise all of the memory.
Tape Exerciser
The tape exerciser is labeled rmtx, where x is the number of a specific device. The
exerciser performs read, write, and compare operations using known data patterns. A
tape device and Test Diagnostic Cartridge are required to run this exerciser. The actual
Test Diagnostics Cartridge depends upon the specific tape device being tested. The
exerciser automatically rewinds the tape. Test Requirements are:
v Tape device
v Test Diagnostic Cartridge; (P/N depends upon tape device)
Diskette Exerciser
The diskette exerciser is labeled fdx, where x is the number of a specific device. The
exerciser performs read, write, and compare operations using known data patterns. A
scratch diskette is required to run this exerciser, data on the scratch diskette is
destroyed. Test requirements are:
v Diskette device
v Scratch diskette (data on diskette is destroyed)
CD-ROM Exerciser
The CD-ROM exerciser is labeled cdx, where x is the number of a specific device. The
exerciser performs read and compare operations using known data patterns. A
CD-ROM device and a Test Disc is required to run this exerciser. Test requirements are:
v CD-ROM device
v Test Disc P/N 81F8902
Use this service aid to save or restore the settings from Ring Indicate Power-On Policy,
Surveillance Policy, Remote Maintenance Policy and Reboot Policy. The following
options are available:
v Save Hardware Management Policies
This selection writes all of the settings for the hardware-management policies to the
following file: /etc/lpp/diagnostics/data/hmpolicies
v Restore Hardware Management Policies
This selection restores all of the settings for the hardware-management policies from
the contents of the following file: /etc/lpp/diagnostics/data/hmpolicies
You can access this service aid directly from the AIX command line, by typing:
/usr/lpp/diagnostics/bin/uspchrp -a
Use this service aid to save or restore the Service Processor Configuration to or from a
file. The Service Processor Configuration includes the Ring Indicator Power-On
Configuration. The following options are available:
v Save Service Processor Configuration
This selection writes all of the settings for the Ring Indicate Power On and the
Service Processor to the following file: /etc/lpp/diagnostics/data/spconfig
v Restore Service Processor Configuration
This selection restores all of the settings for the Ring Indicate Power On and the
Service Processor from the following file: /etc/lpp/diagnostics/data/spconfig
To use this service aid, the user should understand how a SCSI bus works. Use this
service aid when the diagnostics cannot communicate with anything on the SCSI bus
and cannot isolate the problem. Normally the procedure for finding a problem on the
SCSI bus with this service aid is to start with a single device attached, ensure that it is
working, then start adding additional devices and cables to the bus, ensuring that each
one works. This service aid works with any valid SCSI bus configuration.
The SCSI Bus Service Aid transmits a SCSI Inquiry command to a selectable SCSI
Address. The service aid then waits for a response. If no response is received within a
defined amount of time, the service aid displays a timeout message. If an error occurs
or a response is received, the service aid then displays one of the following messages:
v The service aid transmitted a SCSI Inquiry Command and received a valid
response back without any errors being detected.
v The service aid transmitted a SCSI Inquiry Command and did not receive any
response or error status back.
v The service aid transmitted a SCSI Inquiry Command and the adapter indicated
a SCSI bus error.
v The service aid transmitted a SCSI Inquiry Command and an adapter error
occurred.
v The service aid transmitted a SCSI Inquiry Command and a check condition
occur.
When the SCSI Bus Service Aid is started a description of the service aid displays.
Pressing the Enter key displays the Adapter Selection menu. Use this menu to enter the
address to transmit the SCSI Inquiry Command.
When the adapter is selected, the SCSI Bus Address Selection menu displays. Use this
menu to enter the address to transmit the SCSI Inquiry Command.
After the address is selected, the SCSI Bus Test Run menu displays. Use this menu to
transmit the SCSI Inquiry Command by pressing Enter. The service aid then indicates
the status of the transmission. When the transmission is completed, the results of the
transmission displays.
Notes:
1. A Check Condition can be returned when the bus or device is working correctly.
2. AIX does not allow the command to be sent if the device is in use by another
process.
This service aid can be run directly from the AIX command line. See the following
command syntax (path is /usr/lpp/diagnostics/bin/utape):
utape [-h | -?] [-d device] [-n | -l | -t]
OR
utape -c -d device [-v] {-n | {-l | -t} { -D | -f [ filename]}}
Flag Description
-c Run the service aid without displaying menus. The return code indicates
success or failure. The output is suppressed except for the usage statement
and the numeric value for hours since cleaning (if -n and -D flags are used).
-D Copy data to diskette.
-f Copy data to the file name given after this flag or to a default file name if no
name is specified.
-h, -? Display a usage statement and/or return code. If the -c flag is present, only the
return code displays to indicate the service aid did not run. If the -c is not
used, a usage statement displays and the service aid exits.
-l Display or copy log sense information.
-n Display time since drive was last cleaned.
-t Copy trace table.
-v Verbose mode. If the -c flag is present, the information displays on the screen.
If the -n flag is present, the information about tape-head cleaning is printed.
You can run this task directly from the AIX command line. See the following command
syntax: diag -c -d deviceName -T chkspares
This task invokes the SMIT Update Software by Fix (APAR) task. The task allows the
input device and APARs to be selected. You can install any APAR using this task.
Attention: If the system is running on a logically partitioned system, ask the customer
or system administrator if a service partition has been designated.
v If it has, ask the customer or system administrator to shut down all of the partitions
except the one with service authority. The firmware update can then be done using
the service aid or the AIX command line in that partition.
v If a service partition has not been designated, the system must be shut down. If the
firmware update image is available on backup diskettes, the firmware update can
then be done from the service processor menus as a privileged user. If the firmware
update image is in a file on the system, reboot the system in a full machine partition
and use the following normal firmware update procedures.
If the system is already in a full machine partition, use the following normal firmware
update procedures:
This selection updates the system or service processor flash for CHRP system units.
Some systems may have separate images for system and service processor firmware;
newer systems have a combined image that contains both in one image.
Look for additional update and recovery instructions with the update kit. You need to
know the fully qualified path and file name of the flash update image file provided in the
kit. If the update image file is on a diskette, the service aid can list the files on the
diskette for selection. The diskette must be a valid backup format diskette.
Refer to the update instructions with the kit, or the service guide for the system unit to
determine the current level of the system unit or service processor flash memory.
When this service aid is run from online diagnostics, the flash update image file is
copied to the /var file system. It is recommended that the source of the microcode that
you want to download be put into the /etc/microcode directory on the system. If there
is not enough space in the /var file system for the new flash update image file, an error
is reported. If this error occurs, exit the service aid, increase the size of the /var file
system, and retry the service aid. After the file is copied, a screen requests confirmation
before continuing with the flash update. When you continue the update flash, the
system reboots using the shutdown -u command. The system does not return to the
diagnostics, and the current flash image is not saved. After the reboot, you can remove
the /var/update_flash_image file.
When this service aid is run from standalone diagnostics, the flash update image file is
copied to the file system from diskette or from the NIM server. Using a diskette, the
user must provide the image on backup format diskette because the user does not
have access to remote file systems or any other files that are on the system. If using
the NIM server, the microcode image must first be copied onto the NIM server in the
/usr/lib/microcode directory pointed to the NIM SPOT (from which you plan to have
the NIM client boot standalone diagnostics) prior to performing the NIM boot of
You can use the update_flash command in place of this service aid. The command is
located in the /usr/lpp/diagnostics/bin directory. The command syntax is as follows:
update_flash [-q] -f file_name
update_flash [-q] -D device_name -f file_name
update_flash [-q] -D update_flash [-q] -D device_name -l
Flag Description
-D Specifies that the flash update image file is on diskette. The device_name
variable specifies the diskette drive. The default device_name is /dev/fd0.
-f Flash update image file source. The file_name variable specifies the fully
qualified path of the flash update image file.
-l Lists the files on a diskette, from which the user can choose a flash update
image file.
-q Forces the update_flash command to update the flash EPROM and reboot
the system without asking for confirmation.
Attention: The update_flash command reboots the entire system. Do not use this
command if more than one user is logged on to the system.
Note: This service aid runs on RSPC system units only and is not supported on AIX
5.2 or later.
This selection updates the system flash for RSPC systems. The user provides a valid
binary image either on a diskette or with a qualified path name. The diskettes can be in
DOS or backup format.
The flash update image is copied to the /var file system. If not enough space is
available in the file system for the flash update image file, an error is reported. If this
error occurs, increase the file size of the /var file system. The current flash image is not
saved. The command automatically removes the /var/update_flash_image file.
After user confirmation, the command reboots the system twice to complete the flash
update.
Note: Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is
0, use the three rightmost digits.
2E6 The PCI Differential Ultra SCSI adapter or the Universal PCI Differential Ultra
SCSI adapter being configured.
2E7 Configuration method unable to determine if the SCSI adapter type is SE or
DE type.
440 9.1GB Ultra SCSI Disk Drive being identified or configured.
441 18.2GB Ultra SCSI Disk Drive being identified or configured.
444 2-Port Multiprotocol PCI Adapter (ASIC) being identified or configured.
447 PCI 64-bit Fibre Channel Arbitrated Loop Adapter being configured.
500 Querying Standard I/O slot.
501 Querying card in Slot 1.
502 Querying card in Slot 2.
503 Querying card in Slot 3.
504 Querying card in Slot 4.
505 Querying card in Slot 5.
506 Querying card in Slot 6.
507 Querying card in Slot 7.
508 Querying card in Slot 8.
510 Starting device configuration.
511 Device configuration completed.
225
512 Restoring device configuration files from media.
513 Restoring basic operating system installation files from media.
516 Contacting server during network boot.
517 Mounting client remote file system during network IPL.
518 Remote mount of the root (/) and /usr file systems failed during network boot.
520 Bus configuration running.
521 /etc/init invoked cfgmgr with invalid options; /etc/inithas been corrupted or
incorrectly modified (irrecoverable error).
522 The configuration manager has been invoked with conflicting options
(irrecoverable error).
523 The configuration manager is unable to access the ODM database
(irrecoverable error).
524 The configuration manager is unable to access the config.rules object in the
ODM database (irrecoverable error).
525 The configuration manager is unable to get data from a customized device
object in the ODM database (irrecoverable error).
526 The configuration manager is unable to get data from a customized device
driver object in the ODM database ( irrecoverable error).
527 The configuration manager was invoked with the phase 1 flag; running phase 1
at this point is not permitted (irrecoverable error).
528 The configuration manager cannot find sequence rule, or no program name
was specified in the ODM database (irrecoverable error).
529 The configuration manager is unable to update ODM data (irrecoverable error).
530 The program savebase returned an error.
531 The configuration manager is unable to access the PdAt object class
(irrecoverable error).
532 There is not enough memory to continue (malloc failure); irrecoverable error.
533 The configuration manager could not find a configuration method for a device.
534 The configuration manager is unable to acquire database lock (irrecoverable
error).
535 HIPPI diagnostics interface driver being configured.
536 The configuration manager encountered more than one sequence rule
specified in the same phase (irrecoverable error).
537 The configuration manager encountered an error when invoking the program in
the sequence rule.
538 The configuration manager is going to invoke a configuration method.
Note: Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is
0, use the three rightmost digits.
c00 AIX Install/Maintenance loaded successfully.
c01 Insert the first diagnostic diskette.
c02 Diskettes inserted out of sequence.
c03 The wrong diskette is in diskette drive.
c04 The loading stopped with an irrecoverable error.
c05 A diskette error occurred.
c06 The rc.boot configuration shell script is unable to determine type of boot.
c07 Insert the next diagnostic diskette.
c08 RAM file system started incorrectly.
c09 The diskette drive is reading or writing a diskette.
c20 An unexpected halt occurred, and the system is configured to enter the kernel
debug program instead of entering a system dump.
c21 The ifconfig command was unable to configure the network for the client
network host.
c22 The tftp command was unable to read client’s ClientHostName info file during
a client network boot.
c24 Unable to read client’s ClientHostName.info file during a client network boot.
c25 Client did not mount remote miniroot during network install.
c26 Client did not mount the /usr file system during the network boot.
c29 The system was unable to configure the network device.
c31 Select the console display for the diagnostics. To select No console display,
set the key mode switch to Normal then to Service. The diagnostic programs
then load and run the diagnostics automatically. If you continue to get the
message, check the cables and make sure you are using the serial port.
c32 A directly attached display (HFT) was selected.
c33 A TTY terminal attached to serial ports S1 or S2 was selected.
c34 A file was selected. The console messages store in a file.
c35 No console found.
c40 Configuration files are being restored.
c41 Could not determine the boot type or device.
c42 Extracting data files from diskette.
Note: When a lowercase c is listed, it displays in the lower half of the character
position. Some systems produce 4-digit codes, the two leftmost positions can
have a blanks or zeros. Use the two rightmost digits.
0c0 The dump completed successfully.
0c1 The dump failed due to an I/O error.
0c2 A dump, requested by the user, is started.
Crash Codes
Note: Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is
0, use the three rightmost digits.
The crash codes that follow are part of a Type 102 message. These crash codes are
grouped into three categories:
Category 1 Dump analysis is the appropriate first action in Problem
Determination, begin the Problem Determination process with
software support.
Category 2 Dump analysis most likely will not aid in Problem Determination, begin
the Problem Determination process with hardware support.
Category 3 Both software and hardware support may be needed in Problem
Determination, go to Chapter 9, “MAP 0070: 888 Sequence in
Operator Panel Display” on page 59 to assist in problem isolation.
Category 1
300 Data storage interrupt from the processor.
32x Data storage interrupt because of an I/O exception from IOCC.
38x Data storage interrupt because of an I/O exception from SLA.
400 Instruction storage interrupt.
700 Program interrupt.
Category 2
200 Machine check because of a memory bus error.
201 Machine check because of a memory timeout.
202 Machine check because of a memory card failure.
203 Machine check because of a out of range address.
204 Machine check because of an attempt to write to ROS.
Category 3
000 Unexpected system interrupt.
558 There is not enough memory to continue the IPL.
600 AIX 4.3.3.3 and above: Alignment Interrupt. If pre-AIX 4.3.3.3: AIX has crashed
because the Portability Assist Layer (PAL) for this machine type has detected a
problem.
605 AIX has crashed because the Portability Assist Layer (PAL) for this machine
type has detected a problem (AIX 4.3.3.3 and above).
Because the same diagnostic programs are used on all system units, a location code is
used to physically locate a failing device or unit. The location code is displayed along
with the service request number (SRN) when the diagnostic programs isolate a failure.
If the location code is not known, you can run the Display Previous Diagnostic Results
service aid to display the results of the last time the diagnostic programs were run.
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| Slot or Adapter Number
Bus Type
v AB identifies a bus type.
v CD identifies a slot or adapter number.
v EF is the connector identifier, used to identify the adapter connector to which a
resource is attached.
v GH identifies a port, address, memory module, device, or FRU. GH has several
meanings depending upon the resource type, they are as follows:
– For memory cards, GH defines a memory module. Values for GH are 1 through
16.
For systems that have memory modules that plug directly into the system planar,
the location code is 00-00-00-GH where GH is the memory module slot. For
systems that have memory cards with memory modules, the location code is
00-CD-EF-GH, where CD is the card slot and GH is the memory module slot.
– For L2 caches, GH defines the cache. Values for GH are 1 through 16.
– For PCMCIA devices, GH defines the PCMCIA. Values for GH are 1 through 16.
– For async devices, GH defines the port on the fanout box. Values are 00 to 15.
– For a diskette drive, H defines which diskette drive 1 or 2. G is always 0.
– For all other devices, GH is equal to 00.
Where:
v AB-CD-EF are the same as non-SCSI devices.
v G defines the control unit address of the device. Values of 0 to 15 are valid.
v H defines the logical unit address of the device. Values of 0 to 255 are valid.
Adapters and cards are identified with only AB-CD. The possible values for AB are as
follows:
00 for processor bus
01 for ISA buses
04 for PCI buses
05 for PCMCIA buses (not supported on 7024)
For pluggable adapters or cards, this is a two-digit slot number in the range from 01 to
99. However, in the case of ISA cards these numbers do not actually correspond to the
physical slot numbers. They simply are based on the order in which the ISA cards are
defined or configured, either by SMIT or the ISA Adapter Configuration Service Aid.
For integrated adapters, the first character (C) is a letter in the range from A to Z. This
letter is based on the order in which the integrated adapters are defined in residual
data. This ensures unique location codes for the integrated adapters. The second
character (D) is set to 0.
Note: You need to know which system architecture the system unit on which you are
working uses. If you are working with a RSPC model use the “Location Codes
for RSPC Model Architecture System Units” on page 242. If you do not know
which model you have, refer to “Determining System Architecture” on page 165
before proceeding.
The (CHRP) system unit uses Physical Location Codes in conjunction with AIX Location
Codes to provide mapping of the failing field replaceable units. The location codes are
produced by the system unit’s firmware and the AIX operating system.
Note: Diagnostic Versions 5.2.0 and later display physical location codes for all
resources. Diagnostic versions earlier than 5.2.0 show a mixture of physical
location codes and AIX location codes.
The P2/E1 is the physical location code indicating an Ethernet port built into the
P2 planar.
The 10-60 is an AIX location code indicating a PCI parent bus of 10, and a
devfunc number of 60 (for more information , see “AIX Location Codes” on
page 248).
The period (.) is used to identify sub-locations such as memory DIMMs on a base
memory card or a specific SCSI address. The following are examples:
v P1-M1.4 identifies DIMM 4 on memory card 1 on planar 1.
v U1-P1-M2.12 identifies DIMM 12 on memory card in slot 2 on the system planar.
v P1-C1.1 identifies CPU 1 on CPU card 1 on planar 1.
v P2/Z1-A3.1 identifies a SCSI device with a SCSI address of LUN 1 at SCSI ID 3
attached to SCSI bus 1 from planar 2.
v P1-I2#E3.2 identifies the second cable in a series of cables attached to Ethernet
controller 3 in slot 2 (I2) on planar 1.
Depending on the AIX and firmware levels, AIX Diagnostics may include extended
location information when identifying a planar or card. The extended location
information or cable information is always included when identifying a cable or
connector. Location codes with extended location information that display without a
description identifying the devices, always identify the cable attached to the port.
Note: AIX logical location codes may still seen and supported under various AIX
commands and functions. However, the Diagnostic screens and menus display
physical location codes for resources when running versions 5.2.0 and later. For
systems with 5.2.0 or later, refer to “Physical Location Codes” on page 244.
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
v The AB value identifies a bus type or PCI parent bus as assigned by the firmware.
v The CD value identifies adapter number, adapter’s devfunc number, or physical
location. The devfunc number is defined as the PCI device number times 8, plus the
function number.
v The EF value identifies a connector.
v The GH value identifies a port, address, device, or FRU.
Adapters and cards are identified only with AB-CD. The possible values for AB are:
00 Processor bus
01 ISA bus
02 EISA bus
03 MCA bus
04 PCI bus used in the case where the PCI bus cannot be identified
05 PCMCIA buses
xy For PCI adapters where x is equal to or greater than 1. The x and y are characters in
the range of 0-9, A-H, J-N, P-Z (O, I, and lower case are omitted) and are equal to the
parent bus’s ibm, aix-loc Open Firmware Property.
For the integrated adapters, EF-GH is the same as the definition for the pluggable
adapters. For example, the location code for a diskette drive is 01-D1-00-00. A second
diskette drive is 01-D1-00-01.
Where:
v AB-CD-EF are the same as non-SCSI devices.
v G defines the control unit address of the device. Values of 0 to 15 are valid.
v H defines the logical unit address of the device. Values of 0 to 255 are valid.
There is also a bus location code that is generated as ’00-xxxxxxxx’ where xxxxxxxx is
equivalent to the node’s unit address. Refer to the system unit service guide for
additional information.
The SRNs listed in the following chapters are for all systems and devices for which this
version of the diagnostic programs can produce an SRN. Use the table “Service
Request Number Lists” on page 252 to select the correct chapter for your SRN.
Note: If you are servicing an SP system, do not do the Action listed. Instead, always
return to the SP System Service Guide.
If you are servicing a clustered Eserver, do not do the Action listed. Instead,
always return to the Clustered Eserver Installation and Service Guide.
251
Using the Service Request Number List
The service request number list is in numerical sequence by the SRN. Use the list as
follows:
1. Find your SRN in the table.
2. Record the code letter for the Source of SRN (the SRN Src. column).
3. Record the failing function codes in the order listed.
4. Perform the action shown in the Description and Action column.
Notes:
a. If you cannot find SRN information in the ″Service Request Number List,″ check
for the existence of supplemental material supporting the device for which the
SRN was generated.
b. x in an SRN represents any digit or character.
c. If your SRN is not listed, check to see if xxx or xxxx has been used. The -xxx or
-xxxx should always be the last SRN identified within a specific prefix. An
example would be 950-xxxx. The xxxx is the last digit within the 950 prefix.
d. If you are servicing a SP system, do not do the Action listed. Instead, always
return to the SP Hardware Manual.
e. If you are servicing a clustered Eserver, do not do the Action listed. Instead,
always return to the Clustered Eserver Installation and Service Guide.
5. When replacing a FRU, use Chapter 23, “MAP 0410: Repair Checkout” on page 125
to verify the fix.
Note: There may be 7 or 8 digit SRNs within each range. If the SRN contains 3 digits
before the dash, then select the chapter that contains the first 3 digits, followed
by the dash, of the SRN you wish to locate. For example, if the SRN is
802-xxxx, you would go to Chapter 33, “SRNs 700-102 through 89c-302”,
because this SRN falls between the number range of 700-102 and 89c-302.
However, if your SRN contains 4 digits before the dash, then select the chapter
that contains the first 4 digits, followed by the dash, of the SRN you wish to
locate. For example, if the SRN is 2570–603, you would go to Chapter 36,
“SRNs 2520-101 through 25C1-(x)xxx”, because this SRN falls between the
number range of 2520-(x)xxx and 2590-(x)xxx.
The following table identifies each SRN chapter range and the page number on which it
begins:
AA SSA Address
255
Service SRN Failing Description and Action (Unless otherwise indicated,
Request Src. Function use Chapter 11, “MAP 0210: General Problem
Number Codes Resolution” on page 67.)
2A005 Description: Async code 05 was received. This code
indicates that a disk drive module detected the loss of
redundant power or cooling. Action: Refer to the SRN
table in the Service Guide for the unit containing the disk
drive.
2A006 Description: Async code 06 was received. This code
indicates that a disk drive module detected the loss of
redundant power or cooling. Action: Refer to the SRN
table in the Service Guide for the unit containing the disk
drive.
2A106 Description: Async code 06 was received. This code
indicates that multiple disk drive modules detected loss
of redundant power or cooling. Action: Refer to the SRN
table in the Service Guide for the unit containing the disk
drive.
2A206 Description: A disk drive module detected that one of its
SSA links failed the POST. Action: Refer to the SRN
table in the Service Guide for the unit containing the disk
drive.
2FFFF Description: An async code that is not valid was
received. Action: Refer to the SRN table in the Service
Guide for the unit containing the disk drive and the
User’s Guide and Maintenance Information for the SSA
adapter.
300C0 Description: A disk drive module detected the loss of
redundant power or cooling. Action: Refer to the SRN
table in the Service Guide for the unit containing the disk
drive.
301C0 Description: Multiple disk drive modules detected the
loss of redundant power or cooling. Action: Refer to the
SRN table in the Service Guide for the unit containing
the disk drive.
303FE Description: A disk drive Microcode Error was detected.
Action: Refer to the SRN table in the Service Guide for
the unit containing the disk drive and the User’s Guide
and Maintenance Information for the SSA adapter.
303FF Description: An SCSI status that is not valid was
received. Action: Refer to the SRN table in the Service
Guide for the unit containing the disk drive and the
User’s Guide and Maintenance Information for the SSA
adapter.
31000 Description: The disk drive was reset by the adapter.
The disk drive might be going to fail. Action: Refer to the
SRN table in the Service Guide for the unit containing
the disk drive.
Note: Some SRNs in this chapter may have 4 rather than 3 digits after the dash (-).
267
Service SRN Failing Description and Action (Unless otherwise indicated,
Request Src. Function use Chapter 11, “MAP 0210: General Problem
Number Codes Resolution” on page 67.)
101-551 A Description: The system hung while loading the
to software. This can be caused by a hardware or software
101-557 problem. Action: Run the standalone diagnostics. If the
standalone diagnostics run correctly, the problem may
be a damaged disk data. Consider having the customer
contact Software Support before reinstalling the
operating system on the disk. Otherwise, go to
Chapter 4, “MAP 0020: Problem Determination
Procedure” on page 29. If a different problem occurs
when you run the standalone diagnostics, correct that
problem. If the standalone diagnostics still produces the
same SRN, go to MAP 1540 in either the service guide
or the installation and service guide for this system unit.
101-558 A Description: There is not enough memory to execute
diagnostics. Action: There must be a minimum of 16 MB
of installed memory. If the system has 16 MB or more of
memory installed, suspect a problem with a memory
card.
101-559 A Description: The system halted while software was
to loading. This problem may be attributed to either
101-599 hardware or software. Action: Use the standalone
diagnostics if not yet used. If the same SRN is
generated from standalone diagnostics, go to MAP 1540
in the system unit service guide for problem isolation.If a
different problem occurs when you run the standalone
diagnostics, correct that problem.
If the fan still is not turning replace the fan. If the fan is
turning, replace the fan sensor FRU.
651-802 G Description: Fan failure. Action: Use MAP 210. Instead
of failing function codes, use the physical location
code(s) from the diagnostic problem report screen to
determine the FRUs.
651-809 G Description: Power fault warning due to unspecified
cause. Action: Use MAP 210. Instead of failing function
codes, use the physical location code(s) from the
diagnostic problem report screen to determine the FRUs.
651-810 G 152 Description: Over voltage condition was detected Action:
2E2 Shut the system down and do the following before
replacing any FRUs.
1. Visually inspect the power cables and reseat the
connectors.
2. Run the following command diag -Avd sysplanar0.
When the Resource Repair Action menu displays,
select sysplanar0.
377
Service SRN Failing Description and Action (Unless otherwise indicated,
Request Src. Function use Chapter 11, “MAP 0210: General Problem
Number Codes Resolution” on page 67.)
700-137 D 700 Description: Unit attention condition has occurred on the
B88 Send Diagnostic command.
190
700-138 D 190 Description: Error log indicates that the disk is operating
at higher than recommended temperatures.
700-139 D 700 Description: Error configuring or opening device. Action:
Use Chapter 11, “MAP 0210: General Problem
Resolution” on page 67.
700-640 G 640 Description: Error Log Analysis indicates a path error.
701-102 D 701 Description: An unrecoverable media error.
B88
701-104 D 701 Description: The motor failed to restart.
701-105 D 701 Description: The drive did not become ready.
701-106 D 701 Description: The electronics card test failed.
701-108 D 701 Description: The bus test failed.
701-110 D 701 Description: The media format is corrupted.
701-112 D 701 Description: The diagnostic test failed.
701-114 D 701 Description: An unrecoverable hardware error.
701-116 D Description: A protocol error. Action: Check the levels of
the device, adapter, diagnostic software, and application
software to ensure they are all compatible. If you do not
find a problem, call your support person.
701-117 D 701 Description: A write-protect error occurred.
701-118 D 701 Description: A SCSI command timeout.
B88
701-120 D 701 Description: A SCSI busy or command error.
701-122 D 701 Description: A SCSI reservation conflict error.
701-124 D 701 Description: A SCSI check condition error.
701-126 D 701 Description: A software error was caused by a hardware
B88 failure.
701-128 G 701 Description: The error log analysis indicates a hardware
failure.
701-129 G 701 Description: Error log analysis indicates a SCSI bus
problem.
701-130 G 701 Description: Error log analysis indicates a problem
B88 reported by the disk drive’s self-monitoring function.
701-132 D 701 Description: A disk drive hardware error occurred.
701-133 D 701 Description: Use the subsystem guide to complete
diagnostics.
701-134 D B88 Description: The adapter failed to configure.
software
461
Service SRN Failing Description and Action (Unless otherwise indicated,
Request Src. Function use Chapter 11, “MAP 0210: General Problem
Number Codes Resolution” on page 67.)
900-234 D software Description: IOCTL_GSC_HANDLE_FAILED.
900-235 D software Description: AIXGSC_MAKE_GP_FAILED.
900-236 D software Description: AIXGSC_UNMAKE_GP_FAILED.
900-237 D software Description: DEVICE_BUSY_ERROR.
900-241 D Info code Description: The SCROLLING_17_H_TEST failed.
900-263 D Info code Description: The EMC_SCROLLING_21_H_TEST failed.
900-2FF D 900 Description: Image transfer across Plane Test failed.
227
900-301 D 900 Description: A write of ″0x00″ to the palette register
227 failed.
900-302 D 900 Description: A write of ″0x15″ to the palette register
227 failed.
900-303 D 900 Description: A write of ″0x2A″ to the palette register
227 failed.
900-304 D 900 Description: A write of ″0x3F″ to the palette register
227 failed.
900-305 D 900 Description: The test of the palette registers failed.
227
900-3FF D 900 Description: The test of the palette registers failed.
227
900-401 D 900 Description: Frame buffer base address inconsistent.
227
900-402 D 900 Description: VRAM inaccessible.
227
900-403 D 900 Description: Miscompare found in VRAM.
227
900-404 D 900 Description: The test of the VRAM failed.
227
900-447 D 900 Description: Green screen error.
227
900-449 D 900 Description: Blue Screen error.
227
900-471 D 900 Description: 9 x 7 Cross hatch grid failed.
227
900-473 D 900 Description: 11 x 9 Cross hatch grid failed.
227
900-495 D Info code Description: The SCROLLING_17_H_TEST failed.
900-497 D Info code Description: The SCROLLING_21_H_TEST failed.
900-4FF D 900 Description: The test of the VRAM failed.
227
Note: Some SRNs in this chapter may have 4 rather than 3 digits after the dash (-).
Use the physical location codes and FRU numbers listed on the diagnostics Problem
Report Screen.Failing Function Codes (FFCs) are not used in this group of SRNs. For
more detailed information refer to Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
535
Service SRN Failing Description and Action
Request Src. Function
Number Code
A01-07x G Description: System bus parity error. Action: Use
Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A01-08x G Description: System bus protocol/transfer error. Action:
Use Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A02-00x G Description: Error log analysis indicates an error
detected by the memory controller, but the failure could
not be isolated. Action: Use Chapter 13, “MAP 0230:
Platform Error Problem Resolution” on page 83.
A02-01x G Description: Uncorrectable Memory Error. Action: Use
Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A02-02x G Description: ECC correctable error. Action: Use
Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A02-03x G Description: Correctable error threshold exceeded.
Action: Use Chapter 13, “MAP 0230: Platform Error
Problem Resolution” on page 83.
A02-04x G Description: Memory Control subsystem internal error.
Action: Use Chapter 13, “MAP 0230: Platform Error
Problem Resolution” on page 83.
A02-05x G Description: Memory Address Error (invalid address or
access attempt). Action: Use Chapter 13, “MAP 0230:
Platform Error Problem Resolution” on page 83.
A02-06x G Description: Memory Data error (Bad data going to
memory). Action: Use Chapter 13, “MAP 0230: Platform
Error Problem Resolution” on page 83.
A02-07x G Description: Memory bus/switch internal error. Action:
Use Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A02-08x G Description: Memory time-out error. Action: Use
Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A02-09x G Description: System bus parity error. Action: Use
Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A02-10x G Description: System bus time-out error. Action: Use
Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
A02-11x G Description: System bus protocol/transfer error. Action:
Use Chapter 13, “MAP 0230: Platform Error Problem
Resolution” on page 83.
555
Service SRN Failing Description and Action (Unless otherwise indicated,
Request Src. Function use Chapter 11, “MAP 0210: General Problem
Number Codes Resolution” on page 67.)
2520-301 D 2520 Description: Configuration open failed for parent bus.
221
2520-801 G Description: Error Log Analysis indicates a terminator
problem. Action: Run diagnostic on the adapter in
System Verification Mode. If an SRN is reported, use the
SRN to resolve the problem. If a SRN is not reported,
use the SCSI service hints to resolve the problem.
2520-802 G Description: Error Log Analysis indicates that multiple
attempts to reset the SCSI bus have timed out. Action:
Run diagnostic on the adapter in System Verification
Mode. If an SRN is reported, use the SRN to resolve the
problem. If a SRN is not reported, use the SCSI service
hints to resolve the problem.
2520-803 G Description: Error Log Analysis indicates that multiple
SCSI bus errors have occurred. Action: Run diagnostic
on the adapter and the devices in System Verification
Mode. If an SRN is reported, use the SRN to resolve the
problem. If a SRN is not reported, use the SCSI service
hints to resolve the problem.
2521-101 D 221 Description: Enhanced error handling failure on the bus.
2521-102 D 293 Description: Enhanced error handling failure on a
PCI-PCI Bridge.
2521-103 D 2521 Description: Enhanced error handling failure on the
221 adapter.
2521-227 D 2521 Description: SCSI adapter test failure.
221
2521-701 G 2521 Description: Error log analysis indicates a PCI SCSI
adapter failure.
2521-801 G Description: Error Log Analysis indicates a terminator
problem. Action: Run diagnostic on the adapter in
System Verification Mode. If an SRN is reported, use the
SRN to resolve the problem. If a SRN is not reported,
use the SCSI service hints to resolve the problem.
2521-802 G Description: Error Log Analysis indicates that multiple
attempts to reset the SCSI bus have timed out. Action:
Run diagnostic on the adapter in System Verification
Mode. If an SRN is reported, use the SRN to resolve the
problem. If a SRN is not reported, use the SCSI service
hints to resolve the problem.
2521-803 G Description: Error Log Analysis indicates that multiple
SCSI bus errors have occurred. Action: Run diagnostic
on the adapter and the devices in System Verification
Mode. If an SRN is reported, use the SRN to resolve the
problem. If a SRN is not reported, use the SCSI service
hints to resolve the problem.
2522-101 D 2522 Description: Adapter configuration error.
Note: When replacing a FRU, use Chapter 23, “MAP 0410: Repair Checkout” on
page 125 To verify the fix.
A function may not be physically packaged on the same FRU in different system units.
When this condition exists, the FRU part number for each type of system unit is listed.
Machine Type/Model
This column is used when the failing function is on a FRU which differs by machine
type and model. Use the part number for the type system unit you are servicing.
See “Machine Types” on page 564 for the names of the machine types.
Note: Although the machine cover logo may depict the model number as four digits,
the service and parts ordering system requires three-digit numbers. For example,
if the cover logo depicts model number xxxx, service and parts documentation
may refer to that model as xxx.
Part Number
This column contains the part number of the FRU that contains the failing function. Use
the part number for the type of system unit you are servicing.
563
Machine Types
Machine Description
Type
3151 Display Terminal
3161 Display Terminal
3163 Display Terminal
3514 External Disk Array, Models 212 and 213
3812 Pageprinter
3852 Printer
4201 Proprinter II
4202 Proprinter XL
4207 Proprinter X24
4208 Proprinter XL24
4216 Personal Pageprinter
4224 Printer
4234 Printer
4869 5.25-Inch External Diskette Drive
5081 Color Display
5083 Tablet
5085 Graphics Processor
5086 Graphics Processor
5088 Communications Controller
5202 Quietwriter Printer
5204 Quickwriter Printer
6094 Model 10, Dials
6094 Model 20, Lighted Program Function Keyboard (LPFK)
6094 Model 30, Spaceball™
6180 M1 color plotter
6182 Color Plotter
6184 Color Plotter
6185 Model 1 Color Plotter
6186 Color Plotter
6187 Plotter
7372 Plotter
7017 System Unit (Rack Mount), I/O Rack (with up to 4 I/O drawers)
7024 System Unit (Floor Standing)
7025 System Unit (Floor Standing)
7026 System Unit (Rack Mount)
7027 Disk Drive Drawer
7028 System Unit (Model 6C1, 6C4, 6E1, 6E4)
7038 System Unit (Model 6M2)
7039 System Unit (Model 651)
7040 System Unit (Model 670, 690)
7043 System Unit (Models 140, 150, 240, 260)
7044 System Unit (Models 170, 270) Floor Standing
7046 System Unit (Model B50) Rack Mount
7131 Model 105 SCSI Multi-Storage Tower
651
Failing
Description and Notes Function
Code
Adapter, FDDI (vendor logoed) 795
Adapter, 10/100/1000 Base-TX PCI-X Adapter 25C1
Adapter, GXT110P Graphics Adapter 900
Adapter, GXT120P Graphics Adapter 685
Adapter, GXT130P Graphics Adapter 655
Adapter, GXT135P Graphics Adapter 67E
Adapter, GXT250P PCI Graphics Adapter 786
Adapter, GXT255P High Performance PCI Graphics Adapter D96
Adapter, GXT300P Graphics Adapter 78D
Adapter, GXT500P Graphics 787
Adapter, GXT550P Graphics D95
Adapter, GXT800P Graphics (Base Card) 806
Adapter, GXT800P Graphics (Base DIMM) 298
Adapter, GXT800P Graphics (Texture DIMM) 297
Adapter, GXT1000 Graphics Attachment (SPAN) 908
Adapter, GXT2000P Graphics 657
Adapter, GXT3000P 3D PCI Graphics Adapter 778
Adapter, GXT4000P 3D PCI Graphics Adapter 78b
Adapter, GXT4500P 3D PCI Graphics Adapter 2550
Adapter, GXT6000P 3D PCI Graphics Adapter 68E
Adapter, GXT6500P 3D PCI Graphics Adapter 2551
Adapter, Serial HIPPI PCI 938
Adapter, Integrated Ethernet System Planar 887
Adapter, ISDN Basic Rate Interface 693
Adapter, Media Streamer Audio/Video Decoder 966
Adapter, Multibus Integrated Ethernet 790
Adapter, MVP Power Graphics 775
Adapter, PCI Gigabit Fibre Channel 677
Adapter, PCI 2-Gigabit Fibre Channel 447
Adapter, PCI Differential Ultra SCSI (wide/fast-20) (4-L) 2E6
Adapter, PCI Dual-Channel Ultra3 SCSI (4-Y) 2520
Adapter, PCI Gigabit Ethernet 669
Adapter, PCI IBM Cryptographic Accelerator 2570
Adapter, Universal PCI Differential Ultra SCSI (4-U) 2E6
Adapter, Dual-Channel PCI-2 to Ultra2 SCSI (4-R) 637
Adapter, Integrated Ultra2 SCSI 84A
659
Adapter Name Connector Type, Port Name, Part
or Cable Number
2-Gigabit Fibre Channel PCI Adapter Fiber Connector Wrap Plug 05N6768
64-bit/66 MHz PCI ATM MMF Adapter Fiber Connector Wrap Plug 21H3547
Gigabit Ethernet-SX PCI Adapter
Serial HIPPI PCI Adapter
TURBOWAYS 155 PCI MMF ATM PCI
Adapter
Gigabit Fibre Channel PCI Adapter Fiber Connector Wrap Plug 16G5609
64-bit/66 MHz PCI ATM 155 UTP Wrap Plug 21P8009
Adapter 42H0540
TURBOWAYS 25 ATM PCI Adapter
TURBOWAYS 155 PCI UTP ATM PCI
Adapter
X.25 Interface Co-Processor Adapter X.25 Adapter Wrap Plug 07F3132
X.21 Cable Wrap Plug 07F3153
V.24 Cable Wrap Plug 07F3163
V.35 Cable Wrap Plug 07F3173
661
662 Diagnostic Information for Multiple Bus Systems
Appendix C. System Memory Reference
663
Type and Model System Memory Base Memory Memory Memory Card Pairs
Configuration Card FRU Module Size Module FRU
Type/Size (Min./Max.) Number (Bytes) Number
7026/B80 System board 07L7065 128 MB 93H470 Yes*
(256 MB/16 GB) 256 MB 09P0550
512 MB 09P0491
7026/H10 (16 MB/1 GB) N/A 16 MB 42H4772 No
32 MB 42H2773
64M 42H2774
128M 93H6821
7026/H50 (64 MB/1 GB) 93H2641 32 MB 93H4700 Yes
128 MB 93H4702
7026/H70 (64 MB/8 GB) 93H2641 32 MB 93H4700 Yes
128 MB 93H4702
256 MB 07L9030
7026/6H0 (256 MB/16 GB) 04N4808 32 MB 07L7729 Quads
7026/6H1 128 MB 93H4702
256 MB 07L9030
512 MB 10L5417
1 GB 09P0335
7026/6M1 (1 GB/32 GB) 04N3033 128 MB 93H4702 Group of 8
256 MB 07L9030
512 MB 10L5417
1 GB 09P0335
7026/H80 (256 MB/16 GB) 04N4808 32 MB 07L7729 Quads
128 MB 93H4702
256 MB 07L9030
512 MB 10L5417
1 GB 09P0335
7026/M80 (1 GB/32 GB) 04N3033 128 MB 93H4702 Group of 8
256 MB 07L9030
512 MB 10L5417
7028/6C1 (512 MB/ 8 GB with all N/A 512 MB 09P0491 Yes
7028/6E1 processors)
7028/6C4 (I GB/16GB) N/A 256 MB 09P2705 Quads
7028/6E4 512 MB 09P2706
1 GB 09P2707
2 GB 53P3232
7038/6M2 (1 GB/64 GB) On Processor Card 512MB 53P3226 Quads
09P6257 1 GB 53P3230
4 GB 53P3232
7039/651 (4 GB/32 GB) 09P2927 4 GB N/A No
09P2882 8 GB N/A No
7040/671 (4 GB/128 GB) Inner 53P1013 4 GB N/A No
Inner 53P0374 8 GB
Inner 53P0378 16 GB
Inner 53P0382 32 GB
Note: These attributes should be set before the diagnostic programs are loaded.
General Setup Attributes 3151 /11 /31 3151 /51 /61 3161 /3164 Description
/41 Settings Settings Settings
Machine mode IBM 3151 IBM 3151 PC IBM 3161 or The diagnostic programs are set to
IBM 3164 emulate use of the 3161 ASCII Display
Terminal. If your terminal can emulate a
5085, 3161 or 3164 terminal, use the
following attribute settings. Otherwise,
refer to your operator’s manual, compare
the follow attribute descriptions with
those of your terminal, and set your
attributes accordingly.
Generated Code Set ASCII
Screen Normal Normal Uses the EIA-232 interface protocol.
Row and Column 24 x 80 24 x 80 Uses the EIA-232 interface protocol.
Scroll Jump Jump Jump When the last character on the bottom
line is entered, the screen moves down
one line.
Auto LF Off Off Off For the ″On″ setting, pressing the Return
key moves the cursor to the first
character position of the next line. For
the ″Off″ setting, pressing the Return key
moves the cursor to the first character
position of the current line. The CR and
LF characters are generated by the New
line setting.
CRT saver Off Off 10 The ″10″ setting causes the display
screen to go blank if there is no activity
for 10 minutes. When the system unit
sends data or a key is pressed, the
screen contents are displayed again.
Line wrap On On On The cursor moves to the first character
position of the next line in the page after
it reaches the last character position of
the current line in the page.
Forcing insert Off Off
Tab Field Field Field The column tab stops are ignored, and
the tab operation depends on the field
attribute character positions.
667
General Setup Attributes 3151 /11 /31 3151 /51 /61 3161 /3164 Description
/41 Settings Settings Settings
Trace All Both inbound data (data to the system
unit) and outbound data (data from the
system unit) to and from the main port
can be transferred to the auxiliary port
without disturbing communications with
the system unit when the Trace key is
pressed.
Note: These logs are analyzed automatically by AIX diagnostics and should not require
manual analysis, however, a situation may arise making it useful to examine the
detail manually, possibly supplying you with more information.
See Figure 1 for the general structure of the error log data.
Variable Length,
8 Bytes 40 Bytes 4 Bytes Ends in Null To End of Log
(<00>)
As shown in Figure 1:
v The Error Log Fixed Part is always present, and provides error information about the
type of error, its severity, and how it was handled. The Error Log Fixed Part also
indicates the presence and length of the extended error log information.
v The Extended Error Log portion provides more specific information about the cause,
type, and location of the error.
v The next 4 bytes, containing the ASCII characters IBM and a null character, indicate
that IBM-specific extensions to the standard CHRP log architecture follow.
v A variable length string follows, containing the physical locations of any Field
Replaceable Units (FRUs) implicated by the failure.
Location codes are separated by blanks (0x20), and the string ends in a null
character (0x00). Refer to “Location Codes for CHRP Model Architecture System
Units” on page 244 for more information on CHRP Location Codes.
671
v Other data may follow after the location codes, but it is unarchitected engineering
debug information that varies from system to system.
More detailed information on each portion of the log format is available in later sections
of this chapter. Refer also to “Location Codes for CHRP Model Architecture System
Units” on page 244 for more information on CHRP Location Codes.
LABEL: SCAN_ERROR_CHRP
IDENTIFIER: BFE4C025
Description
UNDETERMINED ERROR
Fixed Part (8 bytes)
Failure Causes Extended Error Log (40 bytes)
UNDETERMINED
"IBM <0x00>" (4 bytes)
Recommended Actions
Location Codes (variable):
RUN SYSTEM DIAGNOSTICS
"P1-C1 P1<0x00>"
Detail Data Debug Data Records
PROBLEM DATA
0144 1000 0000 006A C600 9108 0014 3700 1997 0221 0200 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 4942 4D00 5031 2D43 3120 5031 0000 0000
0008 5331 0000 0300 0008 5332 0000 007F 0008 4D31 1000 0000 0008 4831 0000 0000
0008 4832 0000 0000 0008 4833 0000 0000 0002 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000
Note: In Table 2, the location of each field within the integer is included in parentheses
after its name. Numerical field values are indicated in decimal unless noted
otherwise.
Internal Errors:
v RETRY (1): too many tries failed, and a retry count expired
v TCE_ERR (2): range or access type error in an access through a
TCE
v INTERN_DEV_FAIL (3): some RTAS-abstracted device has failed (for
example, TODC)
v TIMEOUT (4): intended target did not respond before a time-out
occurred
v DATA_PARITY (5): Parity error on data
v ADDR_PARITY(6): Parity error on address
v CACHE_PARITY (7): Parity error on external cache
v ADDR_INVALID(8): access to reserved or undefined address, or
access of an unacceptable type for an address
v ECC_UNCORR (9): uncorrectable ECC error
v ECC_CORR (10): corrected ECC error
v RESERVED (11-63): Reserved for future use
Figure 3 and Table 3 on page 676 shows the general layout for the extended error log
format, while Table 4 on page 678 through Table 10 on page 682 show the detailed
layout of bytes 12 through 39. The detail area format is determined by bits 4:7 of byte
2, which indicate the error log type.
IBM Specific 44
Extensions Vendor ID
40
Detail Data 16
Vendor ID
12
Header 12
0
Figure 3. Layout of extended error log format from RTAS
Product-unique data (location codes and debug information) is added to the end of the
extended error log buffer (starting at byte 40) for capture and logging.
Note: The following log formats are designed to support the representation of integer
values in either the Big-Endian (AIX, Apple) or Little-Endian (Intel) formats. For
AIX, this is always Big-Endian format, which means there is no byte swapping,
and bits run from 0 (high-order, leftmost) to 7 (low-order, rightmost).
Note: Time and Date values included in the CHRP Extended Error Log format (bytes
4-11) are recorded in Universal Time Coordinated (UTC) which essentially is
Greenwich Mean Time. UTC is also the way time is reported from the Time of
Day clock hardware. Be aware that UTC time is NOT the same as local time that
is usually presented by the operating system. The system has built-in functions
to manage time differences and takes into account special cases such as
Daylight Savings Time. For example, you may see an AIX error log with an AIX
time stamp of 12:00 pm, containing a CHRP error log in the Detail Data where
the encoded time stamp is 5:00 pm. This difference is actually the difference
between local time and UTC time.
The manufacturer may not offer the products, services, or features discussed in this
document in other countries. Consult the manufacturer’s representative for information
on the products and services currently available in your area. Any reference to the
manufacturer’s product, program, or service is not intended to state or imply that only
that product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any intellectual property right of the
manufacturer may be used instead. However, it is the user’s responsibility to evaluate
and verify the operation of any product, program, or service.
The manufacturer may have patents or pending patent applications covering subject
matter described in this document. The furnishing of this document does not give you
any license to these patents. You can send license inquiries, in writing, to the
manufacturer.
The following paragraph does not apply to the United Kingdom or any country
where such provisions are inconsistent with local law: THIS MANUAL IS
PROVIDED ″AS IS″ WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR
IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR
PURPOSE. Some states do not allow disclaimer of express or implied warranties in
certain transactions; therefore, this statement may not apply to you.
Information concerning products made by other than the manufacturer was obtained
from the suppliers of those products, their published announcements, or other publicly
available sources. The manufacturer has not tested those products and cannot confirm
the accuracy of performance, compatibility or any other claims related to products made
by other than the manufacturer. Questions on the capabilities of products made by other
than the manufacturer should be addressed to the suppliers of those products.
697
698 Diagnostic Information for Multiple Bus Systems
Index
699
I missing device problem resolution, MAP 0291 121
missing path resolution 146
I/O
missing resource problem resolution, MAP 0290 117
multipath 146
mpio 146
identify indicators 211
multipath I/O 146
indicators
multiple systems diagnostics 164
fault 203
installation checkout procedure 135
intermittent problem isolation, MAP 0040 47
N
network service aid 212
L NIM standalone diagnostics 155
local area network service aid 212
location code standard prefixes 246
O
operator panel display numbers 225
location codes 242, 244, 248
log repair action service aid 212 P
logical partition 1 partitioned system 1
LPAR 1 platform error problem resolution, MAP 0230 83
standalone diagnostics 163 printing
from disk 140
M problem determination procedure 45
machine types 564
problem determination procedure, MAP 0020 29
MAPs
problem determination procedure, SCSI RAID , MAP
fast path 17
0025 41
MAP 0020, problem determination procedure 29
MAP 0025, SCSI RAID problem determination R
procedure 41 related publications x
MAP 0030, additional problem determination 45 repair checkout, MAP 0410 125
MAP 0040, intermittent problem isolation 47 repair log service aid 212
MAP 0050, SCSI bus problems 51 reset fault indicators 203
MAP 0070, 888 sequence 59 run exercisers 214
MAP 0080, system bus problem isolation 63
MAP 0210, general problem resolution 67 S
MAP 0220, hot-swap FRU problem resolution 77 scan dump policy 187
MAP 0230, platform error problem resolution 83 SCSI
MAP 0235, system array self-repair problem bus analyzer 218
resolution 91 configuration checks 3
MAP 0240, memory problem resolution 93 high availability 3
MAP 0250, unexpected system halts during high-availability diagnostics 166
diagnostics 97 multiple SCSI system checks 3
MAP 0260, system hangs during resource SCSI bus problems, MAP 0050 51
configuration 99 SCSI RAID Problem Resolution and Verification, MAP
MAP 0270, SCSI RAID Problem Resolution and 0270 103
Verification 103 service aids
MAP 0280, boot problem resolution 111, 113 7135 RAIDiant array 223
MAP 0290, missing resource problem 7318 service aid 224
resolution 117 add resource 177
MAP 0291, missing device problem resolution 121 analyze adapter internal log 177
MAP 0410, repair checkout 125 backup/restore media 178
MAP 0420, system checkout 133 callin/out setup 188
service hints 1 CE login 141
Slow Boot 2 certify media 179
start of call 15 change hardware VPD 182
using the maintenance analysis procedures 1 configure dials and LPF Keys 182
memory problem resolution, MAP 0240 93 configure ISA adapter 182
memory testing 143 configure reboot policy 183
microcode 139 configure remote maintenance policy 184
Index 701
702 Diagnostic Information for Multiple Bus Systems
Readers’ Comments — We’d Like to Hear from You
RS/6000
Eserver pSeries
Diagnostic Information for Multiple Bus Systems
Version 5.2.0.0
Overall, how satisfied are you with the information in this book?
How satisfied are you that the information in this book is:
When you send comments to IBM, you grant IBM a nonexclusive right to use or distribute your comments in any
way it believes appropriate without incurring any obligation to you.
Name Address
Company or Organization
Phone No.
_________________________________________________________________________________
Readers’ Comments — We’d Like to Hear from You Cut or Fold
SA38-0509-17 Along Line
NO POSTAGE
NECESSARY
IF MAILED IN THE
UNITED STATES
Information Development
Department H6DS-905-6C006
11501Burnet Road
Austin, TX 78758-3493
_____________________________________________________________________________
Fold and Tape Please do not staple Fold and Tape
Cut or Fold
SA38-0509-17 Along Line
Printed in U.S.A.
December 2002
SA38-0509-17
Spine information: