Use this procedure to isolate a failing I/O adapter.
If you need additional information
for failing part numbers, location codes, or removal and replacement
procedures, see Part locations
and location codes. Select your machine type and model number to
find additional location codes, part numbers, or replacement procedures
for your system.
- Is the partition an AIX® partition
or a Linux partition, or is
a management console attached?
- No:
- Continue with the next step.
- Yes:
- Go to PCI bus isolation using AIX, Linux, or the management console to isolate a
PCI bus problem from an AIX partition,
a Linux partition, or a management
console.
- If the system is not IPLed, will it IPL to DST?
- No:
- Perform MABIP54. This ends the procedure.
- Yes:
- From the SAL display for the reference code, record the count.
Continue with the next step.
- Go to the SST/DST display in the partition which reported
the problem. Use STRSST if the operating system is running; use function
21 if STRSST does not work; or IPL the partition to DST.
- On the Start Service Tools Sign On display, type in a user
ID with QSRV authority and password.
- Select Start a service tool > Hardware service
manager > Logical hardware resources > System bus resources.
- Is there a resource name logged in the SAL entry?
- No:
- Continue with the next step.
- Yes:
- Go to step 13
- Do you have a location for the I/O processor?
- No:
- Record the Direct Select Address (DSA), word 7 of the reference
code, from the SAL display. Then continue with the next step.
- Yes:
- Go to step 11
- Return to the HSM System bus resources display.
- Locate the I/O processor by performing the following:
- Select Display detail.
- Compare the DSA with the bus, card, and board information
for the IOP.
Note: The card information on the HSM display
is in decimal format. You must convert the decimal card information
to hexadecimal format to match the DSA format.
- Repeat this step until you find the IOP with the same
DSA.
- Select Cancel, and then go to step 14.
- Locate the I/O processor in HSM by performing
the following for each IOP:
- Select Associated packaging resources > Display detail.
- Repeat until you find the IOP with the same location.
- Select Cancel > Cancel and go to step 14.
- Page forward until you find the multi-adapter
bridge and IOP where the problem exists. Verify that the multi-adapter
bridge and IOP are correct by matching the resource names on the display
with the resource names in the SAL for the problem you are working
on.
- For the IOP you are working on, select Resources
associated with IOP (if the I/O adapters are not already displayed).
- If there is an IOA that is listed in any state other than "operational",
then perform steps 16 through 19, starting with the disabled IOA by
moving the cursor to the disabled IOA. Otherwise, move the cursor
to the first IOA that is assigned to the IOP.
- Select Associated packaging resources >
Concurrent maintenance > Power off domain. Record the unit ID
of the slot you are powering off. Did the domain power off successfully?
- No:
- Choose from the following options:
- If only one IOA was listed as failing, power down the system and
replace the IOA. Re-IPL the system. If a different reference code
occurred, perform problem analysis and work that reference code. If
there was no reference code, go to Verify a repair . This
ends the procedure.
- If there were multiple failed IOAs and concurrent maintenance
did not work on one, then move to the next failed IOA and repeat steps 16 through 19.
- If concurrent maintenance does not work for multiple failed IOAs,
this procedure will not be able to identify a failing I/O adapter.
Return to the procedure that sent you here. This ends the procedure.
- Yes
- Perform MABIP05 and then return here
and continue with the next step.
- Did the IOP reset and IPL successfully?
- No:
- This procedure will not be able to identify a failing I/O adapter.
Return to the procedure that sent you here. This ends the procedure.
- Yes:
- Check for the same failure that sent you to this procedure. Check
the system control panel, the SAL for the partition that reported
the problem, or the Work with partition status display for the partition
that reported the problem. In the SAL, the count will increase if
the reference code occurred again. Continue with the next step.
- Did the same reference code occur after the IOP was reset
and you performed an IPL?
- No:
- Go to step 20.
- Yes:
- Perform the following steps:
- Go to the Hardware Service Manager display.
- Go to Packaging Hardware Resources.
- Power on the IOA by selecting Power on domain.
- Reassign the IOA to the IOP
- Return to the HSL resource display, showing the IOP and associated
resources.
- Continue with the next step.
- Is there any other IOA, assigned to the IOP,
that you have not already powered off and on?
- No:
- Go to step 22.
- Yes:
- Move the cursor to another IOA assigned to the IOP, choosing IOAs
with a status of "unknown" or "disabled" before moving
on to IOAs with a status of "operational". Go to step 16.
- The failing IOA is located. Exchange the I/O
adapter that you just powered off. Use the location you recorded in
step 16 to locate the IOA.
- Power on the IOA that you just exchanged. Does the same
reference code that sent you to this procedure still occur?
- No:
- You have exchanged the failing IOA. Go to Verify a repair . This
ends the procedure.
- Yes:
- The IOA is not the failing item. Remove the IOA and reinstall
the original IOA. Continue with the next step.
- No failing IOAs were identified. Return to
the procedure that sent you here. This ends the procedure.