Identifying the location of the GPU
The error message provides information to help you to determine the location of the graphics processing unit (GPU).
On an 8335-GCA or 8335-GTA system, the log might
contain an error message similar to the following
text:
EEH: PHB#0 failure detected, location: Slot5
On an 8335-GTB system,
the log might contain an error message similar to the following
text:
EEH: PHB#0 failure detected, location: GPU1
If you have an 8335-GTB system with Red
Hat Enterprise Linux 7.4 or later, and if you get an error
message with only PCI bus information (for example, 0002:01:00.0), you can
determine the GPU slot information by using the lshw command. Complete the
following steps:
- Record the PCI bus information that is in the error message.
- Log in to the operating system with root authority.
- Type the following command and press Enter:
lshw -class display
- Determine the GPU slot that is associated with the PCI bus information that you recorded in step 1.
Use the following table to map the slot or GPU number information in the operating system log to the GPU description and service action. This ends the procedure.
| Slot number information from the log | GPU description | Service action |
|---|---|---|
| Slot5 | GPU 2 | Replace the GPU indicated in the GPU description column. Go to 8335-GCA and 8335-GTA locations to identify the physical location and the removal and replacement procedure. |
| Slot2 | GPU 1 |
| GPU number information from the log | GPU description | Service action |
|---|---|---|
| GPU1 | GPU 1 | Replace the GPU indicated in the GPU description column. Go to 8335-GTB locations to identify the physical location and the removal and replacement procedure. |
| GPU2 | GPU 2 | |
| GPU3 | GPU 3 | |
| GPU4 | GPU 4 |