EPUB_PRC_GPU_ISOLATION_PROCEDURE isolation procedure
Learn how to identify the service action that is needed to resolve a graphics processing unit (GPU) problem.
- Is the system an 8335-GTB?
  - Yes: Continue with the next step.
  - No: Go to Contacting IBM service and support. This ends the procedure.
- Use the ipmitool command to examine system event logs (SELs).
  - To list SELs by using an in-band network, use the following command:
    ipmitool sel elist
  - To list SELs remotely over the LAN, use the following command:
    ipmitool -I lanplus -U <username> -P <password> -H <BMC IP address or BMC hostname> sel elist
- Identify all SELs with CPU Func or CPU Core Func in the description. Did you find one or more SELs with CPU Func or CPU Core Func in the description?
  - Yes: Continue with the next step.
  - No: Go to Contacting IBM service and support. This ends the procedure.
- For each of the SELs that you identified in step 3, is the sensor name CPU Func 1 or CPU Core Func x, where x is 1 - 12?
  - Yes: Continue with the next step.
  - No: Continue with step 6.
- Replace the following items one at a time until the problem is resolved:
  Note: Go to 8335-GTB locations to identify the physical location and the removal and replacement procedure.
  - CPU 1
  - GPU 2
  - GPU 1
  - System backplane
- Is the sensor name CPU Func 2 or CPU Core Func x, where x is 13 - 24?
  - Yes: Continue with the next step.
  - No: Go to Contacting IBM service and support. This ends the procedure.
- Replace the following items one at a time until the problem is resolved:
  Note: Go to 8335-GTB locations to identify the physical location and the removal and replacement procedure.
  - CPU 2
  - GPU 4
  - GPU 3
  - System backplane
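
The sensor-name checks in this procedure can be sketched as a small Python helper. This is an illustrative sketch only, not an IBM-supplied tool: the function name `replacement_order` and the constants are hypothetical, and the helper assumes only that the sensor name (for example, CPU Func 1 or CPU Core Func 14) appears somewhere in a line of `ipmitool sel elist` output. It makes no assumption about the rest of the line layout.

```python
import re

# Replacement order for each processor, copied from the procedure above.
CPU1_ITEMS = ["CPU 1", "GPU 2", "GPU 1", "System backplane"]
CPU2_ITEMS = ["CPU 2", "GPU 4", "GPU 3", "System backplane"]

# Matches "CPU Func <n>" or "CPU Core Func <n>" anywhere in a SEL line.
SENSOR_RE = re.compile(r"CPU (Core )?Func (\d+)")


def replacement_order(sel_line):
    """Return the ordered list of items to replace for one SEL line.

    Returns None when the line does not name a CPU Func or CPU Core Func
    sensor in the ranges covered by the procedure; in that case the
    procedure directs you to contact IBM service and support.
    """
    match = SENSOR_RE.search(sel_line)
    if match is None:
        return None
    is_core_sensor = match.group(1) is not None
    number = int(match.group(2))
    if is_core_sensor:
        # CPU Core Func 1 - 12 belong to CPU 1, 13 - 24 to CPU 2.
        if 1 <= number <= 12:
            return CPU1_ITEMS
        if 13 <= number <= 24:
            return CPU2_ITEMS
    else:
        # CPU Func 1 or CPU Func 2 name the processor directly.
        if number == 1:
            return CPU1_ITEMS
        if number == 2:
            return CPU2_ITEMS
    return None
```

For each line of SEL output, call `replacement_order(line)` and replace the returned items one at a time, in order, until the problem is resolved.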