Identifying a service action by using sensor and event information for the 8348-21C

You can use the sensor and event information from the system event log to determine a service action to perform for the IBM® Power® System S812LC (8348-21C).

If you have not done so already, complete Identifying a service action by using system event logs. Then, use the following table to determine the service action to perform.
Table 1. Sensor information, event description, and service action for the 8348-21C
Sensor name (Sensor ID) Event description Service action
Watchdog (0x00)
  • Timer Expired
  • Reserved1
  • Reserved2
  • Reserved3
  • Reserved4
No service action is required.
  • Hard Reset
  • Power Down
  • Power Cycle
  • Timer Interrupt
SEL events with OEM record c0 | 000e000 | 3a150xxxxxxx indicate that a boot failed. Search for boot failure SEL events that have a time stamp in close proximity to the time stamp of this SEL event. If events exist, go to Resolving a system firmware boot failure. If there are no boot failure SEL events and the system booted correctly, no service action is required.
Host Status (0x04) Unknown Go to Getting fixes and update the system firmware to the most recent level of firmware that is available. If this SEL event continues to be logged each time you power on the system, go to Collecting diagnostic data. Then, go to Contacting IBM service and support.
  • S0/Go "Working"
  • S1 "Sleeping with system h/w & processor context maintained"
  • S2 "sleeping, processor context lost"
  • S3 "sleeping, processor & h/w context lost, memory retained"
  • S4 "non-volatile sleep / suspend-to disk"
  • S5 / G2: "soft-off"
  • S4 / S5: "soft-off"
  • G3 mechanical Off
  • Sleeping in an S1/S2/S3 State
  • G1: Sleeping
  • S5: entered by override
  • Legacy ON state
  • Legacy OFF state
No service action is required.
FW Boot Progress (0x05)
  • System Firmware Error
  • System Firmware Hang
SEL events with OEM record c0 | 000e000 | 3a150xxxxxxx indicate that a boot failed. Search for boot failure SEL events that have a time stamp in close proximity to the time stamp of this SEL event. If events exist, go to Resolving a system firmware boot failure.
System Firmware Progress No service action is required.
OCC Active (0x08) Device Disabled Replace the system processor. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • State Deasserted
  • Device Enabled
No service action is required.
Ambient Temp (0x0A)
  • Upper Critical - going low
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Lower Critical - going low
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.
Upper Critical - going high Ensure that the room temperature meets the requirements that are specified for the system. Ensure that no obstructions are blocking air flow to the system.
CPU Temp (0x64)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical - going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Lower Critical - going low
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.
CPU Func (0x4E)
  • IERR
  • Transition to Non-recoverable
  • Predictive Failure
Replace the system processor. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • Processor Disabled
  • Thermal Trip
  • FRB1 BIST Failure
  • FRB2 Hang In POST Failure
  • FRB3 Processor Startup Initialization Failure
  • Configuration Error
  • SMBIOS Uncorrectable CPU Complex Error
  • Terminator Presence Detected
  • Processor Automatically Throttled
  • Machine Check Exception
  • Correctable Machine Check Error
  • State Deasserted
  • Device Disabled
  • Transition to Critical from Less Severe
  • Transition to Non-recoverable from Less Severe
  • Transition to Critical from Non-recoverable
  • Processor Presence Detected
  • State Asserted
  • Device Enabled
  • Transition to OK
  • Transition to Non-Critical from OK
  • Transition to Non-Critical from More Severe
  • Monitor
  • Informational
No service action is required.
All PGood (0x1C)
  • Interlock Power Down
  • Power Off Power Down
  • Power Cycle
  • 240VA Power Down
No service action is required.
  • AC Lost
  • Soft Power Control Failure
  • Ensure that ac power is supplied to the rack.
  • Ensure that the system power cords are plugged tightly into both the power supply and the rack power distribution unit (PDU) for both system power supplies.
  • Ensure that the system was not powered off.
  • Power Unit Failure Detected
  • Predictive Failure
  • DIMM Func 0 (0x1E)
  • DIMM Func 1 (0x1F)
  • DIMM Func 2 (0x20)
  • DIMM Func 3 (0x21)
  • DIMM Func 4 (0x22)
  • DIMM Func 5 (0x23)
  • DIMM Func 6 (0x24)
  • DIMM Func 7 (0x25)
  • DIMM Func 8 (0x26)
  • DIMM Func 9 (0x27)
  • DIMM Func 10 (0x28)
  • DIMM Func 11 (0x29)
  • DIMM Func 12 (0x2A)
  • DIMM Func 13 (0x2B)
  • DIMM Func 14 (0x2C)
  • DIMM Func 15 (0x2D)
  • DIMM Func 16 (0x2E)
  • DIMM Func 17 (0x2F)
  • DIMM Func 18 (0x30)
  • DIMM Func 19 (0x31)
  • DIMM Func 20 (0x32)
  • DIMM Func 21 (0x33)
  • DIMM Func 22 (0x34)
  • DIMM Func 23 (0x35)
  • DIMM Func 24 (0x36)
  • DIMM Func 25 (0x37)
  • DIMM Func 26 (0x38)
  • DIMM Func 27 (0x39)
  • DIMM Func 28 (0x3A)
  • DIMM Func 29 (0x3B)
  • DIMM Func 30 (0x3C)
  • DIMM Func 31 (0x3D)
  • Memory Device Disabled
  • Uncorrectable Memory Error
  • Memory Scrub Failed
  • State Deasserted
  • Device Disabled
  • Transition to Critical from Less Severe
  • Transition to Non-recoverable from Less Severe
  • Transition to Critical from Non-recoverable
  • Correctable Memory Error
  • Parity
  • Correctable Memory Error Logging Limit Reached
  • Memory Automatically Throttled
  • Critical Over temperature
  • Presence Detected
  • Spare
  • State Asserted
  • Device Enabled
  • Transition to OK
  • Transition to Non-Critical from OK
  • Transition to Non-Critical from More Severe
  • Monitor
  • Informational
No service action is required.
  • Transition to Non-recoverable
  • Predictive Failure
If the sensor name is DIMM Func 0, replace DIMM 0. If the sensor name is DIMM Func 1, replace DIMM 1. And so on. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • DIMM Func 0 (0x1E)
  • DIMM Func 1 (0x1F)
  • DIMM Func 2 (0x20)
  • DIMM Func 3 (0x21)
  • DIMM Func 4 (0x22)
  • DIMM Func 5 (0x23)
  • DIMM Func 6 (0x24)
  • DIMM Func 7 (0x25)
  • DIMM Func 8 (0x26)
  • DIMM Func 9 (0x27)
  • DIMM Func 10 (0x28)
  • DIMM Func 11 (0x29)
  • DIMM Func 12 (0x2A)
  • DIMM Func 13 (0x2B)
  • DIMM Func 14 (0x2C)
  • DIMM Func 15 (0x2D)
  • DIMM Func 16 (0x2E)
  • DIMM Func 17 (0x2F)
  • DIMM Func 18 (0x30)
  • DIMM Func 19 (0x31)
  • DIMM Func 20 (0x32)
  • DIMM Func 21 (0x33)
  • DIMM Func 22 (0x34)
  • DIMM Func 23 (0x35)
  • DIMM Func 24 (0x36)
  • DIMM Func 25 (0x37)
  • DIMM Func 26 (0x38)
  • DIMM Func 27 (0x39)
  • DIMM Func 28 (0x3A)
  • DIMM Func 29 (0x3B)
  • DIMM Func 30 (0x3C)
  • DIMM Func 31 (0x3D)
Configuration Error Complete the following steps:
  1. If the sensor name is DIMM Func 0, ensure that DIMM 0 is seated properly. If the sensor name is DIMM Func 1, ensure that DIMM 1 is seated properly. And so on.
  2. If you recently installed or replaced memory DIMMs, ensure that the DIMMs are plugged in the correct memory slots.
  3. If the sensor name is DIMM Func 0, replace DIMM 0. If the sensor name is DIMM Func 1, replace DIMM 1. And so on. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • CPU Core Func 1 (0x3E)
  • CPU Core Func 2 (0x3F)
  • CPU Core Func 3 (0x40)
  • CPU Core Func 4 (0x41)
  • CPU Core Func 5 (0x42)
  • CPU Core Func 6 (0x43)
  • CPU Core Func 7 (0x44)
  • CPU Core Func 8 (0x45)
  • CPU Core Func 9 (0x46)
  • CPU Core Func 10 (0x47)
  • CPU Core Func 11 (0x48)
  • CPU Core Func 12 (0x49)
  • IERR
  • Transition to Non-recoverable
  • Predictive Failure
Replace the system processor. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • Processor Disabled
  • FRB1 BIST Failure
  • FRB2 Hang In POST Failure
  • FRB3 Processor Startup Initialization Failure
  • Configuration Error
  • SMBIOS Uncorrectable CPU Complex Error
  • Terminator Presence Detected
  • Machine Check Exception
  • Correctable Machine Check Error
  • State Deasserted
  • Device Disabled
  • Transition to Critical from Less Severe
  • Transition to Non-recoverable from Less Severe
  • Transition to Critical from Non-recoverable
  • Thermal Trip
  • Processor Automatically Throttled
  • Processor Presence Detected
  • State Asserted
  • Device Enabled
  • Transition to OK
  • Transition to Non-Critical from OK
  • Transition to Non-Critical from More Severe
  • Monitor
  • Informational
No service action is required.
  • Membuf Func 0 (0x4A)
  • Membuf Func 1 (0x4B)
  • Membuf Func 2 (0x4C)
  • Membuf Func 3 (0x4D)
  • Uncorrectable Memory Error
  • Memory Device Disabled
  • State Deasserted
  • Device Disabled
  • Transition to Critical from Less Severe
  • Transition to Non-recoverable from Less Severe
  • Transition to Critical from Non-recoverable
  • Correctable Memory Error
  • Parity
  • Memory Scrub Failed
  • Correctable Memory Error Logging Limit Reached
  • Memory Automatically Throttled
  • Critical Over temperature
  • Presence Detected
  • Spare
  • State Asserted
  • Device Enabled
  • Transition to OK
  • Transition to Non-Critical from OK
  • Transition to Non-Critical from More Severe
  • Monitor
  • Informational
No service action is required.
  • Configuration Error
  • Transition to Non-recoverable
  • Predictive Failure
Replace the system backplane. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
Boot Count (0x50) None No service action is required.
Backplane Fault (0x51) State Deasserted No service action is required.
State Asserted Replace the system backplane. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
System Event (0x52) Undetermined system hardware failure Go to Collecting diagnostic data. Then, go to Contacting IBM service and support.
  • System Reconfigured
  • OEM System boot event
  • Entry added to auxiliary log
  • PEF Action
  • Timestamp Clock Sync
  • Transition State Active
  • Transition State Idle
  • Transition State Busy
No service action is required.
Activate Pwr Lt (0x53) None No service action is required.
  • Ref Clock Fault (0x54)
  • PCI Clock Fault (0x55)
  • State Deasserted
  • State Asserted
No service action is required.
  • DIMM Temp 0 (0x69)
  • DIMM Temp 1 (0x6A)
  • DIMM Temp 2 (0x6B)
  • DIMM Temp 3 (0X6C)
  • DIMM Temp 4 (0x6D)
  • DIMM Temp 5 (0x6E)
  • DIMM Temp 6 (0x6F)
  • DIMM Temp 7 (0x70)
  • DIMM Temp 8 (0x71)
  • DIMM Temp 9 (0x72)
  • DIMM Temp 10 (0x73)
  • DIMM Temp 11 (0x74)
  • DIMM Temp 12 (0x75)
  • DIMM Temp 13 (0x76)
  • DIMM Temp 14 (0x77)
  • DIMM Temp 15 (0x78)
  • DIMM Temp 16 (0x79)
  • DIMM Temp 17 (0x7A)
  • DIMM Temp 18 (0x7B)
  • DIMM Temp 19 (0x7C)
  • DIMM Temp 20 (0x7D)
  • DIMM Temp 21 (0x7E)
  • DIMM Temp 22 (0x7F)
  • DIMM Temp 23 (0x80)
  • DIMM Temp 24 (0x81)
  • DIMM Temp 25 (0x82)
  • DIMM Temp 26 (0x83)
  • DIMM Temp 27 (0x84)
  • DIMM Temp 28 (0x85)
  • DIMM Temp 29 (0x86)
  • DIMM Temp 30 (0x87)
  • DIMM Temp 31 (0x88)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.
  • CPU Core Temp 1 (0x89)
  • CPU Core Temp 2 (0x8A)
  • CPU Core Temp 3 (0x8B)
  • CPU Core Temp 4 (0x8C)
  • CPU Core Temp 5 (0x8D)
  • CPU Core Temp 6 (0x8E)
  • CPU Core Temp 7 (0x8F)
  • CPU Core Temp 8 (0x90)
  • CPU Core Temp 9 (0x91)
  • CPU Core Temp 10 (0x92)
  • CPU Core Temp 11 (0x93)
  • CPU Core Temp 12 (0x94)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.
  • Mem Proc0 Pwr (0xA1)
  • Mem Proc1 Pwr (0xA2)
  • Mem Proc2 Pwr (0xA3)
  • Mem Proc3 Pwr (0xA4)
  • Proc0 Power (0xA5)
  • PCIE Proc0 Pwr (0xA6)
  • Fan Power A (0xA9)
  • Mem Cache Power (0xAC)
  • GPU Power (0xAD)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action required.
  • TOD Clock Fault (0x56)
  • APSS Fault (0x57)
  • State Deasserted
  • State Asserted
No service action is required.
PS Derating Fac (0x58) None No service action is required.
OS Boot (0x5A)
  • Installation aborted
  • Installation failed
Ensure that the operating system boot image is loaded. Ensure that the disk drive or solid-state drive is ready. Reload the operating system boot image.
  • A: boot completed
  • C: boot completed
  • PXE boot completed
  • Diagnostic boot completed
  • CD-ROM boot completed
  • ROM boot completed
  • Boot completed - device not specified
  • Installation started
  • Installation completed
No service action is required.
PCI (0x5B)
  • State Deasserted
  • State Asserted
No service action is required.
  • Membuf Temp 0 (0x65)
  • Membuf Temp 1 (0x66)
  • Membuf Temp 2 (0x67)
  • Membuf Temp 3 (0x68)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.
CPU Diode Sensor (0x0B)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.
Checkstop (0x0C) IERR If this event immediately precedes a system power off, no service action is required. Otherwise, search for SEL events that meet the following criteria:

If you found a SEL event that matches the criteria, perform the service action that is indicated in this table for the SEL event. Otherwise, go to Collecting diagnostic data. Then, go to Contacting IBM service and support.

  • Thermal Trip
  • Configuration Error
  • Processor Automatically Throttled
  • Correctable Machine Check Error
  • Processor Presence Detected
No service action is required.
  • FRB1 BIST Failure
  • FRB2 Hang In POST Failure
  • FRB3 Processor Startup Initialization Failure
  • SMBIOS Uncorrectable CPU Complex Error
  • Processor Disabled
  • Terminator Presence Detected
  • Machine Check Exception
Go to Collecting diagnostic data. Then, go to Contacting IBM service and support.
  • PSU Fault 1 (0x5D)
  • PSU Fault 2 (0x5E)
Power Supply Failure Detected An assert event immediately followed by a deassert event indicates that a power cycle of the system occurred. No service action is required. If there is no deassert event immediately following the assert event, replace the power supply. If the sensor name is PSU Fault 1, replace PSU 1. If the sensor name is PSU Fault 2, replace PSU 2. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • Predictive Failure
  • Power Supply Input Out of Range But Present
If the sensor name is PSU Fault 1, replace PSU 1. If the sensor name is PSU Fault 2, replace PSU 2. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • Power Supply Input Lost or AC DC
  • Power Supply Input Lost Or Out Of Range
Ensure that ac power is supplied to the rack. Ensure that the system power cords are plugged tightly into both the power supply and the rack PDU unit for both system power supplies. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
Configuration Error Ensure that both power supplies are securely seated in the system. Go to 8348-21C locations to identify the physical location and removal and replacement procedure.
  • Presence Detected
  • Power Supply Inactive
No service action is required.
BIOS Golden Side (0x5C) None Go to Resolving a system firmware boot failure and follow the service action for a system event log (SEL) with the value OEM record c0 and OEM c0 specific log information 3a1504xxxxxx.
BMC Golden Side (0x60) None Go to Resolving a system firmware boot failure and follow the service action for a system event log (SEL) with the value OEM record c0 and OEM c0 specific log information 3a1504xxxxxx.
  • Fan 1 (0xB3)
  • Fan 2 (0xB4)
  • Fan 3 (0xB5)
  • Fan 4 (0xB6)
  • Fan 5 (0xB7)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.
Quick power drop (0x0D)
  • IERR
  • Thermal Trip
  • FRB1 BIST Failure
  • FRB2 Hang In POST Failure
  • FRB3 Processor Startup Initialization Failure
  • Configuration Error
  • SMBIOS Uncorrectable CPU Complex Error
  • Processor Presence Detected
  • Processor Disabled
  • Terminator Presence Detected
  • Processor Automatically Throttled
  • Machine Check Exception
  • Correctable Machine Check Error
No service action is required.
  • IO A Power (0xA7)
  • IO B Power (0xA8)
  • Storage Power A (0xAA)
  • Storage Power B (0xAB)
  No service action is required.
CPU VDD Volt (0x0E)
  • Lower Non-critical – going low
  • Lower Non-critical – going high
  • Lower Critical – going low
  • Lower Critical – going high
  • Lower Non-recoverable – going low
  • Lower Non-recoverable – going high
  • Upper Non-critical – going low
  • Upper Non-critical – going high
  • Upper Critical - going low
  • Upper Critical - going high
  • Upper Non-recoverable – going low
  • Upper Non-recoverable – going high
No service action is required.



Last updated: Thu, December 02, 2021