IBM Support

System resets one minute after runtime Multi-Bit Error (MBE) is trriggered - IBM iDataPlex dx360

Troubleshooting


Problem

On iDataPlex dx360 M2 Servers, the system will take one minute to reset after runtime a Multi-Bit Error (MBE) occurs. The user will see the following error messages in this event: Failure, system hung w/o reset when MBE triggered. System hung PCILED: Off DIMM LED: Off After one minute, the system will reboot. While the system is rebooting, the following light code will appear: PCI LED: On DIMM LED: On The following will be logged: POST Event Viewer: [W.50001] DIMM Disabled IMM System Event Log: D5h, 01 FF FF - Uncorrectable ECC D5h, 04 FF FF - Memory Device Disabled Firmware Aux Log: [S.51003] Fatal Memory Error Occurred * 2 [W.50001] DIMM Disabled * 3

Resolving The Problem

Source

RETAIN tip: H136531

Symptom

On iDataPlex dx360 M2 Servers, the system will take one minute to reset after runtime a Multi-Bit Error (MBE) occurs.

The user will see the following error messages in this event:

  Failure, system hung w/o reset when MBE triggered. System hung PCI LED: Off DIMM LED: Off

After one minute, the system will reboot.

While the system is rebooting, the following light code will appear:

  PCI LED: On DIMM LED: On

The following will be logged:

  POST Event Viewer:
[W.50001] DIMM Disabled

IMM System Event Log:
D5h, 01 FF FF - Uncorrectable ECC D5h, 04 FF FF - Memory Device
Disabled

Firmware Aux Log:
[S.51003] Fatal Memory Error Occurred * 2 [W.50001] DIMM
Disabled * 3

Affected configurations

The system may be any of the following IBM Servers:

  • System x iDataPlex dx360 M2 Server, type 6380, any model
  • System x iDataPlex dx360 M2 Server, type 7321, any model
  • System x iDataPlex dx360 M2 Server, type 7323, any model

This tip is not software specific.

This tip is not option specific.

Additional information

This is working as designed.

The Integrated Management Module (IMM) is designed to start a one-minute timer in the event of an MBE in order to allow time for the System Management Interrupt (SMI) handler to Non-Maskable Interrupt (NMI) the server and de-assert the Input Output Controller Hub (IOH) error signal.

The system will reboot after the one minute timer expires.

The user will not see the system reboot immediately and needs to wait for one minute.

Document Location

Worldwide

Operating System

System x:Operating system independent / None

[{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"HW31U","label":"System x->System x iDataPlex dx360 M2 server"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
29 January 2019

UID

ibm1MIGR-5080304