Troubleshooting
Problem
On iDataPlex dx360 M2 Servers, the system takes one minute to reset after a runtime Multi-Bit Error (MBE) occurs. During that minute the system appears hung with the PCI LED and DIMM LED off. It then reboots with both LEDs on and logs the memory error events listed under Symptom.
Resolving The Problem
Source
RETAIN tip: H136531
Symptom
On iDataPlex dx360 M2 Servers, the system takes one minute to reset after a runtime Multi-Bit Error (MBE) occurs.
The user will see the following error messages in this event:
| Failure, system hung w/o reset when MBE triggered. |
| System hung |
| PCI LED: Off |
| DIMM LED: Off |
After one minute, the system will reboot.
While the system is rebooting, the following light code will appear:
| PCI LED: On |
| DIMM LED: On |
The following will be logged:
| POST Event Viewer: |
| [W.50001] DIMM Disabled |
| IMM System Event Log: |
| D5h, 01 FF FF - Uncorrectable ECC |
| D5h, 04 FF FF - Memory Device Disabled |
| Firmware Aux Log: |
| [S.51003] Fatal Memory Error Occurred * 2 |
| [W.50001] DIMM Disabled * 3 |
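The logged entries above can be treated as a signature for this condition. The following is a minimal sketch, assuming the three logs have been exported as plain-text lines. The export format and the function names `match_mbe_signature` and `is_mbe_reset` are hypothetical and not part of any IBM tool. Only the event codes and descriptions come from this tip.

```python
import re

# Event codes copied from this tip. The plain-text export format
# of the logs is an assumption made for illustration.
EXPECTED_EVENTS = {
    "W.50001": "DIMM Disabled",
    "S.51003": "Fatal Memory Error Occurred",
}

# IMM System Event Log entries: "D5h, 01 FF FF" (Uncorrectable ECC)
# and "D5h, 04 FF FF" (Memory Device Disabled).
SEL_PATTERN = re.compile(r"D5h,\s*(01|04) FF FF")


def match_mbe_signature(lines):
    """Return the set of signature markers found in the given log lines."""
    found = set()
    for line in lines:
        for code in EXPECTED_EVENTS:
            if f"[{code}]" in line:
                found.add(code)
        match = SEL_PATTERN.search(line)
        if match:
            found.add(f"D5h,{match.group(1)}")
    return found


def is_mbe_reset(lines):
    """True only if every marker listed in this tip is present."""
    required = {"W.50001", "S.51003", "D5h,01", "D5h,04"}
    return required <= match_mbe_signature(lines)
```

If `is_mbe_reset` returns `True` for the combined log lines, the entries are consistent with the behavior described here. It does not prove that no other fault is present.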
Affected configurations
The system may be any of the following IBM Servers:
- System x iDataPlex dx360 M2 Server, type 6380, any model
- System x iDataPlex dx360 M2 Server, type 7321, any model
- System x iDataPlex dx360 M2 Server, type 7323, any model
This tip is not software specific.
This tip is not option specific.
Additional information
This is working as designed.
When an MBE occurs, the Integrated Management Module (IMM) starts a one-minute timer. The timer allows time for the System Management Interrupt (SMI) handler to issue a Non-Maskable Interrupt (NMI) to the server and de-assert the I/O Hub (IOH) error signal.
The system reboots when the one-minute timer expires.
The reboot is therefore not immediate: the user must wait one minute after the MBE.
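The timing described above can be summarized as a small model of what the user should observe. This is a sketch of the visible behavior only, not of the IMM firmware. The function name `expected_state` and the dictionary layout are hypothetical. The 60-second delay and the LED states are taken from this tip.

```python
RESET_DELAY_SECONDS = 60  # one-minute IMM timer described in this tip


def expected_state(seconds_since_mbe):
    """Return the state a user should expect at a given time after the MBE."""
    if seconds_since_mbe < 0:
        raise ValueError("time since MBE cannot be negative")
    if seconds_since_mbe < RESET_DELAY_SECONDS:
        # Timer still running: the system appears hung, both LEDs off.
        return {"system": "hung", "pci_led": "off", "dimm_led": "off"}
    # Timer expired: the system is rebooting, both LEDs on.
    return {"system": "rebooting", "pci_led": "on", "dimm_led": "on"}
```

For example, `expected_state(30)` reports a hung system with both LEDs off, which is the designed behavior and not an additional failure.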
Document Location
Worldwide
Document Information
Modified date:
29 January 2019
UID
ibm1MIGR-5080304