IBM Support

Memory uncorrectable Error Correction Code (ECC) error - IBM System x

Troubleshooting


Problem

When an IBM System x3200 M3 system has two (2) Dual Inline Memory Modules (DIMMs) installed, it can have an uncorrectable Error Correction Code (ECC) error. This error is intermittent.

Resolving The Problem

Source

RETAIN tip: H206123

Symptom

When an IBM System x3200 M3 system has two Dual Inline Memory Modules (DIMMs) installed, it can have an uncorrectable Error Correction Code (ECC) error.

This error is intermittent.

Affected configurations

The system may be any of the following IBM servers:

  • System x3200 M3, type 7327, any model
  • System x3200 M3, type 7328, any model
  • System x3250 M3, type 4251, any model
  • System x3250 M3, type 4252, any model
  • System x3250 M3, type 4261, any model

This tip is not software specific.

This tip is not option specific.

Workaround

  1. Power off the system.
  2. Reseat the failure alerted Dual In-Line Memory Module (DIMM) into the same slot.
  3. Reboot the system.

Additional information

x3200 M3 installed with two DIMMs, in slot-1 and slot-4 may intermittently incur memory uncorrectable errors.

Re-enable DIMMs is not able to clear this symptom.

Reseating the DIMMs resolves the error in all reported cases.

When slot-1 memory uncorrectable ECC error happens, slot-2 (empty slot) in the same memory channel will also have error message with "Assertion: Memory Device Disabled".

When slot-4 memory uncorrectable ECC error happened, slot-5 (empty slot) in the same memory channel will also have error message with "Assertion: Memory Device Disabled".

Please refer to the following attachment:

 

ftp://ftp.software.ibm.com/systems/support/system_x_pdf/cogent_26446_x3200_m3_dsa_log_1.pdf

The error messages represent in Dynamic System Analysis (DSA) log System Overview:

  Memory device (replaceable memory devices, e.g. DIMM/SIMM) (Memory - DIMM 1): Assertion: Uncorrectable ECC / other uncorrectable memory error.

Memory device (replaceable memory devices, e.g. DIMM/SIMM) (Memory - DIMM 2): Assertion: Memory Device Disabled.

or

Memory device (replaceable memory devices, e.g. DIMM/SIMM) (Memory - DIMM 4): Assertion: Uncorrectable ECC / other uncorrectable memory error.

Memory device (replaceable memory devices, e.g. DIMM/SIMM) (Memory - DIMM 5): Assertion: Memory Device Disabled.

Document Location

Worldwide

Operating System

System x:Operating system independent / None

[{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU04FAH","label":"System x->System x3200 M3->7328"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU04IMI","label":"System x->System x3200 M3->7327"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU04INM","label":"System x->System x3250 M3->4251"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU04IOF","label":"System x->System x3250 M3->4252"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}},{"Type":"HW","Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"QU04IPR","label":"System x->System x3250 M3->4261"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
30 January 2019

UID

ibm1MIGR-5090943