Troubleshooting
Problem
[This abstract has been truncated due to length constraints] Light path diagnostic panel Light Emitting Diodes (LEDs) 'MIS' and 'MEM' are illuminated on Flex System x240 M5 Node. Light path diagnostic panel LEDs 'Check log LED' and 'System errorLED' are illuminated for NeXtScale and System x servers. If memory population rules are not followed, the server will log erroneous DIMM failed memory test error along with memory configuration errors: POST/UEFI Event [W.58007] - Unsupported DI MM population POST/UEFI Event [S.58008] - DIMM failed memory test POST/UEFI Event [W.50001] - DIMM disabled IMM Event [W.58007] - Invalid memory configuration (unsupported DIMM Population) detected. Please verify memory configuration is valid. IMM Event [S.58008] - A DIMM has failed the POST memory test. [*] where DIMM stands for Dual In-line Memory Module. [*] where MIS stands for mismatch occurred between the processor
Resolving The Problem
Source
RETAIN tip: H213780
Symptom
Light path diagnostic panel Light Emitting Diodes (LEDs) 'MIS' and 'MEM' are illuminated on Flex System x240 M5 Node.
Light path diagnostic panel LEDs 'Check log LED' and 'System error LED' are illuminated for NeXtScale and System x servers.
If memory population rules are not followed, the server will log erroneous DIMM failed memory test error along with memory configuration errors:
| POST/UEFI Event [W.58007] - Unsupported DIMM population POST/UEFI Event [S.58008] - DIMM failed memory test POST/UEFI Event [W.50001] - DIMM disabled IMM Event [W.58007] - Invalid memory configuration (unsupported DIMM Population) detected. Please verify memory configuration is valid. IMM Event [S.58008] - A DIMM has failed the POST memory test. |
[*] where DIMM stands for Dual In-line Memory Module.
[*] where MIS stands for mismatch occurred between the processors, DIMMs, or Hard Disk Drives (HDDs) within the configuration as reported by Power On Self-Test (POST).
[*] where MEM stands for a memory fault. The corresponding DIMM error LEDs on the system board will also be illuminated.
Affected configurations
The system may be any of the following Lenovo servers:
- Lenovo Flex System x240 M5 Compute Node, type 9532, any model,
any AC1
- Lenovo NeXtScale nx360 M5, type 5465, any model
- Lenovo NeXtScale nx360 M5, type 5467, any model
- Lenovo System x3500 M5, type 5464, any model
- Lenovo System x3550 M5, type 5463, any model
- Lenovo System x3650 M5, type 5462, any model
This tip is not software specific.
This tip is not option specific.
Solution
This behavior is corrected in 2Q2015 Unified Extensible Firmware Interface (UEFI) Flash Update as follows:
- NeXtScale nx360 M5, type 5465 Version 1.20 - Build D: THE108J
- Flex System x240 M5, type 9532 Version 1.10 - Build ID: C4E106J
- System x3500 M5, type 5464 Version 1.11 - Build ID: TAE106J
- System x3550 M5, type 5463 Version 1.10 - Build ID: TBE106K
- System x3650 M5, type 5462 Version 1.10 - Build ID: TCE106K
The erroneous DIMM failed memory test error tied to the event [S.58008] will no longer be logged when DIMM population rules are not followed.
The file will be available by selecting the appropriate Product Group, type of System, Product name, Product machine type, and Operating system on IBM Support's Fix Central web page, at the following URL:Â Â Â Â
Workaround
The DIMM memory scrub error can be safely ignored.
To resolve DIMM memory configuration error, follow memory population rules as documented in the individual server installation guidelines. Do not order new DIMMs for this error.
The supported DIMM population rules for each channel are as follows, where DIMM0 is the DIMM in the channel furthest away from the CPU:
|
DIMM0-DIMM1-DIMM2 SR - Single Rank DIMM (Registered Dual In-Line Memory Module) DR - Dual Rank DIMM (Registered Dual In-Line Memory Module) LR - LRDIMM (Load Reduced Dual-inline Memory Module) |
IMPORTANT: Mixing of RDIMM with LRDIMM in a system is not supported.
(where LRDIMM = Load Reduced Dual In-Line Memory Module)
Additional information
With the introduction of the Intel Xeon E5-2600 v3 processor architecture, Intel memory reference code (MRC) is enforcing DIMM population rules.
If a memory channel has an unsupported memory configuration, the DIMMs will be disabled in that channel and the system will continue to boot if additional memory is found in the other memory channels. The system will indicate a memory configuration error for the disabled channel, and the memory error LED will indicate the slot with the unsupported memory configuration.
DIMMs disabled due to memory configuration error should not be considered as bad and bereplaced with new parts. Instead, they should be reinserted into a supported memory configuration.
If no system memory can be found as a result of an unsupported memory configuration, the system will halt with a memory configuration error.
Document Location
Worldwide
Was this topic helpful?
Document Information
Modified date:
30 January 2019
UID
ibm1MIGR-5097081