IBM Support

Memory over temp logged after asserting processor IERR - IBM BladeCenter and System x

Troubleshooting


Problem

When a processor fails with an Internal ERRor (IERR) and becomes disabled, the user may see an over-temperature event reported on the memory bank for the processor. The Integrated Management Module (IMM) event log will show the following messages:

  • "Processor 1" has failed with IERR
  • "Processor 1" has been disabled
  • An Over-Temperature Condition has been detected on the "Memory Device 1 in Memory Module 1" on subsystem "System Memory"
  • An Over-Temperature Condition has been detected on the "Memory Device 8 in Memory Module 1" on subsystem "System Memory"
  • "Mem Card1 Bank" on subsystem "System Memory" throttled
  • An Over-Temperature Condition has been detected on the "Mem Card1 Bank" on subsystem "System Memory"

Resolving The Problem

Source

RETAIN tip: H204036

Symptom

When a processor fails with an Internal ERRor (IERR) and becomes disabled, the user may see an over-temperature event reported on the memory bank for the processor.

The Integrated Management Module (IMM) event log will show the following messages:

  • "Processor 1" has failed with IERR
  • "Processor 1" has been disabled
  • An Over-Temperature Condition has been detected on the "Memory Device 1 in Memory Module 1" on subsystem "System Memory"
  • An Over-Temperature Condition has been detected on the "Memory Device 8 in Memory Module 1" on subsystem "System Memory"
  • "Mem Card1 Bank" on subsystem "System Memory" throttled
  • An Over-Temperature Condition has been detected on the "Mem Card1 Bank" on subsystem "System Memory"

Affected configurations

The system may be any of the following IBM servers:

  • BladeCenter HX5, type 1909, any model
  • BladeCenter HX5, type 7872, any model
  • BladeCenter HX5, type 7873, any model
  • System x3690 X5, type 7147, any model
  • System x3690 X5, type 7148, any model
  • System x3690 X5, type 7149, any model
  • System x3690 X5, type 7192, any model
  • System x3850 X5, type 7143, any model
  • System x3850 X5, type 7145, any model
  • System x3850 X5, type 7146, any model
  • System x3850 X5, type 7191, any model
  • System x3950 X5, type 7143, any model
  • System x3950 X5, type 7145, any model

This tip is not software specific.

This tip is not option specific.

Solution

This behavior is corrected in IMM firmware Version 1.32 (Build ID: YUOOD4G) and later.
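To check an inventory of servers against the fixed level, the comparison can be sketched as below. This is a minimal illustration, not an IBM tool: the helper names `parse_version` and `has_fix` are hypothetical, and it assumes the installed IMM firmware level has already been read from the IMM and is a plain dotted string such as "1.32".

```python
# Hypothetical helpers; only the version number 1.32 comes from this tip.

def parse_version(text):
    """Turn a dotted version string such as '1.32' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


# First IMM firmware version that corrects the behavior, per this tip.
FIXED_VERSION = parse_version("1.32")


def has_fix(installed_version):
    """Return True if the installed IMM firmware is at or above the fixed level."""
    return parse_version(installed_version) >= FIXED_VERSION
```

Each dotted field is compared as a whole number, so "1.4" would rank below "1.32". That matches two-digit version strings such as "1.32", but it is an assumption about how versions are written.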

The file is available by selecting the appropriate Product Group, Product name, Product machine type, and Operating system on IBM Support's Fix Central web page.

Additional information

The event log messages for the over-temperature events are incorrect and may be safely ignored.
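On systems that cannot be updated right away, the guidance above could be applied when reviewing an exported event log. The sketch below is one possible way to do that, not an IBM utility. The function name `classify_events` and the labels are hypothetical, and it assumes the log is available as a list of message strings in chronological order. It also simplifies: every over-temperature message after the first IERR is labelled, without checking that the memory bank belongs to the failed processor.

```python
import re

# Message fragments taken from the event log text quoted in this tip.
IERR_PATTERN = re.compile(r"has failed with IERR")
OVER_TEMP_PATTERN = re.compile(r"Over-Temperature Condition has been\s*detected")


def classify_events(messages):
    """Label each message as 'ierr', 'likely-spurious', or 'review'.

    Assumption: messages are in chronological order. Over-temperature
    messages logged after a processor IERR are labelled 'likely-spurious';
    everything else is left for normal review.
    """
    ierr_seen = False
    labelled = []
    for message in messages:
        if IERR_PATTERN.search(message):
            ierr_seen = True
            labelled.append((message, "ierr"))
        elif ierr_seen and OVER_TEMP_PATTERN.search(message):
            labelled.append((message, "likely-spurious"))
        else:
            labelled.append((message, "review"))
    return labelled
```

Over-temperature messages logged with no preceding IERR are left as "review", since this tip covers only the events that follow a processor IERR.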

Document Location

Worldwide

Operating System

BladeCenter: Operating system independent / None

System x: Operating system independent / None


Document Information

Modified date:
30 January 2019

UID

ibm1MIGR-5088921