Troubleshooting
Problem
In single or multi-node xSeries 440 and xSeries 445 models, a user might see the following messages reported in the Remote Supervisor Adapter (RSA) event log: SMI Hdlr Information Processor MachineCheck Data a Chassis=-- reporting bank:0000 reporting APICID:0060 Status:8C000109 20140136 CR4:00000000 SMI Hdlr Information Processor MachineCheck Data b Chassis=-- Address:0000000F FFF6D4C0 Timestamp:00035785 CD35AFB
Resolving The Problem
Source
RETAIN tip: H191645
Issue
In single or multi-node xSeries 440 and xSeries 445 models, a user might see the following messages reported in the Remote Supervisor Adapter (RSA) event log:
| SMI Hdlr Information Processor MachineCheck Data a Chassis=-- reporting bank:0000 reporting APICID:0060 Status:8C000109 20140136 CR4:00000000 SMI Hdlr Information Processor MachineCheck Data b Chassis=-- Address:0000000F FFF6D4C0 Timestamp:00035785 CD35AFB |
Affected configurations
The system may be any of the following IBM servers:
- xSeries 440, type 8687, any model
- xSeries 445, type 8870, any model
This tip is not option specific.
This tip is not software specific.
Additional information
The identified CPU is functioning as designed.
The CPU is identified by the APIC ID number in the "data a" message. The APIC ID list below identifies the CPU affected.
APIC ID 00xx, where xx = the following:
|
Primary or Single node:
Secondary node:
|
CPUs 1-4 are located on the bottom SMP board or SMP Expansion Module, CPUs 5-8 are located on the top SMP board or SMP Expansion Module.
The CPU numbers represent the physical CPU slot. DP processor equipped servers will have CPUs installed in slots 1 and 4 only on both bottom and top SMP boards.
The messages point to a processor that experienced a correctable CPU machine check (the SMI handler records the event as "Information"). This occurs when the CPU detects an invalid instruction or address and is able to correct the problem without causing the CPU to hang or reset the server.
Often, these messages can be ignored. However, if the messages occur multiple times and frequently, it may indicate a potential issue with the CPU and the CPU should be replaced.
SMI messages showing an "Error" indicate a failure with the CPU and require replacement of the part.
Document Location
Worldwide
Was this topic helpful?
Document Information
Modified date:
29 January 2019
UID
ibm1MIGR-5077883