IBM Support

Processor machine check and how to identify CPU - IBM Servers

Troubleshooting


Problem

In single or multi-node xSeries 440 and xSeries 445 models, a user might see the following messages reported in the Remote Supervisor Adapter (RSA) event log:

SMI Hdlr Information Processor MachineCheck Data a Chassis=-- reporting bank:0000 reporting APICID:0060 Status:8C000109 20140136 CR4:00000000

SMI Hdlr Information Processor MachineCheck Data b Chassis=-- Address:0000000F FFF6D4C0 Timestamp:00035785 CD35AFB

Resolving The Problem

Source

RETAIN tip: H191645

Affected configurations

The system may be any of the following IBM servers:

  • xSeries 440, type 8687, any model
  • xSeries 445, type 8870, any model

This tip is not option specific.

This tip is not software specific.

Additional information

The identified CPU is functioning as designed.

The CPU is identified by the APIC ID number in the "data a" message. The APIC ID list below identifies the CPU affected.

The APIC ID has the form 00xx, where xx is one of the following values:

Primary or Single node:

00 CPU-1 Physical
01 CPU-1 Hyperthread

10 CPU-2 Physical
11 CPU-2 Hyperthread

02 CPU-3 Physical
03 CPU-3 Hyperthread

12 CPU-4 Physical
13 CPU-4 Hyperthread

20 CPU-5 Physical
21 CPU-5 Hyperthread

30 CPU-6 Physical
31 CPU-6 Hyperthread

22 CPU-7 Physical
23 CPU-7 Hyperthread

32 CPU-8 Physical
33 CPU-8 Hyperthread

Secondary node:

40 CPU-1 Physical
41 CPU-1 Hyperthread

50 CPU-2 Physical
51 CPU-2 Hyperthread

42 CPU-3 Physical
43 CPU-3 Hyperthread

52 CPU-4 Physical
53 CPU-4 Hyperthread

60 CPU-5 Physical
61 CPU-5 Hyperthread

70 CPU-6 Physical
71 CPU-6 Hyperthread

62 CPU-7 Physical
63 CPU-7 Hyperthread

72 CPU-8 Physical
73 CPU-8 Hyperthread

CPUs 1-4 are located on the bottom SMP board or SMP Expansion Module; CPUs 5-8 are located on the top SMP board or SMP Expansion Module.

The CPU numbers represent the physical CPU slots. Servers equipped with DP processors have CPUs installed in slots 1 and 4 only, on both the bottom and top SMP boards.
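
The APIC ID list above can be turned into a small lookup, which may be convenient when many event-log entries have to be checked. The following is a minimal sketch in Python, not an IBM tool: the names PHYSICAL_SLOTS, NODE_OFFSETS, APIC_TABLE and decode_apic_id are hypothetical, and the table is built only from the values listed in this tip (each Hyperthread ID is the Physical ID plus 1, and each secondary-node ID is the primary-node ID plus 40 hex).

```python
# Physical-thread APIC ID (low byte, hex) -> CPU slot, copied from the list in this tip.
PHYSICAL_SLOTS = {0x00: 1, 0x10: 2, 0x02: 3, 0x12: 4,
                  0x20: 5, 0x30: 6, 0x22: 7, 0x32: 8}

# In the list, secondary-node IDs equal the primary-node IDs plus 0x40.
NODE_OFFSETS = {0x00: "primary or single", 0x40: "secondary"}

# Expand to the full 32-entry table: (node, CPU slot, thread, SMP board).
APIC_TABLE = {}
for offset, node in NODE_OFFSETS.items():
    for base, slot in PHYSICAL_SLOTS.items():
        board = "bottom" if slot <= 4 else "top"  # CPUs 1-4 bottom, CPUs 5-8 top
        APIC_TABLE[offset + base] = (node, slot, "Physical", board)
        APIC_TABLE[offset + base + 1] = (node, slot, "Hyperthread", board)


def decode_apic_id(apic_id):
    """Decode a 4-hex-digit APICID string such as '0060'.

    Returns (node, CPU slot, thread, board), or None if the ID is not in the list.
    """
    return APIC_TABLE.get(int(apic_id, 16))


print(decode_apic_id("0060"))  # ('secondary', 5, 'Physical', 'top')
```

For the sample message in this tip (APICID:0060), the lookup points to CPU-5 on the top SMP board of the secondary node. An ID that does not appear in the list returns None rather than a guess.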

The messages point to a processor that experienced a correctable CPU machine check (the SMI handler records the event as "Information"). This occurs when the CPU detects an invalid instruction or address and is able to correct the problem without the CPU hanging or the server resetting.

Often, these messages can be ignored. However, if the messages recur frequently, they may indicate a potential issue with the CPU, and the CPU should be replaced.

SMI messages showing an "Error" indicate a failure with the CPU and require replacement of the part.
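
The triage described above can be sketched as a short script that scans exported RSA event-log text, counts "Information" machine checks per APIC ID, and sets aside any CPU reported with "Error". This is only an illustration under stated assumptions: the function summarize_machine_checks and the repeat_threshold parameter are hypothetical, this tip does not define how many repeats count as frequent, and the script assumes an "Error" entry uses the same layout as the "Information" sample with only the severity word changed.

```python
import re
from collections import Counter

# Matches the "Data a" message and captures the severity word and the APIC ID.
# Assumption: "Error" entries share the layout of the "Information" sample.
DATA_A = re.compile(
    r"SMI Hdlr\s+(?P<severity>\w+)\s+Processor MachineCheck Data a\b"
    r".*?reporting APICID:(?P<apic>[0-9A-Fa-f]{4})"
)


def summarize_machine_checks(lines, repeat_threshold=3):
    """Return (info_counts, watch, replace) for the given log lines.

    info_counts: Counter of "Information" events per APIC ID
    watch:       APIC IDs with at least repeat_threshold "Information" events
    replace:     APIC IDs that logged an "Error" event
    repeat_threshold is an illustrative value, not an IBM-specified limit.
    """
    info_counts = Counter()
    replace = set()
    for line in lines:
        match = DATA_A.search(line)
        if match is None:
            continue  # "Data b" lines and unrelated entries are skipped
        apic = match.group("apic").upper()
        if match.group("severity") == "Error":
            replace.add(apic)
        else:
            info_counts[apic] += 1
    watch = {a for a, n in info_counts.items() if n >= repeat_threshold} - replace
    return info_counts, watch, replace


sample = ("SMI Hdlr Information Processor MachineCheck Data a Chassis=-- "
          "reporting bank:0000 reporting APICID:0060 Status:8C000109 20140136 "
          "CR4:00000000")
counts, watch, replace = summarize_machine_checks([sample] * 3)
print(counts["0060"], sorted(watch), sorted(replace))
```

A single occurrence leaves both sets empty, which matches the guidance that isolated "Information" messages can often be ignored.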

Document Location

Worldwide

Operating System

Older System x:Operating system independent / None

Document Information

Modified date:
29 January 2019

UID

ibm1MIGR-5077883