IBM Support

Voltage error and CPU over temperature causes system shutdown - IBM System x3755 M3 (type 7164)

Troubleshooting


Problem

The IBM System x3755 M3, Type 7164 might encounter intermittently multiple events of voltage fault or Central Processing Unit (CPU) high temperature with a specific value equal to the raw value '0xFF' in an Intelligent Platform Management Interface (IPMI) Event, such as (See Symptom section for a large sample event log).

Resolving The Problem

Source

RETAIN tip: H207632

Symptom

The IBM System x3755 M3, Type 7164 might encounter intermittently multiple events of voltage fault or Central Processing Unit (CPU) high temperature with a specific value equal to the raw value '0xFF' in an Intelligent Platform Management Interface (IPMI) Event, such as:

  - 1.2 Volt
# 1 - 10/28/2012 04:07:38
System board 1 (Voltage - ): Assertion: Upper Critical - going high.
Trigger threshold value: 1.26 Volts.
Trigger reading: 1.99 Volts.

# 2 - 10/28/2012 04:07:40
System board 1 (Voltage - ): Assertion: Upper Non-recoverable - going high.
Trigger threshold value: 1.32 Volts.
Trigger reading: 1.99 Volts.

# 3 - 10/28/2012 04:07:47
Power unit / power domain 3 (Power Unit - HOST_POWER): Assertion: Power Off / Power Down.

# Note: 1.99 Volt is the raw value 0xFF of 1.2 Volt

- 3.3 Volt
# 1 - 10/12/2012 23:12:52
System board 1 (Voltage - ): Assertion: Upper Critical - going high.
Trigger threshold value: 3.46 Volts.
Trigger reading: 4.39 Volts.

# 2 - 10/12/2012 23:12:53
System board 1 (Voltage - ): Assertion: Upper Non-recoverable - going high.
Trigger threshold value: 3.63 Volts.
Trigger reading: 4.39 Volts.

# 3 - 10/12/2012 23:13:01
Power unit / power domain 3 (Power Unit - HOST_POWER): Assertion: Power Off / Power Down.

# Note: 4.39 Volt is the raw value 0xFF of 1.2 Volt

- CPU 4 Temp
# 1 - 10/13/2012 01:43:20
Processor 4 (Temperature - n): Assertion: Upper Non-critical - going high.
Trigger threshold value: 75.00 degrees C.
Trigger reading: 255.00 degrees C.

# 2 - 10/13/2012 01:43:22
Processor 4 (Temperature - n): Assertion: Upper Critical - going high.
Trigger threshold value: 80.00 degrees C.
Trigger reading: 255.00 degrees C

# 3 - 10/13/2012 01:43:24
Processor 4 (Temperature - n): Assertion: Upper Non-recoverable - going high.
Trigger threshold value: 85.00 degrees C.
Trigger reading: 255.00 degrees C.

# 4 - 10/13/2012 01:43:30
Power unit / power domain 3 (Power Unit - HOST_POWER): Assertion: Power Off / Power Down.

# Note: 255.00C is the raw value 0xFF of CPU temperature.

Affected Configurations

The system can be any of the following IBM servers:

  • System x3755 M3, type 7164, any model

This tip is not software specific.

This tip is not option specific.

The following system firmware levels are affected:

  • iBMC v1.02 Build ID: AYOO30J, and earlier versions

Solution

This behavior was corrected in Integrated Baseboard Management Controller (IBMC) Version 1.07, AYOO32Q, and later versions.

The file is available by selecting the appropriate Product Group, type of System, Product name, Product machine type, and Operating system on IBM Support's Fix Central web page, at the following URL:

  http://www.ibm.com/support/fixcentral/

Additional Information

The affected sensors are:

Sensor ID Sensor Description Value of 0xFF
0x00 P1_TMP 255 C
0x01 P2_TMP 255 C
0x02 P3_TMP 255 C
0x03 P4_TMP 255 C
0x32 VDD_12V_1 16.07 Volts
0x75 VDD_12V_2 16.07 Volts
0x76 VDD_12V_3 16.07 Volts
0x77 VDD_12V_4 16.07 Volts
0x78 VDD_12V_5 16.07 Volts
0x79 VDD_12V_6 16.07 Volts
0x7A VDD_12V_7 16.07 Volts
0x33 VDD_VLDT_P1 1.600 Volts
0x34 VDD_VLDT_P2 1.600 Volts
0x35 VDD_VLDT_P3 1.600 Volts
0x36 VDD_VLDT_P4 1.600 Volts
0x37 VDD_P1_VDDA 3.333 Volts
0x38 VDD_P2_VDDA 3.333 Volts
0x39 VDD_P3_VDDA 3.333 Volts
0x3A VDD_P4_VDDA 3.333 Volts
0x3E VDD_1.1_RUN 1.99 Volts
0x3D VDD_1.2_RUN 1.99 Volts
0X3C VDD_1.8_RUN 2.39 Volts
0x3B VDD_3.3_RUN 6.63 Volts
0x3F VDD_3.3_STBY 4.40 Volts
0x40 VDD_5_RUN 6.63 Volts
0x41 RTC_BATTERY 3.97 Volts

Document Location

Worldwide

Operating System

System x:Operating system independent / None

[{"Type":"HW","Business Unit":{"code":"BU016","label":"Multiple Vendor Support"},"Product":{"code":"QU90ABE","label":"System x->System x3755 M3->7164"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
30 January 2019

UID

ibm1MIGR-5092550