IBM Support

CPU soft lockups reported in Puredata System for Transactions (PDTX) if a compute node has been up for over 208 days

Troubleshooting


Problem

CPU soft lockups reported in Puredata System for Transactions (PDTX) if a compute node has been up for over 208 days

Symptom

The following message appears in /var/log/messages:


Aug 8 01:57:18 node01 kernel: BUG: soft lockup - CPU#5 stuck for 4278190091s! [ca-server:1867]

Diagnosing The Problem

Use the uptime command to check how long the system has been up. If the uptime is over 208 days then you are likely hitting Redhat defect BZ#781974.


 18:27:42 up 209 days, 15:10,  9 users,  load average: 10.82, 10.15, 9.37

This is described in the following Redhat article.


https://access.redhat.com/solutions/68466

Resolving The Problem

This defect is fixed in kernel 2.6.32-220.4.2.el6 and later.

https://rhn.redhat.com/errata/RHBA-2012-0124.html

From a PDTX perspective, you may install kernel 2.6.32-220.13.1.el6 to resolve the issue.

[{"Product":{"code":"SSNVAT","label":"PureData System for Transactions"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Component":"T1500","Platform":[{"code":"PF016","label":"Linux"}],"Version":"1.0","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]

Document Information

Modified date:
16 June 2018

UID

swg21965359