Troubleshooting
Problem
CPU soft lockups reported in Puredata System for Transactions (PDTX) if a compute node has been up for over 208 days
Symptom
The following message appears in /var/log/messages:
Aug 8 01:57:18 node01 kernel: BUG: soft lockup - CPU#5 stuck for 4278190091s! [ca-server:1867]
Diagnosing The Problem
Use the uptime command to check how long the system has been up. If the uptime is over 208 days then you are likely hitting Redhat defect BZ#781974.
18:27:42 up 209 days, 15:10, 9 users, load average: 10.82, 10.15, 9.37
This is described in the following Redhat article.
https://access.redhat.com/solutions/68466
Resolving The Problem
This defect is fixed in kernel 2.6.32-220.4.2.el6 and later.
https://rhn.redhat.com/errata/RHBA-2012-0124.html
From a PDTX perspective, you may install kernel 2.6.32-220.13.1.el6 to resolve the issue.
[{"Product":{"code":"SSNVAT","label":"PureData System for Transactions"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Component":"T1500","Platform":[{"code":"PF016","label":"Linux"}],"Version":"1.0","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]
Was this topic helpful?
Document Information
Modified date:
16 June 2018
UID
swg21965359