APAR status
Closed as program error.
Error description
A hard lockup between two pemsmod kernel threads can panic the kernel. A kernel panic causes system downtime and may lead to quorum loss for the customer. The stack trace in vmcore-dmesg.txt contains a line similar to the following: [88432.803601] CPU: 27 PID: 14563 Comm: pemsRollUpQueue Kdump: loaded Tainted: G
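To check whether a crash dump shows this signature, the process header line quoted above can be searched for in vmcore-dmesg.txt. The following is a minimal sketch, not an IBM-provided tool. The function name find_pems_headers and the "pems" name prefix are illustrative assumptions; the only thread name this APAR confirms is pemsRollUpQueue. A match shows only that a pems thread was running on the reported CPU and is not by itself proof of this defect.

```python
import re

# Matches the dmesg process header line quoted in this APAR, for example:
# "[88432.803601] CPU: 27 PID: 14563 Comm: pemsRollUpQueue Kdump: loaded Tainted: G"
HEADER = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+CPU:\s+(?P<cpu>\d+)\s+"
    r"PID:\s+(?P<pid>\d+)\s+Comm:\s+(?P<comm>\S+)"
)


def find_pems_headers(text, prefix="pems"):
    """Return (timestamp, cpu, pid, comm) for each header line whose
    Comm field starts with `prefix`.

    The default prefix is an assumption for illustration; adjust it to
    the thread names actually seen in your dump.
    """
    hits = []
    for line in text.splitlines():
        match = HEADER.search(line)
        if match and match.group("comm").startswith(prefix):
            hits.append(
                (
                    float(match.group("ts")),
                    int(match.group("cpu")),
                    int(match.group("pid")),
                    match.group("comm"),
                )
            )
    return hits
```

To use it, read the contents of vmcore-dmesg.txt into a string and pass that string to find_pems_headers; each returned tuple identifies one matching header line.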
Local fix
Problem summary
A hard lockup between two pemsmod kernel threads can panic the kernel. A kernel panic causes system downtime and may lead to quorum loss for the customer. The stack trace in vmcore-dmesg.txt contains a line similar to the following: [88432.803601] CPU: 27 PID: 14563 Comm: pemsRollUpQueue Kdump: loaded Tainted: G
Problem conclusion
This problem is fixed in 5.0.5 PTF 10.
Benefits of the solution: Avoids system downtime and quorum loss.
To see all Spectrum Scale APARs and their respective fix solutions, refer to https://public.dhe.ibm.com/storage/spectrumscale/spectrum_scale_apars.html
Work around: None
Problem trigger: A system running a heavy I/O workload can hit this issue.
Symptom: Kernel crash
Platforms affected: x86_64-linux only
Functional Area affected: ESS/GNR
Customer Impact: Critical
Temporary fix
Comments
APAR Information
APAR number
IJ34813
Reported component name
SPEC SCALE STD
Reported component ID
5737F33AP
Reported release
505
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt / Xsystem
Submitted date
2021-09-07
Closed date
2021-09-07
Last modified date
2021-09-07
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
SPEC SCALE STD
Fixed component ID
5737F33AP
Applicable component levels
Document Information
Modified date:
08 September 2021