IBM Support

IJ21396: NMI WATCHDOG: BUG: SOFT LOCKUP - CPU STUCK [NFSD]

APAR status

  • Closed as program error.

Error description

  • On RHEL 7 nodes (kernels earlier than Linux v3.18), in the GPFS
    kernel NFS support environment, GPFS may try to acquire a mutex
    while holding an inode spin lock, which the kernel NMI watchdog
    may detect as a soft lockup.
    

Local fix

Problem summary

  • On RHEL 7 nodes (kernels earlier than Linux v3.18), in the GPFS
    kernel NFS support environment, GPFS may try to acquire a mutex
    while holding an inode spin lock, which the kernel NMI watchdog
    may detect as a soft lockup.
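
    As a rough aid for the kernel condition named above, the following
    minimal sketch checks whether a kernel release string is earlier
    than 3.18. The function names (kernel_version, is_pre_3_18) are
    hypothetical and not part of any IBM tool. The check covers only
    the kernel version; it does not tell you whether the node uses
    KNFS/CNFS or which Spectrum Scale level is installed.

```python
import platform
import re

# Threshold taken from the APAR text ("pre-Linux kernel v3.18").
AFFECTED_BELOW = (3, 18)


def kernel_version(release):
    """Parse (major, minor) from a release string like '3.10.0-1062.el7.x86_64'."""
    match = re.match(r"(\d+)\.(\d+)", release)
    if match is None:
        raise ValueError(f"unrecognized kernel release: {release!r}")
    return int(match.group(1)), int(match.group(2))


def is_pre_3_18(release):
    """Return True if the kernel release is earlier than 3.18."""
    return kernel_version(release) < AFFECTED_BELOW


if __name__ == "__main__":
    # platform.release() reports the running kernel, like `uname -r`.
    release = platform.release()
    label = "earlier than 3.18" if is_pre_3_18(release) else "3.18 or later"
    print(f"{release}: {label}")
```

    Run on each NFS server node; a result of "earlier than 3.18" only
    means the kernel condition in this APAR is met, not that the node
    has hit the problem.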
    

Problem conclusion

  • Benefits of the solution:
    
    Avoids stuck CPUs and the resulting performance impact
    
    Work around:
    
    None
    
    Problem trigger:
    
    GPFS violates the spin lock holding policy in the NFS support
    environment by trying to acquire a mutex while holding an inode
    spin lock
    
    Symptom:
    
    Performance Impact/CPU stuck
    
    Platforms affected:
    
    All RHEL 7.x
    
    Functional Area affected:
    
    Users of KNFS/CNFS only
    
    Customer Impact:
    
    High Importance
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ21396

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    504

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2019-12-04

  • Closed date

    2020-02-03

  • Last modified date

    2020-02-03

  • APAR is sysrouted FROM one or more of the following:

    IJ21127

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"STXKQY","label":"IBM Spectrum Scale"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"504","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
03 February 2020