IBM Support

IJ28314: DECLUSTERED ARRAY STUCK IN-TRANSITION WITH LONG WAITERS.

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • In a distributed Spectrum Scale environment in
    the presence of repetitive node failures can result in
    the declustered array becoming stuck in-transition
    with long waiters. Long waiters may occur, and file
    system operations may become stalled.
    

Local fix

  • Restart the daemon
    

Problem summary

  • In a distributed Spectrum Scale environment in
    the presence of repetitive node failures can result in
    the declustered array becoming stuck in-transition
    with long waiters. Long waiters may occur, and file
    system operations may become stalled.
    

Problem conclusion

  • Benefits of the solution:
    No more deadlocks
    
    Work around:
    Restart the daemon
    
    Problem trigger:
    Occurs in Spectrum Scale RAID configurations involving
    multiple log groups, such as Spectrum Scale Erasure
    Code Edition or ESS 3000, that experience repetitive node
    shutdowns or network partitioning affecting a small subset
    of nodes in the cluster.
    
    Symptom:
    Hang/Deadlock/Unresponsiveness/Long Waiters
    
    Platforms affected:
    Linux Only
    
    Functional Area affected:  ESS/GNR
    
    Customer Impact: Suggested
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ28314

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    505

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2020-09-28

  • Closed date

    2020-09-28

  • Last modified date

    2020-09-28

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"STXKQY","label":"IBM Spectrum Scale"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"505","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
29 September 2020