IBM Support

IJ30878: AFM GATEWAY CRASH DURING THE READ

APAR status

  • Closed as program error.

Error description

  • While a read on a file is in progress, the next read on
    the same file bypasses it if the gap is more than 64MB.
    This causes a problem if the original read is stuck in
    the kernel: if it is killed because of
    afmSyncOpWaitTimeout, the bypass read causes a kernel
    crash, because the original read frees the allocated
    mount structure. readbypass is an undocumented
    configuration option and is not persistent across daemon
    restarts. When the user restarts the daemon during the
    efix installation, this change is lost, so the user does
    not need to set it back to the default value.
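
    The bypass condition described above can be sketched as a small model. This is an illustration only, not the Spectrum Scale implementation. The function name should_bypass is hypothetical, "gap" is assumed to mean the distance between the offset of the in-progress read and the offset of the next read, and a negative threshold is assumed to mean "never bypass" (inferred from the -1 value used in the workaround).

```python
# Illustrative model only: names and semantics here are assumptions,
# not Spectrum Scale internals.
MIB = 1024 * 1024
DEFAULT_BYPASS_GAP = 64 * MIB  # the 64MB figure from the error description


def should_bypass(in_progress_offset, next_offset, threshold=DEFAULT_BYPASS_GAP):
    """Return True if the next read would bypass the in-progress read."""
    if threshold < 0:
        # Assumed meaning of "readbypass -1": bypass is turned off.
        return False
    gap = abs(next_offset - in_progress_offset)
    return gap > threshold
```

    Under these assumptions, a second read 65 MiB away from an in-progress read takes the bypass path, while the same read with the threshold set to -1 does not, which would be consistent with the workaround avoiding the crash.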
    

Local fix

  • Workaround to avoid the crash: mmfsadm afm readbypass -1
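
    Because the setting does not persist across daemon restarts, it may be convenient to wrap the workaround in a small helper that can be re-run on the gateway node. The following is a minimal sketch: the helper name apply_workaround and its dry_run parameter are hypothetical, and only the command itself comes from this APAR.

```python
import shutil
import subprocess

# Command taken verbatim from the local fix in this APAR.
WORKAROUND_CMD = ["mmfsadm", "afm", "readbypass", "-1"]


def apply_workaround(dry_run=True):
    """Return the workaround command; execute it only when dry_run is False."""
    cmd = list(WORKAROUND_CMD)
    if dry_run:
        return cmd
    if shutil.which(cmd[0]) is None:
        # mmfsadm is expected only on Spectrum Scale nodes.
        raise FileNotFoundError("mmfsadm not found on PATH; run this on the gateway node")
    subprocess.run(cmd, check=True)
    return cmd
```

    Calling apply_workaround() only returns the command for inspection. Calling apply_workaround(dry_run=False) on the gateway node runs it, and would need to be repeated after each daemon restart for as long as the fix is not installed.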
    

Problem summary

  • AFM gateway node crashes if the home is not
    responding and multiple threads are trying to
    read the same file.
    

Problem conclusion

  • Benefits of the solution:
    AFM gateway node no longer crashes
    
    Workaround:
    Use the undocumented configuration option
    "mmfsadm afm readbypass -1" on the gateway node.
    
    Problem trigger:
    Reading the same file from multiple threads
    when the home is not responding
    
    Symptom: Crash
    
    Platforms affected: All Linux OS environments
    
    Functional Area affected: AFM
    
    Customer Impact: HiPER
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ30878

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    505

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2021-02-12

  • Closed date

    2021-02-23

  • Last modified date

    2021-02-23

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

    IJ31044

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

  • Business Unit

    IBM Infrastructure w/TPS (BU058)

  • Product

    IBM Spectrum Scale (STXKQY)

  • Platform

    Platform Independent (PF025)

  • Version

    505

  • Line of Business

    Storage (LOB26)

Document Information

Modified date:
26 February 2021