IBM Support

IJ54979: TWICE THE FILE SIZE AMOUNT OF DATA IS SENT BETWEEN CLUSTERS

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • With afmFastCreate enabled, if the Create that tries to push the
    initial chunk of data fails to complete and gets requeued, then
    the requeued Create is replaying all data when it retries.And
    later there are a couple of Write messages that starting from
    offset where Create initially went inflight that is also played.
    Totaling to almost twice the amount of data of the file size to
    be replicated.
    

Local fix

  • Set a higher value of afmAsyncDelay to push replication as far
    as the file is being written.
    

Problem summary

  • With afmFastCreate enabled, if the Create that tries to push the
    initial chunk of data fails to complete and gets requeued, then
    the requeued Create is replaying all data when it retries.And
    later there are a couple of Write messages that starting from
    offset where Create initially went inflight that is also played.
    Totaling to almost twice the amount of data of the file size to
    be replicated.
    

Problem conclusion

  • This problem is fixed in 5.1.9.10
    To see all Spectrum Scale APARs and their respective
    Fix solutions refer to page: 
    https://public.dhe.ibm.com/storage/spectrumscale/spectrum_scale
    _apars.html
    
    Benefits of the solution:
    Check if the file is open anywhere (in line with Object), for
    afmFastCreate - Create messages. If file is open - don't 
    replicate until closed. This also helps to gain a bit of
    performance with afmFastCreate.
    
    Work around:
    Set a higher value of afmAsyncDelay to push replication as far
    as the file is being written.
    
    Problem trigger:
    afmFastCreate replication failing initially because of lock or
    network error and later replication being tried again.
    
    Symptom:
    Unexpected Behaviour
    
    Platforms affected:
    All Linux OS Environments (AFM Gateway nodes)
    
    Functional Area affected:
    AFM
    
    Customer Impact:
    High Importance
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ54979

  • Reported component name

    SPEC SCALE STD

  • Reported component ID

    5737F33AP

  • Reported release

    519

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2025-06-05

  • Closed date

    2025-06-10

  • Last modified date

    2025-06-10

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE STD

  • Fixed component ID

    5737F33AP

Applicable component levels

[{"Business Unit":{"code":"BU048","label":"IBM Software"},"Product":{"code":"STXKQY"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"519","Line of Business":{"code":"LOB69","label":"Storage TPS"}}]

Document Information

Modified date:
10 June 2025