IBM Support

IJ44678: AFM: RESYNC TRIGGERED AFTER REPLICATION FAILURE WITH REMOTE ERR

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • AFM:  resync triggered after replication failure with
    remote error 2
    
    Error Description:
    A replication operation fails with remote error 2, which
    triggers a resync.
    
    You might see below in mmfs.log:
    
    2022-12-05_20:26:15.116+1100: [E] AFM: Link file system
    fs1 fileset fileset1 file IDs
    [792665531.801069107.-1.-1,N] name DNAHD827_R1.fastq.gz
    remote error 2
    2022-12-05_20:26:15.119+1100: [E] AFM: File system fs1
    fileset fileset1 encountered an error synchronizing with
    the remote cluster. Cannot synchronize with the remote
    cluster until AFM recovery is executed.
    2022-12-05_20:26:15.119+1100: [I] Calling user exit
    script mmAfmQueueDropped: event afmQueueDropped, Async
    command /usr/lpp/mmfs/bin/mmsysmonc, filesystem fs1,
    fileset fileset1.
    2022-12-05_20:26:15.125+1100: [I] Calling user exit
    script mmAfmRecoveryStart: event afmRecoveryStart, Async
    command /usr/lpp/mmfs/bin/mmsysmonc, filesystem fs1,
    fileset fileset1.
    2022-12-05_20:34:44.053+1100: mmafmctl: [I] Performing
    resync of fileset: fileset1
    2022-12-06_06:54:42.378+1100: [I] Calling user exit
    script mmAfmRecoveryEnd: event afmRecoveryEnd, Async
    command /usr/lpp/mmfs/bin/mmsysmonc, filesystem fs1,
    fileset fileset1.
    
    Reported in:
    Spectrum Scale 5.1.4.1 on RHEL7
    
    Known Impact:
    Performance impact during the long run AFM resync.
    
    Verification steps:
    
    Recovery action:
    N/A
    

Local fix

  • N/A
    

Problem summary

  • Remote error 2 while replicating Link operation if
    parent directory is deleted before replicating create/link
    operation.
    

Problem conclusion

  • This problem is fixed in 5.1.6.1
    To see all Spectrum Scale APARs and their respective
    Fix solutions refer to page:
    https://public.dhe.ibm.com/storage/spectrumscale/spectrum_scale_
    apars.html
    
    Benefits of the solution:
    Drop write operation in case of using Fast Create if file is
    already deleted.
    
    Work around:
    None
    
    Problem trigger:
    Create/Link/Parent dir remove operation in queue with Fast
    Create
    config option enabled.
    
    Symptom:
    AFM Queue drop and Fileset goes to resync state.
    
    Platforms affected:
    All Linux OS environments
    
    Functional Area affected:
     AFM
    
    Customer Impact:
    High Importance
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ44678

  • Reported component name

    SPEC SCALE ADV

  • Reported component ID

    5737F35AP

  • Reported release

    514

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2022-12-20

  • Closed date

    2023-01-17

  • Last modified date

    2023-01-17

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SPEC SCALE ADV

  • Fixed component ID

    5737F35AP

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"STXKQY"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"514","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
17 January 2023