IBM Support

IT29570: REPLICATION NOT CLEANING UP ORPHAN SNAPSHOTS CAUSING THE VSNAP TO NOT FREE SPACE

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • IBM Spectrum Protect Plus does not clean up orphaned
    snapshot entries causing the vSnap to run out space.
      The following message is seen in the joblog:
    
        INFO pid-6065 vsnap.zfs  Checking for orphaned snapshots
        WARNING pid-6065 vsnap.zfs  Snapshot 187 found on disk but
          not in DB, deleting snapshot
    
      Several hours later the same message is still being presented
      indicating the orphan was not deleted. The vsnap-maint process
      gets into a hung state. This can occur on a busy system when
      large snapshots can take hours to delete.
    
    IBM Spectrum Protect Plus Versions Affected:
      10.1.2 and 10.1.3
    Customer/L2 Diagnostics (If Applicable)
      N/A
    
    Initial Impact:
      Medium
    Additional Keywords:
      TS002345154 SPP RECLAIM HUNG STUCK FULL OUT OF SPACE
    

Local fix

  •   On the vSnap where the snapshots are not being deleted run
      the following commands:
    
       sudo sqlite3 /etc/vsnap/maint.db "delete from maint_session;"
       sudo systemctl restart vsnap-maint
    
      These commands force the queue of pending deletions to be
      purged and re-created from scratch. After running these
      commands, wait for a few minutes and run the command:
    
        vsnap maint show
    
      This allows you to monitor the queue and see if it shows any
      active/completed deletions.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * IBM Spectrum Protect Plus level 10.1.3 and 10.1.4            *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See ERROR DESCRIPTION                                        *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available. This problem is currently *
    * projected to be fixed in IBM Spectrum Protect Plus level     *
    * 10.1.4.179 and 10.1.5. Note that this is subject to change   *
    * at the discretion of IBM.                                    *
    ****************************************************************
    

Problem conclusion

  • The vSnap maintenance service that is responsible for cleaning
    up older snapshots had incorrect logic that caused the service
    to get stuck if a particular snapshot took longer than 6 hours
    to clean up. This resulted in all subsequent snapshot deletions
    to remain stuck in the queue. The problem has been addressed by
    fixing the incorrect logic to ensure the service does not get
    stuck.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT29570

  • Reported component name

    SP PLUS

  • Reported component ID

    5737SPLUS

  • Reported release

    A13

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2019-06-27

  • Closed date

    2019-08-15

  • Last modified date

    2019-08-15

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SP PLUS

  • Fixed component ID

    5737SPLUS

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSNQFQ","label":"IBM Spectrum Protect Plus"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"A13","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
30 January 2024