IBM Support

IT31273: 'VSNAP POOL REMOVELOG' FAILS DUE TO TIMEOUT

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • When it is needed to remove log file system from a vSnap storage
    pool, e.g. due to a failing disk, the vsnap command could time
    out.
    The vSnap log will show:
    
    [time] INFO pid-7060 vsnap.cli    CLI process started: ['pool',
    'removelog', '--id', '1']
    [time] INFO pid-7060 vsnap.zfs    Updating pool id 1 to remove
    all log disks
    ...
    [time] ERROR pid-7060 vsnap.linux.system    Timed out (480
    seconds) waiting for command to complete: zpool remove vpool1
    mirror-5
    [time] INFO pid-7060 vsnap.linux.system    Collecting process
    stacks in /opt/vsnap/log/stacks_hungproc_7074_1575102856.txt
    
    
    IBM Spectrum Protect Plus Versions Affected:
    IBM Spectrum Protect Plus 10.1.x and above
    
    Initial Impact: Medium
    
    Additional Keywords: SPP, SPPlus, TS003076677
    

Local fix

  • Run the underlying zpool command instead of the vsnap command.
    Determine the name of the log device to be removed:
      sudo zpool status
    Remove the log device:
      sudo zpool remove vpool1 mirror-1
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * IBM Spectrum Protect Plus level 10.1.5                       *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See Error Description                                        *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available. This problem is currently *
    * projected to be fixed in IBM Spectrum Protect Plus level     *
    * 10.1.6. Note that this is subject to change at the           *
    * discretion of IBM.                                           *
    ****************************************************************
    

Problem conclusion

  • Safely detaching a log or cache device from a vSnap pool can
    take several minutes especially if the pool is undergoing other
    I/O activity at the time of the removal. The vSnap CLI command
    for detaching a log/cache device did not wait long enough. It
    timed out too quickly and reported a failure even though the
    underlying removal was still in progress. The problem has been
    resolved by eliminating the redundant timeout mechanism at the
    CLI layer. The CLI now waits as long as necessary as long as the
    underlying removal is still in progress.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT31273

  • Reported component name

    SP PLUS

  • Reported component ID

    5737SPLUS

  • Reported release

    A15

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2020-01-06

  • Closed date

    2020-04-29

  • Last modified date

    2020-04-29

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    SP PLUS

  • Fixed component ID

    5737SPLUS

Applicable component levels

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSNQFQ","label":"IBM Spectrum Protect Plus"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"A15","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
30 January 2024