IBM Support

IT17170: PROCESS HANGS WAITING FOR FILE LIBRARY INPUT VOLUME AFTER A STORAGE AGENT USES THE SAME VOLUME FOR INPUT.

 

APAR status

  • Closed as program error.

Error description

  • When a storage agent acquires a volume for input, it does not
    maintain the list of volume waiters, so a pending request for
    the volume is lost. This can happen only in file storage pools,
    when an input volume for the storage agent is also requested by
    a server data movement operation such as migration, reclamation,
    MOVE DATA, or MOVE NODEDATA.
    .
    Because the volume waiter has been lost, QUERY PROCESS shows the
    process that requested the volume waiting indefinitely for the
    volume to be mounted. The server data movement operation cannot
    be canceled, even after all client sessions that are accessing
    the volume have ended or have been canceled.
    .
    The following scenario is an example of the series of events
    leading up to the hang of a migration process:
    .
    1. A file volume in a GPFS file system is in the filling state,
       the file storage pool is only 60% full, and there are
       plenty of private or scratch file volumes available for
       selection.
    .
    2. The volume is selected by a storage agent as an output
       volume because it is in the filling state, and is used to
       back up client data.
    .
    3. During this time a migration process is attempting to acquire
       the volume as an input volume.
    .
    4. While migration is attempting to acquire the volume, a client
       restore session preempts the migration process, and the
       storage agent opens the volume as an input volume for the
       restore operation. The client restore session does not
       maintain the volume waiter list, so the waiter request of the
       migration process, which is acquiring the volume for input,
       is lost.
    .
    5. After all session requests for the volume have ended or have
       been canceled the migration process continues to wait for
       the volume indefinitely because there is no notification
       that the volume has been freed up.
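    .
    The sequence above can be illustrated with a small toy model. This
    is only a sketch under stated assumptions, not the server's actual
    implementation: the class FileVolume, its methods, and the session
    names are hypothetical, and the model simplifies each volume to a
    single holder plus a queue of waiters. It shows how dropping the
    waiter list during the storage agent's input acquisition leaves the
    migration process with no notification when the volume is freed.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class FileVolume:
    """Hypothetical toy model: one holder and a queue of waiters."""
    name: str
    holder: Optional[str] = None
    waiters: List[str] = field(default_factory=list)

    def request(self, who: str) -> bool:
        """Grant the volume if it is free; otherwise queue the requester."""
        if self.holder is None:
            self.holder = who
            return True
        self.waiters.append(who)
        return False

    def agent_acquire_input(self, agent: str, maintain_waiters: bool) -> None:
        """Storage agent takes the volume for input, ahead of queued requests.

        With maintain_waiters=False the waiter list is dropped, which models
        the behavior described in this APAR (an assumption of this sketch).
        """
        self.holder = agent
        if not maintain_waiters:
            self.waiters = []

    def release(self) -> Optional[str]:
        """Free the volume and hand it to the next recorded waiter, if any."""
        self.holder = None
        if self.waiters:
            self.holder = self.waiters.pop(0)
        return self.holder


def run_scenario(maintain_waiters: bool) -> Optional[str]:
    """Replay steps 2 to 5 and return who holds the volume at the end."""
    vol = FileVolume("vol1")
    vol.request("storage-agent-backup")   # step 2: agent uses the volume
    vol.request("migration")              # step 3: migration becomes a waiter
    vol.agent_acquire_input("storage-agent-restore", maintain_waiters)  # step 4
    return vol.release()                  # step 5: all sessions have ended


if __name__ == "__main__":
    print("waiters maintained:", run_scenario(True))
    print("waiters dropped:   ", run_scenario(False))
```

    In this model, run_scenario(True) hands the volume to the migration
    process once the sessions end, whereas run_scenario(False) leaves
    the volume free with no recorded waiter, so the migration process
    is never notified and keeps waiting.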
    .
    Initial Impact:
    Medium
    .
    Tivoli Storage Manager Versions Affected:
    Applies to all current 6.3 and 7.1 server versions.
    .
    Additional Keywords:
    Hang hung wait session process Library STA TSM GPFS LANFree
    

Local fix

  • Halt and restart the Spectrum Protect server to clear the
    deadlock.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All IBM Spectrum Protect server users.                       *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See error description.                                       *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available.                           *
    * This problem is currently projected to be fixed in levels    *
    * 7.1.8 and 8.1.1.                                             *
    * Note that this is subject to change at the discretion of     *
    * IBM.                                                         *
    ****************************************************************
    

Problem conclusion

  • This problem was fixed.
    Affected platforms:  AIX, HP-UX, Solaris, Linux, and Windows.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT17170

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    71L

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2016-09-22

  • Closed date

    2016-11-09

  • Last modified date

    2016-12-13

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

  • R71A PSY

       UP

  • R71H PSY

       UP

  • R71L PSY

       UP

  • R71S PSY

       UP

  • R71W PSY

       UP

Line of Business: Storage
Business Unit: IBM Infrastructure w/TPS
Product: Tivoli Storage Manager
Platform: Platform Independent
Version: 7.1.3

Document Information

Modified date:
26 September 2021