IBM Support

IT38305: Server can crash if 'protect stgpool type=local' process is cancelled before worker processes are created.

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • IBM Spectrum Protect server can crash if 'protect stgpool'
    process is cancelled.
    The problem can happen if the 'cancel process' happens during
    the deletion phase where it is only seen that the
    
    Summary process (or processes) are created but the worker
    processes have not been created yet.
    
    'query process' will display if the summary and worker processes
     are running for 'protect stgpool' process.
    
    For example;
    
    query process
    
         107     Protect Stgpool                         Protecting
    storage pool CONTAINER on  server Server1 to COPY_TAPE. Protect
    process   phase: DELETING.
                  (Summary)
    Extents protected: 0 of 0.   Extents failed to protect: 0.
    Amount protected:  0 bytes of 0 bytes. Amount failed to protect:
    0 bytes.
    
    This shows a 'protect stgpool' summary process running in
    deleting phase. No worker process is listed.
    
    
    
    When seen on an AIX server, using dbx, there be a stack similar
    to:
    
    pthread_kill(??, ??) at 0x900000000513884
    _p_raise(??) at 0x9000000005130c4
    raise.raise(??) at 0x90000000003f6e8
    abort() at 0x90000000005e0b8
    PsAbortServer(??) at 0x10001a690
    pkAbort(??) at 0x10001028c
    pkAcquireMutexTracked(??, ??, ??) at 0x100009e60
    SdProtQueryProcess(??, ??, ??, ??, ??) at 0x100c169cc
    procQueryProcess(??, ??, ??, ??, ??, ??) at 0x1004c5070
    AdmQueryProcess(??) at 0x101246d5c
    AdmCommandLocal(??, ??, ??, ??, ??) at 0x10089b7d4
    admCommand(??, ??, ??, ??, ??) at 0x1008985cc
    SmAdminCommandThread(??) at 0x1008c4cf0
    StartThread(0x0) at 0x1000114ac
    
    
    Actlog will show a similar sequence to this before server crash
    occurs:
    
    PROTECT STGPOOL_NAME type=local
    [...]
    ANR0984I Process XX for PROTECT STGPOOL (SUMMARY) started in the
    FOREGROUND
    [...]
    CANCEL PROCESS XX  (SESSION: 10560)
    
    
     | MDVREGR 8.1.11.0-TIV_5698MSV |
    
    
    
    
    
    
    
    
    
    
    IBM Spectrum Protect Versions Affected: Server 8.1.11.000 and
    above on all platforms.
    
    
    
    
    
    
    
    
    Additional Keywords: TSM TS005699408 194097  crash abend core
    protect stgpool cancel
    

Local fix

  • Do not run 'cancel process' or wait until 'query process' shows
    worker processes, before running 'cancel process'.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All IBM Spectrum Protect server users.                       *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See error description.                                       *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available. This problem is currently *
    * projected to be fixed in levels 8.1.12.200 and 8.1.13. Note  *
    * that this is subject to change at the discretion of IBM.     *
    ****************************************************************
    

Problem conclusion

  • This problem was fixed.
    Affected platforms for reported release:  AIX, Linux, and
    Windows.
    Platforms fixed:  AIX, Linux, and Windows.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT38305

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    81A

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2021-09-09

  • Closed date

    2021-12-06

  • Last modified date

    2021-12-06

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

  • R81A PSY

       UP

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"81A","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
27 April 2022