IBM Support

IT44595: REPLICATION CAN CRASH THE SERVER WHEN NO NODES ARE CONFIGURED FOR REPLICATION

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • When replication is run by IBM Storage Protect server, without
    any nodes configured, the end status for the process would be
    the following:
    ......................              completed. Files current: 0.
    Files replicated: 0 of 0.
                Files updated: 0 of 0. Files deleted: 0 of 0. Amount
    
                replicated: 0 bytes of 0 bytes. Amount transferred:
    0
                bytes. Elapsed time: 0 Days, 0 Hours, 2 Minutes.
                (SESSION: ###, PROCESS: #, JOB: ##)
    
    
    When replication process does not find eligible for replication
    file spaces due to node and/or file spaces misconfiguration the
    replication process has nothing to process.
    
    In this case, there is a timing window where a possability
    exists for the main replication thread to exit and clean up
    shared objects before the child threads are finished. When the
    orphaned child threads attempt to access shared objects created
    by the main thread the server may crash.
    
    
    
    
    Support/Customer diagnostics
    
    ANR1934W and ANR2777I messages are reported the activity log.
    ANR1934W REPLICATE NODE: Node node name is disabled.
    ANR2777I REPLICATE NODE: Node node name is decommissioned and
    disabled.
    
    The following call stack, obtained from the core dump file will
    show similar to:
    
    pthread_kill(??, ??) at 0x9000000006aa2f8
    _p_raise(??) at 0x9000000006a9b84
    raise.raise(??) at 0x90000000022a628
    abort() at 0x900000000252ca0
    PsAbortServer(??) at 0x10001aa8c
    pkAbort(??) at 0x1000108b4
    pkAcquireMutexTracked(??, ??, ??) at 0x10000a4e4
    smReplFreeSession(??, ??, ??, ??) at 0x100c34804
    NrFreeSession(??, ??, ??, ??) at 0x100c0e748
    ReplTcrSessionThread(??) at 0x100cd81f4
    StartThread(0x0) at 0x100011a6cLocal Fix:
    
    
    Additional Keywords: Crash, Replication, Node configuration
    TS013571321 TSM spectrum protect
    Versions Affected: Storage Protect Server 8.1.x on all supported
    platforms.
    

Local fix

  • Configure some of the nodes for replication correctly, so that
    the process has something to process,  and does not exit.
    Or do not run replication.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All IBM Spectrum Protect Server users                        *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See error description                                        *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available.                           *
    * The problem currently projected to be fixed in level 8.1.21. *
    * Note: this is subject to change at the discretion of IBM     *
    ****************************************************************
    

Problem conclusion

  • This problem was fixed
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT44595

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    81A

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2023-09-25

  • Closed date

    2024-04-21

  • Last modified date

    2024-04-21

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

[{"Business Unit":{"code":"BU029","label":"Software"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"81A"}]

Document Information

Modified date:
22 April 2024