IBM Support

IC71584: V6.X SERVER MAY CRASH DURING BA/API CLIENT BACKUP OR ARCHIVE OP-ERATIONS IF THE SESSION IS BEING RETRIED DUE TO PREVIOUS FAILURE

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • A Tivoli Storage Manager 6.x Server might crash during
    BA/API Client backup, or archive, operations due to the client
    sending invalid protocol information during a session retry
    attempt.
    
    1.) This can occur if the Backup/Archive Client has the
    DEDUPLICATION option set to YES (client deduplication) and a
    retryable communication has been encountered.
    
    2.) This can occur if the API Client encounters a retryable
    communication independent if client deduplication is being used
    or not.
    
    The server crashes since the client session did not establish a
    valid store environment on the server due to the protocol
    violation.
    
    
    Customer/L2 Diagnostics:
    Call stack of a crash dump from a Tivoli Storage Manager
    6.2.1.1 Windows server crashing due to a B/A Client using
    client deduplication:
    adsmdll bfCreate 0xa3
    adsmdll!CreateBitfile+0x431
    adsmdll!DoBackInsNormEnhanced+0x3fa
    adsmdll!SmNodeSession+0x21df
    adsmdll!HandleNodeSession+0x1602
    adsmdll!smExecuteSession+0x1cdf
    adsmdll!SessionThread+0x405
    adsmdll!startThread+0xa5
    msvcr90!_callthreadstartex+0x17
    msvcr90!_threadstartex+0x84
    kernel32!BaseThreadStart+0x3a
    
    
    Call stack of a crash dump from a Tivoli Storage Manager
    6.2.1.1 Linux server crashing due to an API Client without
    client deduplication:
    #0  bfCreate (sessHandle=0x0, txnId=0x220765f8, bfId=1507417381,
    estSize=2505072, estBitfileSize=2505072,
        ck1=212, ck2=2, poolName=0x4811e620 "FILE.POOL",
    mountWaitMode=bfWaitMount,
        sourceFunc=0xaba570 <SmRecvNextData>, contextP=0x1e9894e8,
    aggregateState=admAggregate,
        forceAggregation=False) at bfcreate.c:1011
    #1  0x0000000000a426fd in CreateBitfile (sessP=0x1e9894e8,
    bfHandle=0x0, txnId=0x220765f8, bfId=...,
        ck1=212, ck2=2, poolName=0x4811e620 "FILE.POOL",
    estSize=..., estBitfileSize=..., mountWaitMode=2,
        objType=1 '\001') at smnode.c:25902
    #2  0x0000000000a68cba in SmNodeSession (sessP=0x1e9894e8,
    logSummaryP=0x4811f27c) at smnode.c:15454
    #3  0x0000000000a23dd4 in HandleNodeSession (sessP=0x1e9894e8,
    nodeInfoP=0x4811fb20, bIsSchedSess=False)
        at smexec.c:4860
    #4  0x0000000000a26661 in DoNodeGeneral (infoP=<value optimized
    out>, beginFunc=<value optimized out>,
        sendFunc=<value optimized out>, recvFunc=<value optimized
    out>, flushFunc=<value optimized out>,
        abortFunc=<value optimized out>, qmethodFunc=0xcc73e0
    <tcpQryMethod>,
        qaddressFunc=0xcca240 <tcpQueryAddress>, authFunc=0,
    isNetworkMethod=True,
        qIsNodeThreadFunc=0xcc73c0 <tcpIsNodeThread>) at
    smexec.c:4901
    #5  smExecuteSession (infoP=<value optimized out>,
    beginFunc=<value optimized out>,
        sendFunc=<value optimized out>, recvFunc=<value optimized
    out>, flushFunc=<value optimized out>,
        abortFunc=<value optimized out>, qmethodFunc=0xcc73e0
    <tcpQryMethod>,
        qaddressFunc=0xcca240 <tcpQueryAddress>, authFunc=0,
    isNetworkMethod=True,
        qIsNodeThreadFunc=0xcc73c0 <tcpIsNodeThread>) at
    smexec.c:3124
    #6  0x0000000000cc7d1b in psSessionThread (argP=<value optimized
    out>) at tcpcomm.c:2504
    #7  0x0000000000cb914b in StartThread
    (startInfoP=0x2aaab022e578) at pkthread.c:3357
    #8  0x0000003e0ac0673d in start_thread () from
    /lib64/libpthread.so.0
    #9  0x0000003e0a0d3d1d in clone () from /lib64/libc.so.6
    
    
    NOTE:
    The key function in the callstack for verifying this APAR
    condition is bfCreate.
    
    
    Tivoli Storage Manager Versions Affected:
    Tivoli Storage Manager V6.1 and 6.2 servers
    
    
    Initial Impact:
    Medium
    
    
    Additional Keywords:
    TSM zz62 txn abort crash crashing dedup deduplication retry
    retryable protocol violation ANR9999D IC71720
    

Local fix

Problem summary

  • ****************************************************************
    * USERS AFFECTED: All Tivoli Storage Manager server users.     *
    ****************************************************************
    * PROBLEM DESCRIPTION: See ERROR DESCRIPTION.                  *
    ****************************************************************
    * RECOMMENDATION: Apply fixing level when available. This      *
    *                 problem is currently projected to be fixed   *
    *                 in level 6.2.2.                              *
    *                 Note that this is subject to change at the   *
    *                 discretion of IBM.                           *
    ****************************************************************
    *
    

Problem conclusion

  • The described problem has been resolved.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IC71584

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    62W

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2010-10-01

  • Closed date

    2010-11-01

  • Last modified date

    2010-11-01

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

  • R62A PSY

       UP

  • R62H PSY

       UP

  • R62L PSY

       UP

  • R62S PSY

       UP

  • R62W PSY

       UP

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"62W","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
01 November 2010