IBM Support

IC85767: SERVER CAN CRASH AFTER PKACQUIREMUTEXTRACKED FUNCTION IS CALLED

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • Server crashed after acquiring a mutex a second time.
    
    
    DBX call stack from the core (this example is from AIX)  will
    show similar to;
    
    
    pth_signal.pthread_kill(??, ??) at 0x9000000004d17d0
    pth_signal._p_raise(??) at 0x9000000004d1048
    raise.raise(??) at 0x90000000002bd2c
    abort.abort() at 0x900000000088604
    psthread.PsAbortServer(??) at 0x100021a80
    pkthread.pkAbort(??) at 0x10001b9e4
    pkmon.TrapSyncError(??) at 0x10000783c
    pkmon.pkAcquireMutexTracked(??, ??, ??) at 0x100007ad0
    smutil.smGetSessIdForBFHandle(??, ??) at 0x1001f95fc
    asshow.ShowVol(??, ??, ??) at 0x1009076b4
    asshow.asVolReportContext(??) at 0x1009085c8
    output.OutDiagToCons(??, ??, ??, ??, ??) at 0x10000e400
    outvarg.outDiagfExt(0x11a6a34c0, 0x44222284a028d350,
    0x9000000004db2f8,
    0x1010c4f6c, 0x11476c498, 0x0, 0x0, 0x0) at 0x1000094b8
    smtrans.SmSendData(??, ??, ??, ??) at 0x1003a905c
    sstrans.RtrvFramed(??, ??, ??, ??, ??, ??, ??, ??) at
    0x1005299d0
    sstrans.ssRtrv(0x1006b2088, 0x113418b38, 0x11a6a3c00, 0x83f84ef,
    0x2b1311b, 0x11a6a3bb8, 0x8, 0x9000000004e2d14) at 0x100528468
    afrtrv.AfRtrv(??, ??, ??, ??, ??, ??, ??, ??) at 0x100701668
    bfrtrv.RtrvOne(??, ??, ??, ??, ??, ??, ??, ??) at 0x10072a638
    bfrtrv.bfRtrv(??, ??, ??, ??, ??, ??, ??, ??) at 0x100727224
    smnqr.SmRetrieveBitfile(0x49, 0x53907, 0x83f84ef, 0x83f84ef,
    0x2b1311b,
    0x117d2f, 0x200000002) at 0x1007ff89c
    smnode.SmDoObjRtrv(??, ??, ??, ??, ??, ??) at 0x1003f6e14
    smnode.SmNodeSession(??, ??) at 0x1003e1bdc
    smexec.HandleNodeSession(??, ??, ??) at 0x10039deb0
    smexec.DoNodeGeneral(??, ??) at 0x1003988f0
    smexec.smExecuteSession(??, ??, ??, ??, ??, ??, ??, ??) at
    0x100394db0
    tcpcomm.psSessionThread(??) at 0x10009d528
    pkthread.StartThread(??) at 0x10001c5a4
    
    dsmserv.err will show messages similar to;
    
    07/25/2012 09:43:03  ANR9999D Mutex acquire deadlock at
    smutil.c.1214:
    mutex for sminit.c.598 thread 457792 (tid 3840) - mutex is owned
    by
    thread 457792 (tid 3840).
    07/25/2012 09:43:05  ANR9999D Mutex acquisition failure,
    errno=45 from
    smutil.c.1214; thread 457792 (tid 3840).
    
    
    
    
    Platforms affected:
    Tivoli Storage Manager Servers 6.1, 6.2, 6.3 Unix/Linux/Windows
    
    
    Initial Impact: Medium
    
    Additional Keywords:
    ZZ62 ZZ61 ZZ63  82737 crash abend core mutex
    

Local fix

Problem summary

  • ****************************************************************
    * USERS AFFECTED: All Tivoli Storage Manager server users.     *
    ****************************************************************
    * PROBLEM DESCRIPTION: See ERROR DESCRIPTION.                  *
    ****************************************************************
    * RECOMMENDATION: Apply fixing level when available. This      *
    *                 problem is currently projected to be fixed   *
    *                 in levels 6.2.5, and 6.3.2. Note      *
    *                 that this is subject to change at the        *
    *                 discretion of IBM.                           *
    ****************************************************************
    *
    

Problem conclusion

  • This problem was fixed.
    Affected platforms:  AIX, HP-UX, Solaris, Linux, and Windows.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IC85767

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    62A

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2012-08-07

  • Closed date

    2012-08-10

  • Last modified date

    2013-08-20

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

  • R62A PSY

       UP

  • R62H PSY

       UP

  • R62L PSY

       UP

  • R62S PSY

       UP

  • R62W PSY

       UP

  • R63A PSY

       UP

  • R63H PSY

       UP

  • R63L PSY

       UP

  • R63S PSY

       UP

  • R63W PSY

       UP

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"62A","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
20 August 2013