IBM Support

IT10373: IBM MQ V8 QUEUE MANAGER HANGS AND GENERATES AN FDC WITH PROBE ID XC307100

APAR status

  • Closed as program error.

Error description

  • IBM MQ version 8 queue manager hangs and generates a Failure
    Data Capture (FDC) file which shows:
    Probe Id :- XC307100
    Component :- xlsRequestMutex
    Program Name :- amqzlaa0
    Major Errorcode :- xecL_W_LONG_LOCK_WAIT
    Probe Description :- AMQ6150: WebSphere MQ semaphore is busy
    There is no corresponding FDC showing xlsReleaseMutex; the
    absence of that FDC indicates a hang.
    
    FDC Summary:
    
    One or more FDCs with the xecL_W_LONG_LOCK_WAIT error code are
    generated, likely including an FDC from the amqzmuc0 process.
    
    2015/11/02 21:58:01.574580+7158 amqzmuc0 14942374 25 XC307100
    xlsRequestMutex xecL_W_LONG_LOCK_WAIT
    2015/11/02 21:58:01.575609+7158 amqzlaa0 11862512 922 XC307100
    xlsRequestMutex xecL_W_LONG_LOCK_WAIT
    2015/11/02 21:58:01.579309+7158 amqzlaa0 9830708 540 XC307100
    xlsRequestMutex xecL_W_LONG_LOCK_WAIT
    
    Probe Id :- XC307100
    Component :- xlsRequestMutex
    Program Name :- amqzmuc0
    Thread :- 25 AsyncQActivity
    Major Errorcode :- xecL_W_LONG_LOCK_WAIT
    Probe Type :- MSGAMQ6150
    Probe Description :- AMQ6150: WebSphere MQ semaphore is busy.
    
    MQM Function Stack
    zmuThreadMain
    zmuAsyncQActivityTask
    kpiAsyncQActivityTask
    kpiCheckDeleteCLXmitqs
    kqiTryDeleteShadowHdl
    xcsDecrementHashCellUseCount
    xcsDecrementQuickCellUseCount
    xlsRequestMutex
    xcsFFST
    
    Probe Id :- XC307100
    Component :- xlsRequestMutex
    Thread :- 922 SharedAgent
    Last ObjectName :- SYSTEM.CLUSTER.TRANSMIT.QUEUE
    Major Errorcode :- xecL_W_LONG_LOCK_WAIT
    Probe Type :- MSGAMQ6150
    Probe Description :- AMQ6150: WebSphere MQ semaphore is busy.
    
    MQM Function Stack
    zlaMainThread
    zlaProcessMessage
    zlaProcessMQIRequest
    zlaMQPUT
    zsqMQPUT
    kpiMQPUT
    kqiPutIt
    kqiFastnetBindXmitQ
    apiLookupMemoryObject
    xlsRequestMutex
    xcsFFST
    
    Probe Id :- XC307100
    Component :- xlsRequestMutex
    Thread :- 540 SharedAgent
    Last ObjectName :- SYSTEM.CLUSTER.TRANSMIT.QUEUE
    Major Errorcode :- xecL_W_LONG_LOCK_WAIT
    Probe Type :- MSGAMQ6150
    Probe Description :- AMQ6150: WebSphere MQ semaphore is busy.
    
    MQM Function Stack
    zlaMainThread
    zlaProcessMessage
    zlaProcessMQIRequest
    zlaMQPUT
    zsqMQPUT
    kpiMQPUT
    kqiPutIt
    kqiFastnetSetResolvedQ
    kqiFastnetSetXmitQ
    xcsAllocateHashCell
    xcsAllocateQuickCell
    xlsRequestMutex
    xcsFFST
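
To check a set of FDC summary lines for this signature, a small script can count XC307100 records per program. The following is a minimal sketch, not an IBM tool: the field layout (date, time, program, process ID, thread, probe ID, component, error code) is inferred from the sample summary above, and the function names are hypothetical.

```python
import re
from collections import Counter

# Field layout inferred from the sample summary in this APAR (an assumption,
# not a documented format): date, time, program, pid, thread, probe, component, error.
RECORD = re.compile(
    r"(?P<date>\d{4}/\d{2}/\d{2})\s+(?P<time>\S+)\s+"
    r"(?P<program>\S+)\s+(?P<pid>\d+)\s+(?P<thread>\d+)\s+"
    r"(?P<probe>[A-Z]{2}\d{6})\s+(?P<component>\S+)\s+(?P<error>\S+)"
)

def parse_fdc_summary(text):
    """Return one dict per summary record, tolerating records wrapped over two lines."""
    flat = " ".join(text.split())  # collapse line breaks and repeated spaces
    return [m.groupdict() for m in RECORD.finditer(flat)]

def long_lock_waits(records):
    """Count XC307100 records with xecL_W_LONG_LOCK_WAIT, grouped by program."""
    return Counter(
        r["program"]
        for r in records
        if r["probe"] == "XC307100" and r["error"] == "xecL_W_LONG_LOCK_WAIT"
    )

SAMPLE = """
2015/11/02 21:58:01.574580+7158 amqzmuc0 14942374 25 XC307100
xlsRequestMutex xecL_W_LONG_LOCK_WAIT
2015/11/02 21:58:01.575609+7158 amqzlaa0 11862512 922 XC307100
xlsRequestMutex xecL_W_LONG_LOCK_WAIT
2015/11/02 21:58:01.579309+7158 amqzlaa0 9830708 540 XC307100
xlsRequestMutex xecL_W_LONG_LOCK_WAIT
"""

if __name__ == "__main__":
    for program, count in long_lock_waits(parse_fdc_summary(SAMPLE)).items():
        print(program, count)
```

For the three records shown in this APAR, the count is two for amqzlaa0 and one for amqzmuc0. A match on the signature alone does not confirm this APAR; the function stacks in the FDCs should also be compared.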
    

Local fix

Problem summary

  • ****************************************************************
    USERS AFFECTED:
    This issue affects users of MQ version 8 clustering.
    
    
    Platforms affected:
    MultiPlatform
    
    ****************************************************************
    PROBLEM DESCRIPTION:
    A timing condition existed whereby a deadlock state could be
    entered during the cleanup of CLUSSDR channels. It was possible
    for a thread to obtain an inconsistent set of locks and prevent
    other threads from completing, resulting in the FDC with probe
    ID XC307100 and a queue manager hang.
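
The general idea of avoiding this kind of deadlock by acquiring locks in one agreed order can be illustrated as follows. This is a generic sketch only: it does not reflect the MQ internal locking model, and `RankedLock`, `with_locks`, and the lock names are hypothetical.

```python
import threading
from contextlib import ExitStack

class RankedLock:
    """A lock with a fixed rank. Hypothetical helper, not an MQ construct."""

    def __init__(self, rank, name):
        self.rank = rank
        self.name = name
        self._lock = threading.Lock()

    def __enter__(self):
        self._lock.acquire()
        return self

    def __exit__(self, *exc):
        self._lock.release()
        return False

def with_locks(locks, action):
    """Acquire all locks in ascending rank order, run action, then release them."""
    with ExitStack() as stack:
        for lock in sorted(locks, key=lambda item: item.rank):
            stack.enter_context(lock)
        return action()

def run_demo(iterations=1000):
    """Two threads request the same locks in opposite order without deadlocking."""
    lock_a = RankedLock(1, "lock_a")
    lock_b = RankedLock(2, "lock_b")
    counter = {"n": 0}

    def bump():
        counter["n"] += 1  # safe: both locks are held here

    def worker(requested):
        for _ in range(iterations):
            with_locks(requested, bump)

    threads = [
        threading.Thread(target=worker, args=([lock_a, lock_b],)),
        threading.Thread(target=worker, args=([lock_b, lock_a],)),
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join(timeout=30)
    still_running = any(t.is_alive() for t in threads)
    return counter["n"], still_running

if __name__ == "__main__":
    print(run_demo())
```

Because `with_locks` sorts by rank before acquiring, neither thread can hold `lock_b` while waiting for `lock_a`, so the circular wait that defines a deadlock cannot form.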
    

Problem conclusion

  • The MQ clustering code has been updated to correctly follow the
    appropriate locking model when cleaning up CLUSSDR channels, in
    order to prevent this timing condition.
    
    ---------------------------------------------------------------
    The fix is targeted for delivery in the following PTFs:
    
    Version    Maintenance Level
    v8.0       8.0.0.4
    
    The latest available maintenance can be obtained from
    'WebSphere MQ Recommended Fixes'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037
    
    If the maintenance level is not yet available, information on
    its planned availability can be found in 'WebSphere MQ
    Planned Maintenance Release Dates'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
    ---------------------------------------------------------------
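
A quick way to tell whether an installation is at or above the targeted maintenance level is to compare version strings numerically. The sketch below assumes a four-part version string such as the one reported by dspmqver; the function names are hypothetical, and releases other than v8.0 are reported as not covered because this APAR states a fix level only for v8.0.

```python
FIX_LEVEL = (8, 0, 0, 4)  # maintenance level named in this APAR

def parse_level(version):
    """Turn a string like '8.0.0.4' into a tuple of integers."""
    return tuple(int(part) for part in version.strip().split("."))

def has_fix(version):
    """Return True/False for v8.0 levels, or None for releases this APAR does not cover."""
    level = parse_level(version)
    if level[:2] != FIX_LEVEL[:2]:
        return None  # fix level for other releases is not stated here
    return level >= FIX_LEVEL

if __name__ == "__main__":
    for candidate in ("8.0.0.3", "8.0.0.4", "8.0.0.10"):
        print(candidate, has_fix(candidate))
```

Comparing integer tuples rather than raw strings avoids ordering 8.0.0.10 below 8.0.0.4.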
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT10373

  • Reported component name

    WMQ BASE MULTIP

  • Reported component ID

    5724H7251

  • Reported release

    800

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2015-07-28

  • Closed date

    2015-08-18

  • Last modified date

    2015-11-04

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    WMQ BASE MULTIP

  • Fixed component ID

    5724H7251

Applicable component levels

  • R800 PSY

       UP


Document Information

Modified date:
04 November 2015