IBM Support

IT32949: Queue manager terminates due to a SIGSEGV issued for amqrrmfa process with Probe Id XC130004

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • Queue manager ends due to a a SIGSEGV issued for amqrrmfa
    process with Probe ID XC130004.
    
    The problem happens when messages are pending for a cluster
    queue and
    one of the queue managers in the cluster which hosts the queue
    becomes
    unavailable.  At this point the amqrrmfa process attempts to
    reallocate the
    messages on the SCTQ to an alternative (available) queue
    manager.
    The error is triggered when one of the reallocated messages is
    found to have
    expired.
    
    The following failure data capture (FDC) record is generated:
    
    Probe Id         :- XC130004
    Component        :- xehExceptionHandler
    Build Level      :- p910-004-191122
    Program Name     :- amqrrmfa
    Arguments        :- -m ABCD -t2332800 -s2592000 -p2592000
    -g5184000 -c3600
    Process          :- 13107212
    Thread           :- 1   Repository
    Last ObjectName  :- SYSTEM.CLUSTER.TRANSMIT.QUEUE
    Major Errorcode  :- STOP
    Minor Errorcode  :- OK
    Probe Type       :- HALT6109
    Probe Description :- AMQ6109S: An internal IBM MQ error has
    occurred.
    Arith1           :- 11 (0xb)
    Comment1         :- SIGSEGV: invalid address permissions(0)
    
    MQM Function Stack
    amqrrfma_main
    rrmMain
    rrmRepository
    rrmProcessMsg
    rrmReallocMsgs
    MQGET
    zstMQGET
    zifMQGET
    zsqMQGET
    kpiMQGET
    kqiWaitForMessage
    apiGetMessage
    aqmGetMessage
    aqhGetMessage
    aqqReadMsg
    aqqReadBytes
    aqpReadData
    xcsFFST
    

Local fix

Problem summary

  • ****************************************************************
    USERS AFFECTED:
    Users of MQ clustering and message expiry.
    
    
    Platforms affected:
    MultiPlatform
    
    ****************************************************************
    PROBLEM DESCRIPTION:
    A queue manager (Qmgr B) in an MQ cluster became unavailable
    while another
    queue manager (Qmgr A) was waiting to send messages to a queue
    hosted by
    Qmgr B.
    The amqrrmfa process on Qmgr A attempted to reallocate the
    messages waiting
    on the SYSTEM.CLUSTER.TRANSMIT.QUEUE to another queue manager in
    the
    cluster, but a programming error in the queue manager code meant
    that the
    amqrrmfa process hit a memory access violation error when one of
    the reallocated
    messages was found to have expired.
    

Problem conclusion

  • The queue manager code has been fixed to prevent the access
    violation.
    
    ---------------------------------------------------------------
    The fix is targeted for delivery in the following PTFs:
    
    Version    Maintenance Level
    v9.0 LTS   9.0.0.11
    v9.1 LTS   9.1.0.7
    v9.2 LTS   9.2.0.2
    v9.x CD    9.2.1
    
    The latest available maintenance can be obtained from
    'WebSphere MQ Recommended Fixes'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037
    
    If the maintenance level is not yet available information on
    its planned availability can be found in 'WebSphere MQ
    Planned Maintenance Release Dates'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
    ---------------------------------------------------------------
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT32949

  • Reported component name

    IBM MQ BASE MP

  • Reported component ID

    5724H7271

  • Reported release

    910

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    YesHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2020-05-21

  • Closed date

    2020-11-11

  • Last modified date

    2020-11-19

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    IBM MQ BASE MP

  • Fixed component ID

    5724H7271

Applicable component levels

[{"Line of Business":{"code":"LOB36","label":"IBM Automation"},"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSYHRD","label":"IBM MQ"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"910"}]

Document Information

Modified date:
20 November 2020