IBM Support

IT34391: RDQM resource probe fails during "stop" action which then prevents auto-restart of the queue manager

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • An active RDQM node rebooted unexpectedly (in the observed case,
    this was due to a kernel panic caused by an unrelated process),
    and encountered further resource issues issues on its restart.
    The queue manager does not automatically restart. Interrogation
    of the RDQM logs and diagnostics indicates that an attempt to
    resolve the mqm user against the operating system failed.
    
    The FFST associated with the monitor error was :
    
    | Probe Id          :- ZS818000
                                                  |
    | Application Name  :- MQM
                                                       |
    | Component         :- zslRDSwitchToMqm
                                          |
    | SCCS Info         :-
    /build/slot2/p910_P/src/lib/zs/amqzslra.c,             |
    | Line Number       :- 2901
                                                      |
    | Program Name      :- rdqm
                                                      |
    | Arguments         :- monitor
                                                   |
    | Major Errorcode   :- xecF_E_UNEXPECTED_SYSTEM_RC
                               |
    | Minor Errorcode   :- OK
                                                        |
    | Probe Type        :- MSGAMQ6119
                                                |
    | Probe Severity    :- 1
                                                         |
    | Probe Description :- AMQ6119S: An internal IBM MQ error has
    occurred        |
    |   (setgroups)
                                                                  |
    | FDCSequenceNumber :- 0
                                                         |
    | Arith2            :- 22 (0x16)
                                                 |
    | Comment1          :- setgroups
                                                 |
    | Comment2          :- Invalid argument
                                          |
    
    MQM Function Stack
    rdqm
    zslRDSwitchToMqm
    xcsFFST
    

Local fix

Problem summary

  • ****************************************************************
    USERS AFFECTED:
    All IBM MQ users using RDQM feature.
    
    
    Platforms affected:
    Linux on x86-64
    
    ****************************************************************
    PROBLEM DESCRIPTION:
    A RDQM node was unable to access the mqm group id on Active
    Directory after the system
    rebooted unexpectedly. This caused the RDQM resource probe to
    fail, which led to a failure
    in the resource "stop" action, causing the resources to get
    marked as "not re-startable"
    This then prevented the queue manager from starting on any node.
    

Problem conclusion

  • The RDQM resource agent has been modified to prevent errors on
    the "stop" action.
    
    ---------------------------------------------------------------
    The fix is targeted for delivery in the following PTFs:
    
    Version    Maintenance Level
    v9.1 LTS   9.1.0.9
    v9.2 LTS   9.2.0.3
    v9.x CD    9.2.1
    
    The latest available maintenance can be obtained from
    'WebSphere MQ Recommended Fixes'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037
    
    If the maintenance level is not yet available information on
    its planned availability can be found in 'WebSphere MQ
    Planned Maintenance Release Dates'
    http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
    ---------------------------------------------------------------
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT34391

  • Reported component name

    IBM MQ BASE MP

  • Reported component ID

    5724H7271

  • Reported release

    910

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2020-09-29

  • Closed date

    2021-06-29

  • Last modified date

    2021-07-16

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    IBM MQ BASE MP

  • Fixed component ID

    5724H7271

Applicable component levels

[{"Line of Business":{"code":"LOB36","label":"IBM Automation"},"Business Unit":{"code":"BU053","label":"Cloud \u0026 Data Platform"},"Product":{"code":"SSYHRD","label":"IBM MQ"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"910"}]

Document Information

Modified date:
18 July 2021