IBM Support

IC78904: WLM DAEMON, LAST USED DAEMON, AND FAST WRITERS GET NODE FAILURE ERRORS EVEN AFTER NODE IS BROUGHT BACK ONLINE

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • The wlm daemon, last used daemon, and fast writers are not
    recovering from node failures properly and continue to get NODE
    FAILURE errors even after the node failure is resolved.
    
    This is a typical message that would be seen in the db2diag.log
    even after the node failure has been resolved.
    
    2011-08-16-08.20.00.960332-240 I10265006A522        LEVEL:
    Severe
    PID     : 516680               TID  : 39198         PROC :
    db2sysc 0
    INSTANCE: dbinst1          NODE : 000            DB   : TESTDB
    APPHDL  : 0-113               APPID: *N0.DB2.110812125911
    AUTHID  : dbinst1
    EDUID   : 39198               EDUNAME: db2agntp 0
    FUNCTION: DB2 UDB, WLM, sqlrwWlmRPCRouter, probe:20
    CALLED  : DB2 UDB, WLM, sqlrwWlmRPCRouter
    RETCODE : ZRC=0x81580016=-2124939242=SQLKD_NODE_FAILURE
              "Mapping for SQLKF_NODE_FAILED"
    

Local fix

  • Deactivate the database and terminate all connections to the
    database. In other words, restart the database.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * DPF environments                                             *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See above.                                                   *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Upgrade to v9.7 fp6 or higher.                               *
    ****************************************************************
    

Problem conclusion

  • After applying the fix for this APAR, background system
    applications (wlm daemon, last used daemon, fast writers) will
    recover from node failure properly so that they don't continue
    to report node failed errors after the node failure has been
    resolved. The fix was first included in v9.7 fp6.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IC78904

  • Reported component name

    DB2 FOR LUW

  • Reported component ID

    DB2FORLUW

  • Reported release

    970

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2011-09-24

  • Closed date

    2012-06-04

  • Last modified date

    2012-06-04

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    DB2 FOR LUW

  • Fixed component ID

    DB2FORLUW

Applicable component levels

  • R970 PSN

       UP

[{"Business Unit":{"code":"BU048","label":"IBM Software"},"Product":{"code":"SSEPGG","label":"DB2 for Linux, UNIX and Windows"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"9.7","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]

Document Information

Modified date:
04 June 2012