IBM Support

webMethods Knowlegebase : Cluster failure (1753069)

Troubleshooting


Problem

A Nirvana cluster sporadically fails. Realms started to delay inter realm communications and auto generated thread dumps show threads either in runnable, waiting or blocked state when the cluster went unstable. Clients were all getting disconnected.

Realms started to delay inter realms communications:

Delaying inter realm communications to server

Realms then started to disconnect clients because cluster failed:

Disconnecting client \ due to cluster failure and client registered interest

Then some realms went offline

Cluster> Changing state from nSlaveState to OfflineState

Then cluster was able to be formed:

Cluster> Found existing Master in cluster as server, setting local state to that of cluster

and failed again after trying to revover from master as stated by a thread dump threads were waiting:

"Scheduler Worker Pool:9" daemon prio=5 tid=0xd0 waiting on com.pcbsys.foundation.yb@608ce0b2 WAITING

at java.lang.Object.wait () (Native Method)

at java.lang.Object.wait (Object.java:503)

at com.pcbsys.foundation.xh.c ()

at com.pcbsys.foundation.xh.run ()

at com.pcbsys.foundation.uh.k ()

at com.pcbsys.foundation.ci.run ()

"Scheduler Worker Pool:8" daemon prio=5 tid=0xcf waiting on com.pcbsys.foundation.yb@608ce0b2 WAITING

at java.lang.Object.wait () (Native Method)

at java.lang.Object.wait (Object.java:503)

at com.pcbsys.foundation.xh.c ()

at com.pcbsys.foundation.xh.run ()

at com.pcbsys.foundation.uh.k ()

at com.pcbsys.foundation.ci.run ()

"Scheduler Worker Pool:7" daemon prio=5 tid=0xce waiting on com.pcbsys.foundation.yb@608ce0b2 WAITING

at java.lang.Object.wait () (Native Method)

at java.lang.Object.wait (Object.java:503)

at com.pcbsys.foundation.xh.c ()

at com.pcbsys.foundation.xh.run ()

at com.pcbsys.foundation.uh.k ()

at com.pcbsys.foundation.ci.run ()

Document Location

Worldwide

[{"Line of Business":{"code":"LOB77","label":"Automation Platform"},"Business Unit":{"code":"BU048","label":"IBM Software"},"Product":{"code":"SSVYEV","label":"IBM webMethods Integration"},"ARM Category":[{"code":"a8mKe00000000AQIAY","label":"Software AG Universal Messaging (NUM)"}],"ARM Case Number":"","Platform":[{"code":"PF025","label":"Red Hat Enterprise Linux"}],"Version":"6.1"},{"Line of Business":{"code":"LOB77","label":"Automation Platform"},"Business Unit":{"code":"BU048","label":"IBM Software"},"Product":{"code":"SSFIWYE","label":"IBM webMethods B2B"},"ARM Category":[{"code":"a8mKe00000000AQIAY","label":"Software AG Universal Messaging (NUM)"}],"ARM Case Number":"","Platform":[{"code":"PF025","label":"Red Hat Enterprise Linux"}],"Version":"6.1"}]

Log InLog in to view more of this document

This document has the abstract of a technical article that is available to authorized users once you have logged on. Please use Log in button above to access the full document. After log in, if you do not have the right authorization for this document, there will be instructions on what to do next.

Document Information

Modified date:
20 March 2025

UID

ibm17210531