Abnormal termination of Cluster Manager daemon

The AIX® resource controller subsystem monitors the cluster manager daemon process. If the controller detects that Cluster Manager daemon has exited abnormally (without being shut down using the clstop command), it executes the /usr/es/sbin/cluster/utilities/clexit.rc script to halt the system. This prevents unpredictable behavior from corrupting the data on the shared disks.

See the clexit.rc man page for additional information.

The clexit.rc script creates an AIX error log entry. Here is an example showing the long output:

LABEL: OPMSG
IDENTIFIER:     AA8AB241

Date/Time:Fri Jan  7 10:44:46
Sequence Number: 626
Machine Id:      000001331000
Node Id:ppstest8
Class:  O
Type:   TEMP
Resource Name:   OPERATOR

Description
OPERATOR NOTIFICATION

User Causes
ERRLOGGER COMMAND

 Recommended Actions
 REVIEW DETAILED DATA

Detail Data
MESSAGE FROM ERRLOGGER COMMAND
clexit.rc : Unexpected termination of clstrmgrES

The clexit.rc error message in short form looks like this:

AA8AB241   0107104400 T O OPERATOROPERATOR NOTIFICATION
Important: Never use the kill -9 command on the clstrmgr daemon. Using the kill command causes the clstrmgr daemon to exit abnormally. This causes the System Resource Controller (SRC) facility to run the script /usr/es/sbin/cluster/utilities/clexit.rc , which halts the system immediately and causes the surviving nodes to initiate fallover.

You can modify the file /etc/cluster/hacmp.term to change the default action after an abnormal exit. The clexit.rc script checks for the presence of this file, and if you have made it executable, the instructions there will be followed instead of the automatic halt called by clexit.rc . Please read the caveats contained in the /etc/cluster/hacmp.term file, however, before making any modifications.