Recovering from PowerHA SystemMirror script failure
Select this option from the Problem Determination Tools menu to recover from a PowerHA® SystemMirror® script failure.
About this task
For example, if a script failure occurs because a filesystem mount failed, you can correct the problem, mount the filesystem manually, and then use this option to complete the rest of the cluster event processing.
The Recover From PowerHA SystemMirror Script Failure menu option sends a signal to the Cluster Manager daemon (clstrmgrES) on the specified node, causing it to proceed to the next step in the cluster event. If a subsequent event failure occurs, you must repeat the process: correct the problem, then use the Recover From PowerHA SystemMirror Script Failure option again to continue to the next step. Continue this process until the cluster state becomes "stable".
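Before correcting a problem, it can help to find which event failed. The following is a minimal sketch, not an IBM-supplied tool: the function name list_failed_events is hypothetical, and the sketch assumes that failed events are recorded in /var/hacmp/log/hacmp.out on lines that contain the text EVENT FAILED. Adjust the pattern if your log uses a different marker.

```shell
#!/bin/sh
# Hypothetical helper: list the failed events recorded in an event log.
# Assumption: failed events appear on lines containing "EVENT FAILED".
# The default path is the hacmp.out log named in this topic.
list_failed_events() {
    log_file="${1:-/var/hacmp/log/hacmp.out}"
    if [ ! -r "$log_file" ]; then
        echo "cannot read $log_file" >&2
        return 2
    fi
    # -n prefixes each matching line with its line number in the log,
    # which makes it easier to find the surrounding script output.
    grep -n "EVENT FAILED" "$log_file"
}

# Example (on a cluster node): show the most recent failure.
#   list_failed_events | tail -n 1
```

The function returns a nonzero status when no failure lines are found, so it can also be used as a quick check in a script.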
Make sure that you fix the problem that caused the script failure. You must also manually complete the steps that follow the point of failure in the event script (see /var/hacmp/log/hacmp.out). Then, to resume clustering, complete the following steps to bring the PowerHA SystemMirror event script state to EVENT COMPLETED:
Procedure
- Enter smit hacmp
- In SMIT, select Problem Determination Tools > Recover From PowerHA SystemMirror Script Failure.
- Select the IP label/address for the node on which you want to run the clruncmd command and press Enter. The system prompts you to confirm the recovery attempt. The IP label is listed in the /etc/hosts file and is the name assigned to the service IP address of the node on which the failure occurred.
- Press Enter to continue. Another SMIT panel appears to confirm the success of the script recovery.
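After the recovery, you might want to confirm from the command line that the cluster state is stable. The following is a minimal sketch under stated assumptions: the function name cluster_is_stable is hypothetical, and the sketch assumes that the output of lssrc -ls clstrmgrES contains a line of the form "Current state: ST_STABLE" when the Cluster Manager is stable. Verify the exact output on your release before relying on it.

```shell
#!/bin/sh
# Hypothetical helper: read the output of `lssrc -ls clstrmgrES` on
# standard input and succeed only if the reported state is ST_STABLE.
# Assumption: the output contains a line "Current state: <STATE>".
cluster_is_stable() {
    # Keep only the value after "Current state:", trimmed of whitespace.
    state=$(sed -n '/^Current state:/{
        s/^Current state:[[:space:]]*//
        s/[[:space:]]*$//
        p
    }' | head -n 1)
    echo "Cluster Manager state: ${state:-unknown}"
    [ "$state" = "ST_STABLE" ]
}

# Example (on a cluster node):
#   lssrc -ls clstrmgrES | cluster_is_stable || echo "not stable yet"
```

If the function reports a state other than ST_STABLE, review the log for a further event failure and repeat the recovery procedure.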