Recovering from cluster job failures

Failure of a cluster resource services job is usually indicative of some other problem.

You should look for the job log associated with the failed job and look for messages that describe why it failed. Correct any error situations.

You can use the Change Cluster Recovery (CHGCLURCY) command to restart a cluster resource group job that was ended without having to end and restart clustering on a node.

  1. CHGCLURCY CLUSTER(EXAMPLE)CRG(CRG1)NODE(NODE1)ACTION(*STRCRGJOB) This command will cause cluster resource group job, CRG1, on node NODE1 to be submitted. To start the cluster resource group job on NODE1 requires clustering to be active on NODE1.
  2. Restart clustering on the node.

If you are using a IBM® Business Partner cluster management product, refer to the documentation that came with the product.