Replacing a failed appliance in a DR/HA configuration

If you lose one group of HA appliances in a DR/HA configuration, you can replace the appliances and restore the disaster recovery configuration by following this procedure.

About this task

Note: These instructions apply to disaster recovery configurations comprising two pairs of HA appliances. For instructions for an HA configuration with a single DR appliance, see Replacing failed high availability nodes in a disaster recovery configuration. For instructions for a DR configuration consisting of two single appliances, see Replacing a failed node in a disaster recovery configuration.

If a disaster occurs such that the HA appliances at the main site are beyond repair, you fail your queue managers over to the recovery HA pair. You then replace the HA appliances at the main site and restore the disaster recovery configuration.

If you lose the HA appliances at the recovery site, you run a command on one of the main appliances to retrieve the crtdrsecondary command that you used to create the secondary instances of the queue manager. You can then rerun the crtdrsecondary command on the new recovery appliances. Repeat these instructions for each queue manager in your configuration.

Procedure

  • Following the loss of the primary queue manager at the main site, take the following steps:
    1. On one of the recovery appliances, run the following command:
      makedrprimary -m QMname
      
      Where QMname is the name of the queue manager.
    2. Replace the two appliances at the main site, and configure the HA group to look like the original.
    3. On the recovery appliance, run the following command to retrieve the crtdrsecondary command:
      dspdrsecondary -m QMname
      
      Where QMname is the name of the queue manager.
    4. On one of the replacement appliances, run the crtdrsecondary command that was output in step 3. For example:
      crtdrsecondary -m terentia1 -sx -s 1024 -l Hugin,Mugin -i 10.1.20.1,10.1.20.2 -p 2015 -t a -v 10.2.20.1,10.2.20.2
      Synchronization of data from the recovery HA pair to the replacement HA pair begins. Use the status command to check that synchronization has completed before you proceed.
    5. On the recovery appliance currently running the queue manager, run the following command:
      makedrsecondary -m QMname
      
    6. On one of the replacement appliances, make the queue manager the primary queue manager:
      makedrprimary -m QMname
      
  • Following the loss of the appliances at the recovery site, take the following steps:
    1. Replace the recovery appliances, and configure the HA group to look like the original.
    2. On the main appliance, run the following command to retrieve the crtdrsecondary command:
      dspdrsecondary -m QMname
      
      Where QMname is the name of the queue manager.
    3. On one of the replacement recovery appliances, run the crtdrsecondary command that was output in step 2. For example:
      crtdrsecondary -m terentia1 -sx -s 1024 -l Hugin,Mugin -i 10.1.20.1,10.1.20.2 -p 2015 -t a -v 10.2.20.1,10.2.20.2
      Synchronization of data from the main HA pair to the recovery HA pair begins. Use the status command to check that synchronization has completed before you proceed.
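
Because the procedure is repeated for each queue manager, it can help to print the ordered steps as a checklist before you start. The following Python sketch is illustrative only. It does not connect to an appliance or run any command; it only prints the commands named in the steps above. The function names, the location labels, and the placeholder text for the crtdrsecondary command are assumptions made for this example.

```python
from typing import List, Tuple

# A step is (where it is performed, what to run or do).
# The location labels are descriptive text only, not appliance names.
Step = Tuple[str, str]

# The real crtdrsecondary command must be copied from the dspdrsecondary
# output; this sketch never tries to construct it.
CRT_PLACEHOLDER = "<crtdrsecondary command output by dspdrsecondary>"
WAIT_FOR_SYNC = "wait until the status command shows synchronization is complete"


def main_site_loss_plan(qm_name: str) -> List[Step]:
    """Ordered steps after losing the HA pair at the main site."""
    return [
        ("recovery appliance", f"makedrprimary -m {qm_name}"),
        ("main site", "replace both appliances and re-create the HA group"),
        ("recovery appliance", f"dspdrsecondary -m {qm_name}"),
        ("replacement appliance", CRT_PLACEHOLDER),
        ("replacement appliance", WAIT_FOR_SYNC),
        ("recovery appliance", f"makedrsecondary -m {qm_name}"),
        ("replacement appliance", f"makedrprimary -m {qm_name}"),
    ]


def recovery_site_loss_plan(qm_name: str) -> List[Step]:
    """Ordered steps after losing the HA pair at the recovery site."""
    return [
        ("recovery site", "replace both appliances and re-create the HA group"),
        ("main appliance", f"dspdrsecondary -m {qm_name}"),
        ("replacement recovery appliance", CRT_PLACEHOLDER),
        ("replacement recovery appliance", WAIT_FOR_SYNC),
    ]


def format_plan(steps: List[Step]) -> str:
    """Render the steps as a numbered checklist, one step per line."""
    return "\n".join(
        f"{number}. [{where}] {action}"
        for number, (where, action) in enumerate(steps, start=1)
    )


if __name__ == "__main__":
    # Hypothetical list of queue managers; replace with your own names.
    for qm in ["terentia1"]:
        print(f"Main site lost, queue manager {qm}:")
        print(format_plan(main_site_loss_plan(qm)))
```

To produce the checklist for the loss of the recovery site, call recovery_site_loss_plan in place of main_site_loss_plan.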