High Availability (HA) considerations

You can set up high availability (HA) for your IBM Cloud Pak® for AIOps deployment by selecting a production (large) sizing profile.

IBM Cloud Pak for AIOps

HA configures your deployment with extra replicas of its services and with affinity rules to provide redundancy. An HA configuration allows your deployment to tolerate upgrades and service interruptions that affect a portion of the infrastructure. The impact of failures in the following domains is minimized:

  • individual pod failure: Red Hat OpenShift recovers individual failed pods.
  • cluster node failure (master or worker): Red Hat OpenShift recovers non-stateful deployments.

For a high availability deployment, you must set the size to large in your IBM Cloud Pak for AIOps installation custom resource when you create your IBM Cloud Pak for AIOps instance. Sizing is configured with the parameter Capacity (Red Hat® OpenShift® Container Platform UI) or spec.size (Red Hat OpenShift CLI), and can be set to small for starter deployments or large for production deployments.
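The sizing rule above can be checked before you apply a custom resource. The following Python sketch is illustrative only: the helper name is_ha_sizing and the dictionary layout are assumptions made for this example, not part of the product. Only the spec.size parameter and its small and large values come from the text above.

```python
# Hypothetical pre-flight check, not part of IBM Cloud Pak for AIOps.
# It treats the installation custom resource as a plain dict and reads
# spec.size, the CLI parameter named in the text above.
VALID_SIZES = {"small", "large"}  # starter and production profiles


def is_ha_sizing(installation: dict) -> bool:
    """Return True if spec.size requests the production (large) profile."""
    size = installation.get("spec", {}).get("size")
    if size not in VALID_SIZES:
        raise ValueError(
            f"spec.size must be one of {sorted(VALID_SIZES)}, got {size!r}"
        )
    return size == "large"


# Illustrative fragment only; a real custom resource has more fields.
installation = {"spec": {"size": "large"}}
print(is_ha_sizing(installation))
```

A check like this could run in a deployment pipeline so that a starter (small) profile is not applied to an environment that requires HA.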

Multi-zone HA

A technology preview of installing IBM Cloud Pak for AIOps on a multi-zone architecture is also available. For more information, see Installing IBM Cloud Pak for AIOps on a multi-zone architecture (multi-zone HA).

Multi-region cold standby disaster recovery

A multi-region cold standby disaster recovery solution for IBM Cloud Pak for AIOps is also available. For more information, see Multi-region cold standby disaster recovery.

Infrastructure Automation

For more information about high availability (HA) considerations for your Infrastructure Automation deployment, see High Availability (HA) considerations.