Complete data center loss

One of the data centers is experiencing a total outage.

Symptoms

One data center is experiencing a total outage and cannot be used to upload or download any files by partners. The other data center can be used to upload and download files, but no files can be processed.

Causes

A complete data center outage can be caused by instances such as power outages, connectivity failures, server crashes, and component node disruptions.

Environment

Windows, UNIX, or Linux.

Diagnosing the problem

  1. Check IBM® Control Center for a red circle representing one of the data centers within the Environmental health widget.
  2. Look for error messages indicating that the following failures have occurred: Cassandra, ZooKeeper, WebSphere® MQ, storage, and Global Mailbox Admin node.

Resolving the problem

No files can be processed in the failed data center, so traffic must be rerouted to the surviving data center until the problem is corrected. Refer to the following topics for information about how to recover from a complete data center loss with minimal impact on your business operations.