Running the system recovery

You can use the service assistant to recover the system.

Before you begin

Ensure that all steps in Preparing the system for recovery are completed and that all nodes are online and in candidate state.
Note: Each individual stage of the recovery procedure can take significant time to complete, depending on the specific configuration.

Procedure

  1. Point your browser to the service IP address of one of the nodes.
    If you do not know the IP address or if it was not configured, configure the service address in the following way:
  2. Point your browser to the service IP address of one of the node canisters.
  3. Log on to the service assistant. For more information, see Accessing the service assistant.
  4. Follow the online instructions to select a node to perform this action and then complete the recovery system procedure.
    1. Click Prepare for Recovery.
      The system searches for the most recent backup file and scans quorum disk. If this step is successful, Preparation Status: Prepare complete is displayed on the bottom of the page.
    2. Verify the date and time of the last quorum time. The time stamp must be less than 30 minutes before the failure. The time stamp format is YYYYMMDD hh:mm, where YYYY is the year, MM is the month, DD is the day, hh is the hour, and mm is the minute.
      Attention: If the time stamp is not less than 30 minutes before the failure, call the support center.
    3. Verify the date and time of the last backup date. The time stamp must be less than 24 hours before the failure. The time stamp format is YYYYMMDD hh:mm, where YYYY is the year, MM is the month, DD is the day, hh is the hour, and mm is the minute.
      Attention: If the time stamp is not less than 24 hours before the failure, call the support center.

      Changes that are made after the time of this backup date might not be restored.

    4. If the quorum time and backup date are correct, click Recover to recreate the system.
  5. Select Recover System from the navigation.

Results

Any one of the following categories of messages might be displayed:
  • T3 successful
    The volumes are back online. Use the final checks to get your environment operational again.
  • T3 recovery completed with errors
    T3 recovery that is completed with errors: One or more of the volumes are offline because fast write data was in the cache. To bring the volumes online, see Recovering from offline volumes by using the CLI for details.
  • T3 failed
    Call the support center. Do not attempt any further action.

Now follow the actions described in What to check after running the system recovery.