Library down recovery in an unexpected power outage

About this task

The following procedure describes the general steps for recovery when the IBM Storage Archive Enterprise Edition experiences unexpected power outages while in operation.
Important: Depending on the circumstances, an unplanned power outage can damage the library and other facilities. This document describes a general recovery procedure if and only when the library is not critically damaged. For example, the library was accidentally powered off from the library GUI. If you have any concerns about critical damage to the library, drives, or tape cartridges, consult with IBM® before you proceed with the recovery procedure.

Procedure

  1. Stop the cluster that is associated to the defunct library.
    # eeadm cluster stop -l <library>

    If the cluster stop does not work, try the procedure again with the -f option.

    # eeadm cluster stop -l <library> -f
  2. Power on the library. Verify that the library is running properly.
    Note: Manual library recovery can be needed depending on its state. Refer to tape library documentation for the recovery procedure.
  3. From the library operator panel or web GUI, move all remaining tapes in the drive to an open home slot.
    Note: If a power outage occurs unexpectedly, tape cartridges can remain within a tape drive that is depending on the state of the library at the time of incident. This state elongates the starting operation of a cluster after the library recovery, depending on system configuration. Verify that all cartridges are removed from the drives before you move to step 4.
  4. Start the cluster associated with the library.
    # eeadm cluster start -l <library>
    Note: Check that the network connection to the library is available if MMM fails to start. After it is connected, stop the cluster in the affected node with # eeadm cluster stop -l <library>, and try restarting the cluster.
  5. Confirm that the node status is Available by using the following command:
    # eeadm node list -l <library>

    If any nodes are in an error state, perform the required procedures to restore them.

  6. Check the status of tape drives that are assigned to the node by using the following command:
    # eeadm drive list -l <library>

    If any nodes are in an error state, perform the required procedures to restore them.

  7. Check the status of tapes that are assigned to the node by using the following command:
    # eeadm tape list -l <library>

    If any nodes are in an error state, perform the required procedures to restore them.

    Note: During an unplanned power outage, tapes can be corrupted. For more information, see Checking, validating, and reconciling a medium in the library.