Use this process to manage an unplanned failover and failback
for the local (production) and remote (recovery) sites.
About this task
Global Mirror provides two-site extended distance remote
copy disaster recovery. When a disaster occurs at the local site,
you must initiate the failover and failback recovery of consistent
data on the remote site. Host activity can resume on the local site
when the host recovers but not before a consistent set of data is
copied to all primary volumes on the local site.
With Global
Mirror, the data that the host writes to the storage system at the
local site is asynchronously shadowed to the storage system at the
remote site. A consistent copy of the data is then automatically maintained
on the storage system at the remote site.
The use of Global
Mirror does not prevent all cases of data loss. During a disaster,
data can be restored only to the last known consistent increment that
was created. Data that is written to the primary site and awaiting
transfer to the secondary site is lost if the two storage systems
can no longer communicate.
The following considerations apply
when you use the Global Mirror recovery process:
- The Global Mirror master might still be running at the local site,
especially if the disaster at the local site is a rolling disaster
in which not all components fail simultaneously.
- The consistent copy at the remote site is not the secondary volume,
but it is the FlashCopy target whose source is the secondary volume.
- The in-progress formation of a consistency group can stop at the
time of the failure.
- You can speed up recovery processing by choosing the Fast
Reverse restore process that is explained later.
Procedure
Complete the following steps to use Global Mirror for an unplanned
failover.
- Enter the Global Mirror session at the local site.
Note: Before the next step, await until the master storage
system completes processing or enters the unrecoverable state. If
the local site failed, proceed to the next step without waiting.
- Initiate a recovery failover on the Global Copy volumes
pair to force stoppage of the volume A to B extended distance relationship.
Then, create a volume B to A Global Copy relationship.
Note: All
B volumes must successfully process the recovery failover request
before you can move to the next step.
- Look at the session properties for volumes B and C to ascertain
the state of the consistency group between the B and C volumes. You
are looking primarily at the FlashCopy relationships and your analysis
determines your next step in the recovery process. Act on your analysis
as follows:
- FlashCopy relationships are not restorable and all the
sequence numbers are equal. No action to the consistency group is
necessary.
- FlashCopy relationships are restorable and all the sequence
numbers are equal. Initiate the FlashCopy Discard changes action
for all the FlashCopy relationships in the consistency group.
- All the FlashCopy sequence numbers are equal and at
least one of the FlashCopy relationships is not restorable. Initiate
the FlashCopy Commit changes action for all
the FlashCopy relationships in the consistency group that are restorable.
- You have a mixed list of FlashCopy relationships; some
are restorable and some are not restorable. The sequence numbers of
restorable relationships have the same sequence number. Relationships
that are not restorable have equal sequence numbers that differ from
restorable relationships. Initiate the FlashCopy Commit
changes action for all the FlashCopy relationships in
the consistency group that are restorable.
- You have a mixed list of FlashCopy relationships; some
are restorable and some are not restorable. The sequence numbers are
not the same within each type of relationship. In this case, the recovery
plan cannot continue because the Global Mirror process is corrupted.
If the Global Mirror process is corrupted, you must recover your data
through your last good backup.
Note: When the state of all the FlashCopy relationships are
known, consider initiating a tape backup of Volume C.
- Initiate the fast reverse restore process from the C volumes
to the B volumes, selecting the Initiate background copy option.
Notes:
- When you initiate the fast reverse restore process, Volume C becomes
unusable.
- I/O must not be allowed to the B or C volumes during the fast
reverse restore process.
- If you do not want to use the fast reverse restore process, use
the Recovering from a disaster without using fast reverse restore
procedure instead of this step.
- Before the next step, await completion of the background
copy. The C to B FlashCopy relationship ends when the background copy
completes.
- Initiate FlashCopy from volume B back to C. Ensure that
you also select the Enable Change Recording and Inhibit
writes to target volume options. This action creates a
backup copy of the consistency group before applications begin to
update the B volumes.
- Start the host I/O at the remote site on the B volumes.
Production can continue on the remote site in this configuration until
you are ready to return production to the local site.
- When you are ready to return production to the local site,
run the recovery failback (B to A) to resynchronize the A volumes.
The application at the remote site remains active.
- After the resynchronize process completes its first pass,
quiesce the applications at your remote site so that the resynchronization
can complete.
- When the resynchronization completes (no out-of-sync tracks),
run recovery failover and failback with Global Copy on Volume A to
re-create the Volume A to Volume B Metro Mirror relationship.
- Start the host I/O at the local site on the A volumes.
- Resume the Global Mirror process.