Group restart phases

Group restart requires scanning the logs of each member to rebuild the SCA or retained lock information. It is recommended that you have an alternate coupling facility on which these vital structures can be automatically rebuilt in the event of a coupling facility failure.

The automatic rebuild that occurs during a coupling facility failure does not require the log scans that group restart does.

During group restart, all restarting members update the SCA or lock structure from information contained in their logs. If you do not issue a START DB2 command for all members of the group, the started members perform group restart on behalf of the non-starting members by reading their logs.

Although one member can perform restart on behalf of the group, you should restart all of the non-quiesced members together, perhaps by using an automated procedure. This shortens the total restart time. Also, because retained locks are held for non-starting members, it is best to start all members of the group for maximum data availability.

Because all members must synchronize at the end of current status rebuild (CSR) and at the end of forward log recovery, the time taken for group restart done in parallel is determined by the member that has the longest CSR and, if the lock structure is lost, by the member that has the longest forward log recovery.

When the members are synchronized after forward log recovery, backward log recovery proceeds in parallel for the started members.

The phases of group restart are generally the same as in a non-data-sharing environment, with the addition of function for group restart. The phases of group restart vary based on whether the SCA, lock structure, or both are lost, and whether information is needed from the logs of inactive members. The following table summarizes the phases, depending on which structure is lost.

Table 1. Summary of group restart phases based on which structure is lost
SCA lost Lock structure lost
Initialization Initialization
CSR (rebuild SCA) CSR (reacquire page set P-locks)
Peer CSR (rebuild SCA) Peer CSR (rebuild page set P-locks)
Forward-log recovery (rebuild locks) Forward-log recovery (rebuild locks) or Peer forward recovery (rebuild locks)
Backward-log recovery Backward-log recovery