Group Recovery Mode

IBM Z System Automation can automate the recovery of z/OS resources, components, and applications to minimize the impact of outages. Group recovery mode applies to Move and Server groups only, because in a Basic group all members should be started. Move and Server groups can enter and leave recovery mode.

Recovery mode is a state in which a set of actions is initiated to restore normal operation after system failures, application failures, or other critical issues in the mainframe environment. For example, if a primary member becomes unavailable, a backup member is started to maintain continuity of service and minimize downtime. System Automation computes the sequence of actions that are required to perform the recovery based on its goals and its knowledge of the interdependencies between the resources.

A group enters recovery mode under any of the following conditions:
  • When a previously selected and available member unexpectedly goes into one of the statuses (Stopping, Problem, HardDown, SoftDown, SysGone).
  • When a member with preference 600 or greater and observed status (Available, Degraded, Starting, Stopping, WasAvailable, SoftDown) becomes (HardDown, Problem) or, for a selected member, (SysGone).
  • When the member's system gets excluded.
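
The entry conditions above can be sketched as a small predicate. This is only an illustrative model, not System Automation code: the `MemberChange` class, its field names, and the `enters_recovery` function are hypothetical, "available" is assumed to mean the observed status Available, and the second condition is read as "HardDown or Problem for any qualifying member, SysGone only for a selected member".

```python
from dataclasses import dataclass

# Status sets copied from the conditions in the text.
FAILURE_STATUSES = {"Stopping", "Problem", "HardDown", "SoftDown", "SysGone"}
ACTIVE_STATUSES = {"Available", "Degraded", "Starting", "Stopping",
                   "WasAvailable", "SoftDown"}


@dataclass
class MemberChange:
    """Hypothetical record of one status change of a group member."""
    preference: int
    selected: bool
    old_status: str
    new_status: str
    unexpected: bool = True        # assumption: change was not requested
    system_excluded: bool = False  # the member's system was excluded


def enters_recovery(change: MemberChange) -> bool:
    """Return True if this change would put the group into recovery mode."""
    # Condition 1: a selected, available member unexpectedly fails.
    if (change.selected and change.unexpected
            and change.old_status == "Available"
            and change.new_status in FAILURE_STATUSES):
        return True
    # Condition 2: a member with preference >= 600 in an active status
    # becomes HardDown or Problem, or (if selected) SysGone.
    if change.preference >= 600 and change.old_status in ACTIVE_STATUSES:
        if change.new_status in {"HardDown", "Problem"}:
            return True
        if change.new_status == "SysGone" and change.selected:
            return True
    # Condition 3: the member's system is excluded.
    return change.system_excluded
```

For instance, under these assumptions a selected member with preference 700 that goes from Starting to SysGone triggers recovery mode, whereas the same change for an unselected member does not.
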

A group remains in recovery mode as long as any of the following conditions applies:
  • The number of selected members is less than the availability target.
  • Any selected member has a preference of 599 or less.
  • Any selected member in (SysGone, HardDown) observed status has a preference from 2000 to 2599.
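
As a rough illustration, the three conditions for remaining in recovery mode can be expressed as a single check. The sketch below is a simplified model rather than the product's implementation: the `Member` class, its fields, and the `remains_in_recovery` function are hypothetical names, and the range "2000 up to 2599" is assumed to be inclusive at both ends.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Member:
    """Hypothetical view of a group member for this sketch."""
    name: str
    preference: int
    observed_status: str
    selected: bool = False


def remains_in_recovery(members: List[Member], availability_target: int) -> bool:
    """Return True while any of the three 'remain' conditions holds."""
    selected = [m for m in members if m.selected]

    # Fewer members are selected than the availability target requires.
    if len(selected) < availability_target:
        return True

    # A selected member has preference 599 or less.
    if any(m.preference <= 599 for m in selected):
        return True

    # A selected member that is SysGone or HardDown has a preference
    # in the range 2000..2599 (assumed inclusive).
    return any(
        m.observed_status in {"SysGone", "HardDown"}
        and 2000 <= m.preference <= 2599
        for m in selected
    )
```

In this model, a group leaves recovery mode only when the function returns False, that is, when enough members are selected and none of the selected members falls under the second or third condition.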