Stretch cluster events

The following table lists the events that are created for the Stretch cluster component.

Table 1. Events for the Stretch cluster component
Event Event
Type
Severity Call Home Details
site_degraded_replication STATE_CHANGE WARNING no Message: Replication issues are reported at site {id}.
Description: Replication issues exist at the site.
Cause: Replication issues are reported at the site.
User Action: Check the health of the site recovery group and take any corrective action, such as issuing the mmrestripefs command.
site_found INFO_ADD_ENTITY INFO no Message: Site {id} was found.
Description: Site is detected.
Cause: N/A
User Action: N/A
site_fs_desc_fail STATE_CHANGE ERROR no Message: Site {id} has no descriptor disks for all defined file systems.
Description: All file systems at the site have failure groups with no descriptor disks.
Cause: No file systems contain descriptor disks at the site.
User Action: Check the health of the file system descriptor disks at the site and ensure that they are working properly on all nodes.
site_fs_desc_ok STATE_CHANGE INFO no Message: Site {id} file system descriptor disk health is OK.
Description: Site file system descriptor disk health is OK.
Cause: N/A
User Action: N/A
site_fs_desc_warn STATE_CHANGE WARNING no Message: Site {id} file system {0} has no descriptor disks in failure groups {1}.
Description: One or more file systems have descriptor disks that are missing in the failure groups.
Cause: File system descriptor disks are missing at the site.
User Action: Check the health of the file system descriptor disks at the site and ensure that they are working properly on all nodes.
site_fs_down STATE_CHANGE ERROR no Message: File system {0} is down or unavailable at site {id}.
Description: File system is unavailable on all nodes at the site.
Cause: File system is unavailable.
User Action: Check the health of the file system at the site and ensure that it is properly mounted on all nodes.
site_fs_ok STATE_CHANGE INFO no Message: Site {id} file system health is OK.
Description: Site file system health is OK.
Cause: N/A
User Action: N/A
site_fs_quorum_fail STATE_CHANGE ERROR no Message: Site {id} file system {0} does not have enough healthy descriptor disks for quorum.
Description: Not enough healthy descriptor disks are found.
Cause: The file system at the site does not have enough healthy descriptor disks for quorum.
User Action: Check the health state of disks, which are declared as descriptor disks for the file system, to prevent potential data loss. For more information, see the Disk issues section in the IBM Storage Scale: Problem Determination Guide.
site_fs_warn STATE_CHANGE WARNING no Message: Site {id} has {0} nodes that face file system issues with {1}.
Description: Many nodes face file system events at the site, which indicate network, resource, or configuration issues.
Cause: Many nodes face file system events at the site.
User Action: Check the health of the file system at the site and ensure that it is properly mounted on all nodes.
site_gpfs_down STATE_CHANGE ERROR no Message: GPFS is unavailable at the site {id}.
Description: GPFS is reported as unavailable at the site.
Cause: GPFS is reported as unavailable at the site.
User Action: Check the health of GPFS services at the site.
site_gpfs_ok STATE_CHANGE INFO no Message: Site {id} GPFS health is OK.
Description: Site GPFS health is OK.
Cause: N/A
User Action: N/A
site_gpfs_warn STATE_CHANGE WARNING no Message: Site {id} has {0} nodes that are facing GPFS unavailable health events.
Description: Many nodes are facing GPFS unavailable events at the site, which might indicate network, resource, or configuration issues.
Cause: Many nodes have reported GPFS unavailable events at the site.
User Action: Check the health of GPFS services at the site.
site_heartbeats_degraded STATE_CHANGE WARNING no Message: Site {id} has {0} nodes with missing heartbeat health events.
Description: Many nodes face missing heartbeat events at the site, which might indicate network, resource, or configuration issues.
Cause: Many nodes face missing heartbeat events at the site.
User Action: Check the health of the site nodes.
site_heartbeats_ok STATE_CHANGE INFO no Message: Site {id} heartbeat is OK.
Description: Site heartbeats are healthy.
Cause: N/A
User Action: N/A
site_missing_heartbeats STATE_CHANGE ERROR no Message: Heartbeats are missing from site {id}.
Description: Heartbeats are missing from the site, which might indicate network, resource, or configuration issues.
Cause: Heartbeats are missing from the site.
User Action: Check the health of the site.
site_ok STATE_CHANGE INFO no Message: Site is OK.
Description: Site is healthy.
Cause: N/A
User Action: N/A
site_quorum_down STATE_CHANGE ERROR no Message: Quorum unavailable is reported by site {id}.
Description: Quorum nodes cannot communicate with each other that is causing GPFS to lose quorum.
Cause: IBM Storage Scale quorum is unavailable.
User Action: Check the health of the GPFS quorum state by using the mmgetstate command and take corrective actions.
site_quorum_error STATE_CHANGE ERROR no Message: Site {id} is experiencing quorum issues with site {0}.
Description: Site nodes are unable to contact the quorum nodes at another site.
Cause: IBM Storage Scale quorum reports warnings. IBM Storage Scale
User Action: Check the health of the GPFS quorum state by using the mmgetstate command and take corrective actions.
site_quorum_ok STATE_CHANGE INFO no Message: Site {id} quorum health is OK.
Description: Site quorum health is OK.
Cause: N/A
User Action: N/A
site_replication_ok STATE_CHANGE INFO no Message: Site {id} replication health is OK.
Description: Site replication health is OK.
Cause: N/A
User Action: N/A
site_vanished INFO_DELETE_ENTITY INFO no Message: Site {id} is no longer configured as a stretch cluster site node.
Description: The site is no longer detected in the output of the mmlsnodeclass command.
Cause: N/A
User Action: N/A