NVMe events

The following table lists the events that are created for the NVMe component.

Table 1. Events for the NVMe component
Columns: Event, Event Type, Severity, Call Home, Details
nvme_found ADD_ENTITY INFO no Message: An NVMe controller {0} was found.
Description: An NVMe controller that was listed in the IBM Storage Scale configuration was detected.
Cause: N/A
User Action: N/A
nvme_lbaformat_not_monitored STATE_CHANGE
HEALTHY
INFO no Message: The NVMe device {0} format is not monitored. This is expected on Fusion.
Description: The LBA format of the NVMe device is not monitored.
Cause: N/A
User Action: N/A
nvme_lbaformat_not_optimal STATE_CHANGE
DEGRADED
WARNING no Message: The NVMe device {0} does not show expected format.
Description: The NVMe device is not formatted with the expected LBA format.
Cause: The mmlsnvmestatus command reports that the LBA format is not optimal.
User Action: Check the NVMe device format for metadata size (expect ms: 0) and relative performance (expect rp: 0).
nvme_lbaformat_ok STATE_CHANGE
HEALTHY
INFO no Message: The NVMe device {0} shows expected format.
Description: The NVMe device is formatted with the expected LBA format.
Cause: N/A
User Action: N/A
nvme_linkstate_not_optimal STATE_CHANGE
DEGRADED
WARNING no Message: The NVMe device {0} reports a link state that does not match the capabilities.
Description: The NVMe device does not have an optimal link state.
Cause: The mmlsnvmestatus command reports that the link state is not optimal.
User Action: Check the PCI link state of the NVMe device.
nvme_linkstate_ok STATE_CHANGE
HEALTHY
INFO no Message: The NVMe device {0} reports a link state that matches the capabilities.
Description: The NVMe device reports an optimal link state.
Cause: N/A
User Action: N/A
nvme_needsservice STATE_CHANGE
DEGRADED
WARNING no Message: The NVMe controller {0} needs service.
Description: The NVMe controller needs service.
Cause: N/A
User Action: N/A
nvme_normal STATE_CHANGE
HEALTHY
INFO no Message: The NVMe controller {0} is OK.
Description: The NVMe controller state is NORMAL.
Cause: N/A
User Action: N/A
nvme_operationalmode_warn STATE_CHANGE
DEGRADED
WARNING no Message: The NVMe controller {0} encountered either internal errors or supercap health issues.
Description: Internal errors or supercap health issues were encountered.
Cause: N/A
User Action: The user is expected to replace the card.
nvme_readonly_mode STATE_CHANGE
DEGRADED
WARNING no Message: NVMe controller {0} is moved to read-only mode.
Description: The device is moved to read-only mode.
Cause: The device is moved to read-only mode when the power source does not allow a backup or when the flash spare block count reaches the threshold at which backup is no longer supported.
User Action: The user is expected to replace the card.
nvme_sparespace_low STATE_CHANGE
DEGRADED
WARNING no Message: The NVMe controller {0} either indicates program-erase cycles greater than 90% or supercap end of life time is less than or equal to 2 months.
Description: Indicates that few vault backups remain until the end of life.
Cause: N/A
User Action: The user is expected to replace the card.
nvme_state_inconsistent STATE_CHANGE
DEGRADED
WARNING no Message: The NVMe controller {0} reports inconsistent state information.
Description: The NVMe controller reports that no service is needed, but overall status has degraded.
Cause: N/A
User Action: N/A
nvme_temperature_warn STATE_CHANGE
DEGRADED
WARNING no Message: NVMe controller {0} reports that the CPU, System, or Supercap temperature is above or below the critical threshold of a component.
Description: The temperature is above or below a critical threshold.
Cause: N/A
User Action: Check the system cooling, for example for blocked airflow or a failed fan.
nvme_vanished DEL_ENTITY INFO no Message: An NVMe controller {0} vanished.
Description: An NVMe controller, which was listed in the IBM Storage Scale configuration, was not detected.
Cause: An NVMe controller, which was previously detected in the IBM Storage Scale configuration, was not found.
User Action: Run the 'nvme' command to verify that all expected NVMe adapters exist.
nvmeof_raw_disk_absent STATE_CHANGE
DEGRADED
WARNING no Message: The NVMeoF disk {id} is expected to be installed, but it is absent.
Description: An NVMeoF disk, which is not exported to the GNR, is absent.
Cause: The hardware monitoring system fails to detect an NVMeoF disk, which is not exported to the GNR.
User Action: Install or replace the reported NVMeoF disk.
nvmeof_raw_disk_enabled STATE_CHANGE
DEGRADED
WARNING no Message: The NVMeoF disk {id} is installed, but not configured.
Description: An NVMeoF disk, which should not be exported to the GNR, is enabled.
Cause: The hardware monitoring system detects an unconfigured NVMeoF disk, which should not be exported to the GNR.
User Action: Configure the reported NVMeoF disk.
nvmeof_raw_disk_failed STATE_CHANGE
DEGRADED
WARNING no Message: The NVMeoF disk {id}, which is not exported to the GNR, reports an unknown failure.
Description: An NVMeoF disk, which is not exported to the GNR, has failed.
Cause: The hardware monitoring system detected an unknown failure of an NVMeoF disk, which is not exported to the GNR.
User Action: Check whether the NVMeoF disk is correctly installed. For more information, see the 'Problem Determination Guide' of the relevant system. Contact IBM support if you need more help.
nvmeof_raw_disk_found ADD_ENTITY INFO no Message: The NVMeoF disk {id}, which is not exported to the GNR, runs as expected.
Description: An NVMeoF disk, which is in the raw mode, is detected.
Cause: N/A
User Action: N/A
nvmeof_raw_disk_ok STATE_CHANGE
HEALTHY
INFO no Message: The NVMeoF disk {id}, which is not exported to the GNR, runs as expected.
Description: An NVMeoF disk, which is in the raw mode, runs as expected.
Cause: N/A
User Action: N/A
nvmeof_raw_disk_smart_failed STATE_CHANGE
DEGRADED
WARNING service ticket Message: The NVMeoF disk {id}, which is not exported to the GNR, should be replaced; otherwise, a malfunction can occur.
Description: The smart assessment of an NVMeoF disk, which is not exported to the GNR, has failed.
Cause: An NVMeoF disk has a failed smart assessment. This disk is not exported to the GNR.
User Action: Replace the disk. Contact IBM support if you need more help.
nvmeof_raw_disk_smart_ok STATE_CHANGE
HEALTHY
INFO no Message: The smart assessment of an NVMeoF disk {id}, which is not exported to the GNR, returns a healthy report.
Description: The smart assessment of an NVMeoF disk in the raw mode returns a healthy report.
Cause: N/A
User Action: N/A
nvmeof_raw_disk_smart_unknown STATE_CHANGE
DEGRADED
WARNING service ticket Message: The system is likely updating the status of an NVMeoF disk {id}. The process should be transient.
Description: No smart information is received from an NVMeoF disk, which is not exported to the GNR.
Cause: An NVMeoF disk does not report a smart assessment. This disk is not exported to the GNR.
User Action: If the hardware monitoring system still cannot get the smart information of the NVMeoF disk after more than 15 minutes, contact IBM support for further assistance.
nvmeof_raw_disk_standby_offline STATE_CHANGE
DEGRADED
WARNING no Message: The NVMeoF disk {id} is set to an offline state by the user.
Description: An NVMeoF disk, which should not be exported to the GNR, is set to an offline state.
Cause: The hardware monitoring system detects an NVMeoF disk that was put to an offline state. This disk was not exported to the GNR before.
User Action: Activate the offline NVMeoF disk.
nvmeof_raw_disk_unavailable_offline STATE_CHANGE
DEGRADED
WARNING no Message: The NVMeoF disk {id} is set offline for an unknown reason.
Description: An NVMeoF disk, which should not be exported to the GNR, is set offline without a known reason.
Cause: The hardware monitoring system detects an offline NVMeoF disk. This disk is not exported to the GNR and is set offline without any known reason.
User Action: Check for possible problems, such as missing power. For more information, see the 'Problem Determination Guide' of the relevant system. Contact IBM support if you need more help.
nvmeof_raw_disk_unknown STATE_CHANGE
DEGRADED
WARNING no Message: The system is likely updating the status of an NVMeoF disk {id}. The process should be transient.
Description: The status of an NVMeoF disk, which is not exported to the GNR, is unknown.
Cause: The hardware monitoring system fails to get the status of an NVMeoF disk, which is not exported to the GNR.
User Action: If the hardware monitoring system still cannot get the NVMeoF disk status after more than 15 minutes, contact IBM support for more information.
nvmeof_raw_disk_vanished DEL_ENTITY INFO no Message: An NVMeoF disk, which is in raw mode and was previously reported, is not detected anymore.
Description: An NVMeoF disk in raw mode vanished.
Cause: An NVMeoF disk, which was previously detected in the IBM Storage Scale configuration, was not found.
User Action: Verify that all expected NVMeoF disks in the raw mode exist in the IBM Storage Scale configuration.
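
The user actions for nvme_lbaformat_not_optimal and nvme_linkstate_not_optimal ask for a manual check of the LBA format (ms: 0, rp: 0) and of the PCI link state. The following is a minimal sketch of how such checks could be scripted. It is not how mmlsnvmestatus works internally. It assumes that the text comes from the nvme-cli `nvme id-ns` command and from `lspci -vv`, and the sample strings, function names, and regular expressions are illustrative assumptions rather than output captured from a real system.

```python
import re

# Illustrative text in the style of `nvme id-ns <device>` (assumed, not real output).
SAMPLE_ID_NS = """\
lbaf  0 : ms:0   lbads:9  rp:0x2
lbaf  1 : ms:0   lbads:12 rp:0 (in use)
"""

# Illustrative text in the style of `lspci -vv` for one device (assumed).
SAMPLE_LSPCI = """\
\t\tLnkCap:\tPort #0, Speed 16GT/s, Width x4, ASPM not supported
\t\tLnkSta:\tSpeed 8GT/s (downgraded), Width x4 (ok)
"""

LBAF_RE = re.compile(
    r"lbaf\s+(\d+)\s*:\s*ms:(\d+)\s+lbads:(\d+)\s+"
    r"rp:(0x[0-9a-fA-F]+|\d+)\s*(\(in use\))?"
)
LINK_RE = re.compile(r"Lnk(Cap|Sta):.*?Speed\s+([\d.]+)GT/s.*?Width\s+x(\d+)")


def in_use_lba_format(id_ns_text):
    """Return the LBA format marked '(in use)', or None if none is found."""
    for line in id_ns_text.splitlines():
        m = LBAF_RE.search(line)
        if m and m.group(5):
            return {
                "index": int(m.group(1)),
                "ms": int(m.group(2)),
                "lbads": int(m.group(3)),
                "rp": int(m.group(4), 0),  # base 0 accepts both "0" and "0x2"
            }
    return None


def lba_format_is_expected(fmt):
    """Apply the check from the user action: ms must be 0 and rp must be 0."""
    return fmt is not None and fmt["ms"] == 0 and fmt["rp"] == 0


def link_state(lspci_text):
    """Map 'Cap' and 'Sta' to (speed in GT/s, lane width) tuples."""
    found = {}
    for line in lspci_text.splitlines():
        m = LINK_RE.search(line)
        if m:
            found[m.group(1)] = (float(m.group(2)), int(m.group(3)))
    return found


def link_matches_capabilities(lspci_text):
    """True only if the negotiated link equals the advertised capability."""
    found = link_state(lspci_text)
    return "Cap" in found and found.get("Cap") == found.get("Sta")


if __name__ == "__main__":
    fmt = in_use_lba_format(SAMPLE_ID_NS)
    print("LBA format as expected:", lba_format_is_expected(fmt))
    print("Link matches capabilities:", link_matches_capabilities(SAMPLE_LSPCI))
```

In this sketch the in-use LBA format passes the check, while the link sample does not, because the negotiated speed is lower than the advertised one. On a real node, the sample strings would be replaced by the captured command output for the device named in the event.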