NVMe events
The following table lists the events that are created for the NVMe component.
Event | Event Type |
Severity | Call Home | Details |
---|---|---|---|---|
nvme_found | ADD_ENTITY | INFO | no | Message: An NVMe controller {0} was found. |
Description: An NVMe controller that was listed in the IBM Storage Scale configuration was detected. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvme_lbaformat_not_monitored | STATE_CHANGE HEALTHY |
INFO | no | Message: The NVMe device {0} format is not monitored. This is expected on Fusion. |
Description: The LBA format of NVMe device is not monitored. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvme_lbaformat_not_optimal | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMe device {0} does not show expected format. |
Description: The LBA format of NVMe device is not formatted as expected. | ||||
Cause: The mmlsnvmestatus command reports that LBA format is not optimal. | ||||
User Action: Check the NVMe device format for metadata size (expect ms: 0) and relative performance (expt rp: 0). | ||||
nvme_lbaformat_ok | STATE_CHANGE HEALTHY |
INFO | no | Message: The NVMe device {0} shows expected format. |
Description: The LBA format of NVMe device is formatted as expected. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvme_linkstate_not_optimal | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMe device {0} reports a link state that does not match the capabilities. |
Description: The NVMe device does not have optimal link state. | ||||
Cause: The mmlsnvmestatus command reports that link state is not optimal. | ||||
User Action: Check PCI link state of the NVMe device. | ||||
nvme_linkstate_ok | STATE_CHANGE HEALTHY |
INFO | no | Message: The NVMe device {0} reports a link state that matches the capabilities. |
Description: The NVMe device reports an optimal link state. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvme_needsservice | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMe controller {0} needs service. |
Description: The NVMe controller needs service. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvme_normal | STATE_CHANGE HEALTHY |
INFO | no | Message: The NVMe controller {0} is OK. |
Description: The NVMe controller state is NORMAL. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvme_operationalmode_warn | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMe controller {0} encountered either internal errors or supercap health issues. |
Description: The internal errors or supercap health issues are encountered. | ||||
Cause: N/A | ||||
User Action: The user is expected to replace the card. | ||||
nvme_readonly_mode | STATE_CHANGE DEGRADED |
WARNING | no | Message: NVMe controller {0} is moved to read-only mode. |
Description: The device is moved to read-only mode. | ||||
Cause: The device is moved to read-only mode when the power source does not allow backup or flash spare block count reaches backup, which is unsupported threshold. | ||||
User Action: The user is expected to replace the card. | ||||
nvme_sparespace_low | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMe controller {0} either indicates program-erase cycles greater than 90% or supercap end of life time is less than or equal to 2 months. |
Description: The remaining vault backups until the end of life. | ||||
Cause: N/A | ||||
User Action: The user is expected to replace the card. | ||||
nvme_state_inconsistent | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMe controller {0} reports inconsistent state information. |
Description: The NVMe controller reports that no service is needed, but overall status has degraded. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvme_temperature_warn | STATE_CHANGE DEGRADED |
WARNING | no | Message: NVMe controller {0} reports whether the CPU, System, or Supercap temperature is greater than or less than the critical threshold of a component. |
Description: Temperature is greater than or less than a critical threshold. | ||||
Cause: N/A | ||||
User Action: Check the system cooling, such as air blocked or fan failed. | ||||
nvme_vanished | DEL_ENTITY | INFO | no | Message: An NVMe controller {0} vanished. |
Description: An NVMe controller, which was listed in the IBM Storage Scale configuration, was not detected. | ||||
Cause: An NVMe controller, which was previously detected in the IBM Storage Scale configuration, was not found. | ||||
User Action: Run the 'nvme' command to verify that all expected NVMe adapters exist. | ||||
nvmeof_raw_disk_absent | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMeoF disk {id} is expected to be installed, but it is absent. |
Description: An NVMeoF disk, which is not be exported to the GNR, is absent. | ||||
Cause: The hardware monitoring system fails to detect an NVMeoF disk, which is not be exported to the GNR. | ||||
User Action: Install or replace the reported NVMeoF disk. | ||||
nvmeof_raw_disk_enabled | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMeoF disk {id} is installed, but not configured. |
Description: An NVMeoF disk, which should not be exported to the GNR, is enabled. | ||||
Cause: The hardware monitoring system detects an unconfigured NVMeoF disk, which should not be exported to the GNR. | ||||
User Action: Configure the reported NVMeoF disk. | ||||
nvmeof_raw_disk_failed | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMeoF disk {id}, which is not exported to the GNR, reports an unknown failure. |
Description: An NVMeoF disk, which is not exported to the GNR, has failed. | ||||
Cause: The hardware monitoring system detected an unknown failure of an NVMeoF disk, which is not exported to the GNR. | ||||
User Action: Check whether the NVMeoF disk is correctly installed. For more information, see the 'Problem Determination Guide' of the relevant system. Contact IBM support if you need more help. | ||||
nvmeof_raw_disk_found | ADD_ENTITY | INFO | no | Message: The NVMeoF disk {id}, which is not exported to the GNR, runs as expected. |
Description: NVMeoF disk, which is in the raw mode, is detected. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvmeof_raw_disk_ok | STATE_CHANGE HEALTHY |
INFO | no | Message: The NVMeoF disk {id}, which is not exported to the GNR, runs as expected. |
Description: NVMeoF disk, which is in the raw mode, runs as expected. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvmeof_raw_disk_smart_failed | STATE_CHANGE DEGRADED |
WARNING | service ticket | Message: The NVMeoF disk {id}, which is not eported to the GNR, should be replaced otherwise a malfunction can occur. |
Description: The smart assessment of an NVMeoF disk, which not exported to the GNR, has failed. | ||||
Cause: An NVMeoF disk has a failed smart assessment. This disk is not exported to the GNR. | ||||
User Action: Replace the disk. Contact IBM support if you need more help. | ||||
nvmeof_raw_disk_smart_ok | STATE_CHANGE HEALTHY |
INFO | no | Message: The smart assessment of an NVMeoF disk {id}, which is not exported to the GNR, returns a healthy report. |
Description: The smart assessment of an NVMeoF disk in the raw mode returns a healthy report. | ||||
Cause: N/A | ||||
User Action: N/A | ||||
nvmeof_raw_disk_smart_unknown | STATE_CHANGE DEGRADED |
WARNING | service ticket | Message: The system is likely updating the status of an NVMeoF disk {id}. The process should be transient. |
Description: No smart information is received from an NVMeoF disk, which not exported to the GNR. | ||||
Cause: An NVMeoF disk does not report a smart assessment. This disk is not exported to the GNR. | ||||
User Action: If the hardware monitoring system continues to fail in getting the smart information of an NVMeoF disk for more than 15 minutes, then contact the IBM support for further assistance. | ||||
nvmeof_raw_disk_standby_offline | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMeoF disk {id} is set to an offline state by the user. |
Description: An NVMeoF disk, which should not be exported to the GNR, is set to an offline state. | ||||
Cause: The hardware monitoring system detects an NVMeoF disk that was put to an offline state. This disk was not exported to the GNR before. | ||||
User Action: Activate the offline NVMeoF disk. | ||||
nvmeof_raw_disk_unavailable_offline | STATE_CHANGE DEGRADED |
WARNING | no | Message: The NVMeoF disk {id} is set offline by an unknown reason. |
Description: An NVMeoF disk, which should not be exported to the GNR, is set offline without known reason. | ||||
Cause: The hardware monitoring system detects an offline NVMeoF disk. This disk is not be exported to the GNR and is set offline without any known reason. | ||||
User Action: Check for possible problems like missing power, etc. For more information, see the 'Problem Determination Guide' of the relevant system. Contact IBM support if you need more help. | ||||
nvmeof_raw_disk_unknown | STATE_CHANGE DEGRADED |
WARNING | no | Message: The system is likely updating the status of an NVMeoF disk {id}. The process should be transient. |
Description: The status of an NVMeoF disk, which is not exported to the GNR, is unknown. | ||||
Cause: The hardware monitoring system fails to get the status of an NVMeoF disk, which not exported to the GNR. | ||||
User Action: If the hardware monitoring continues to fail in getting the NVMeoF disk status for more than 15 minutes, then contact IBM support for more information. | ||||
nvmeof_raw_disk_vanished | DEL_ENTITY | INFO | no | Message: An NVMeoF disk, which is in raw mode and was previously reported, is not detected anymore. |
Description: An NVMeoF disk in raw mode vanished. | ||||
Cause: An NVMeoF disk, which was previously detected in the IBM Storage Scale configuration, was not found. | ||||
User Action: Verify that all expected NVMeoF disks in the raw mode exist in the IBM Storage Scale configuration. |