Refined alert definitions for IBM Storage Insights Pro

Alert definitions for IBM Storage Insights Pro were reviewed and refined to improve clarity and reduce unnecessary notifications. Several alerts were added, updated, or removed from the default policies.

IBM Storage Insights Pro helps you stay ahead of issues with a comprehensive set of alert definitions that are tailored for various storage systems. But just like your environment evolves, so do the alerts that help you monitor it effectively.

To streamline your experience and ensure that alerts remain meaningful, the default alert policies for IBM Storage Virtualize, IBM Storage FlashSystem, IBM SAN Volume Controller, and DS8000 systems were carefully assessed. Based on experts analysis, specific alert definitions were added, updated, and removed, without affecting your ability to track critical health and performance indicators.

Table 1. Details of the alert definition updates for the default alert policy of IBM Storage Virtualize, IBM Storage FlashSystem, and IBM SAN Volume Controller systems
New alert definitions Updated alert definitions Removed alert definitions
Previous alert definition Changed alert definition
  • Critical alert when the volume group replication state is suspended (applies to both High Availability (HA) and Asynchronous replications)
  • Critical alert when the volume group replication state is disconnected (applies to both High Availability (HA) and Asynchronous replications)
  • Deleted Volume Group
  • Deleted Storage Partition
  • Critical alert when the enclosure power supply status is warning or error (Available only for IBM Storage Virtualize systems connected through Call Home with cloud services)
  • Pool non-DRP Available Capacity is 15% or less
  • Port Receive Bandwidth Percentage >= 75%
  • Standard Pool Available Capacity is 15% or less
  • Port Receive Bandwidth Percentage >= 80%
  • Volume Group Snapshot Written Capacity is greater than 90%
  • Peak Back-end Write Response Time >= 10000 ms (Severity: Warning)*
  • Peak Back-end Write Response Time >= 30000 ms (Severity : Critical)*
  • Write-cache Delay Percentage >= 10%
  • Write-cache Delay Percentage >= 3%
  • Max Write Cache Fullness >= 99%*
  • Deleted Node
  • Deleted IP Port
  • Deleted FC Port
  • Deleted Drive

* Alert definitions not removed from the default alert policy of IBM SAN Volume Controller systems.

Table 2. Details of the alert definition updates for the default alert policy of DS8000 storage systems
New alert definitions Removed alert definitions
  • Warning alert when volume's safeguarded used capacity is >= 99%
  • Warning alert when Pools Shortfall >= 1 % and Pools Used Capacity >= 90 %
  • Informational alert when the RAID Arrays Disk Read Response Time >= 1.5 ms/op and RAID Arrays Disk Utilization Percentage >= 60 %
  • Informational alert when FC Ports Zero Receive Buffer Credit Percentage >= 2 %
  • Informational alert when FC Ports Out of Order Data Rate >= 0.02 cnt/s
  • Informational alert when FC Ports Link Failure Rate >= 0.02 cnt/s
  • Informational alert when FC Ports CRC Error Rate >= 0.02 cnt/s
  • Cache Holding Time <= 30 sec
  • Cache Holding Time <= 60 sec
  • Credit Recovery Link Reset Rate >= 0.02 cnt/s
  • Credit Recovery Link Reset Rate >= 0.02 cnt/s
  • Credit Recovery Link Reset Rate >= 0.04 cnt/s
  • Credit Recovery Link Reset Rate >= 0.04 cnt/s
  • Port Receive Bandwidth Percentage >= 75%
  • Port Receive Bandwidth Percentage >= 85%
  • Port Send Bandwidth Percentage >= 75%
  • Port Send Bandwidth Percentage >= 85%
  • Write-cache Delay Percentage >= 10%
  • Write-cache Delay Percentage >= 3%
  • Deleted Node
  • Deleted IP Port
  • Deleted FC Port
  • Deleted Drive
Note: All the new alert definitions are available for the DS8000 storage systems with the firmware version 89 or later.

Impact of changes

As the default alert policy definitions for IBM Storage Virtualize, IBM Storage FlashSystem, IBM SAN Volume Controller, and DS8000 systems have been updated, you can manually add any removed, updated, and newly introduced alert definitions to your custom alert policies, if needed.