Metrics collected by the monitoring infrastructure
The following table shows the metrics collected by the monitoring infrastructure provided in IBM Hyper Protect Virtual Servers.
Table 1. Monitoring metrics collected
No. | Plugin Name | Metric Name | Labels | Description |
---|---|---|---|---|
01 | collectd | collectd_collectd_cache_size | collectd="cache", instance | The number of elements in the metric cache. |
02 | collectd | collectd_collectd_derive_total | collectd="write_queue", type="dropped", instance | The number of metrics dropped due to a queue length limitation. |
03 | collectd | collectd_collectd_queue_length | collectd="write_queue", instance | The number of metrics currently in the write queue. |
04 | cpu | collectd_cpu_percent | cpu="idle", instance | Percentage of time that the CPU or CPUs were idle and the system did not have an outstanding disk I/O request. |
05 | cpu | collectd_cpu_percent | cpu="interrupt", instance | Percentage of time spent by the CPU or CPUs servicing hardware interrupts. |
06 | cpu | collectd_cpu_percent | cpu="nice", instance | Percentage of time spent by the CPU or CPUs running niced tasks, that is, user tasks with below-normal priority. |
07 | cpu | collectd_cpu_percent | cpu="softirq", instance | Percentage of time spent by the CPU or CPUs servicing software interrupts. |
08 | cpu | collectd_cpu_percent | cpu="steal", instance | Percentage of time spent in involuntary wait by the virtual CPU or CPUs while the hypervisor was servicing another virtual processor. |
09 | cpu | collectd_cpu_percent | cpu="system", instance | Percentage of CPU utilization while the CPU is running kernel code. This includes device drivers and kernel modules. |
10 | cpu | collectd_cpu_percent | cpu="user", instance | Percentage of CPU utilization while the CPU is running code in user mode. This includes your application code. |
11 | cpu | collectd_cpu_percent | cpu="wait", instance | Percentage of time that the CPU or CPUs were waiting for an I/O operation to complete and could not be used for anything else. |
12 | df | collectd_df_percent_bytes | df=<MountPoint>, type="free", instance | Free disk space on the file system, expressed as a percentage. Mount points: root, /hostfs/var/lib/quotagroups/lv_data_pool/appliance_data |
13 | df | collectd_df_percent_bytes | df=<MountPoint>, type="reserved", instance | Reserved disk space on the file system, expressed as a percentage. Mount points: root, /hostfs/var/lib/quotagroups/lv_data_pool/appliance_data |
14 | df | collectd_df_percent_bytes | df=<MountPoint>, type="used", instance | Used disk space on the file system, expressed as a percentage. Mount points: root, /hostfs/var/lib/quotagroups/lv_data_pool/appliance_data |
15 | load | collectd_load_longterm | load="relative", instance | The average system load over the last 15 minutes. |
16 | load | collectd_load_midterm | load="relative", instance | The average system load over the last 5 minutes. |
17 | load | collectd_load_shortterm | load="relative", instance | The average system load over the last 1 minute. |
18 | memory | collectd_memory | memory="buffered", instance | Amount of memory used for buffering, mostly for I/O operations. |
19 | memory | collectd_memory | memory="cached", instance | Amount of memory used for caching disk data for reads, memory-mapped files, or tmpfs data. |
20 | memory | collectd_memory | memory="free", instance | Total amount of unused memory. |
21 | memory | collectd_memory | memory="slab_recl", instance | Amount of reclaimable memory used for slab kernel allocations. |
22 | memory | collectd_memory | memory="slab_unrecl", instance | Amount of unreclaimable memory used for slab kernel allocations. |
23 | memory | collectd_memory | memory="used", instance | Total amount of memory used. |
24 | memory | collectd_memory_percent | memory="buffered", instance | Percentage of memory used for buffering, mostly for I/O operations. |
25 | memory | collectd_memory_percent | memory="cached", instance | Percentage of memory used for caching disk data for reads, memory-mapped files, or tmpfs data. |
26 | memory | collectd_memory_percent | memory="free", instance | Percentage of memory that is unused. |
27 | memory | collectd_memory_percent | memory="slab_recl", instance | Percentage of memory that is reclaimable and used for slab kernel allocations. |
28 | memory | collectd_memory_percent | memory="slab_unrecl", instance | Percentage of memory that is unreclaimable and used for slab kernel allocations. |
29 | memory | collectd_memory_percent | memory="used", instance | Percentage of memory that is used. |
30 | uptime | collectd_uptime | instance | Seconds since system boot. |
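
The metric names and labels in Table 1 can be combined to pick out a single series. The following Python sketch assumes, as a guess based on the label syntax in the table, that the metrics are available as Prometheus-style text lines. The sample values, the instance name `vs01`, and the helper names `parse_samples` and `select` are all invented for illustration, and the parser is deliberately simplified (it does not handle escaped quotes or closing braces inside label values).

```python
import re

# Hypothetical sample in a Prometheus-style text format. The values and
# the instance name "vs01" are invented for illustration only.
SAMPLE = """\
collectd_cpu_percent{cpu="idle",instance="vs01"} 91.5
collectd_cpu_percent{cpu="user",instance="vs01"} 5.0
collectd_cpu_percent{cpu="system",instance="vs01"} 2.5
collectd_cpu_percent{cpu="wait",instance="vs01"} 1.0
collectd_df_percent_bytes{df="root",type="used",instance="vs01"} 42.0
collectd_uptime{instance="vs01"} 86400
"""

# name{label="value",...} value [optional timestamp]
LINE_RE = re.compile(
    r'^(?P<name>[A-Za-z_:][A-Za-z0-9_:]*)'
    r'(?:\{(?P<labels>[^}]*)\})?'
    r'\s+(?P<value>\S+)(?:\s+\d+)?$'
)
LABEL_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_samples(text):
    """Return a list of (metric_name, labels_dict, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        match = LINE_RE.match(line)
        if match is None:
            continue  # skip lines this simplified parser cannot read
        labels = dict(LABEL_RE.findall(match.group("labels") or ""))
        samples.append((match.group("name"), labels, float(match.group("value"))))
    return samples


def select(samples, name, **labels):
    """Return the values of all samples matching the name and every given label."""
    return [
        value
        for sample_name, sample_labels, value in samples
        if sample_name == name
        and all(sample_labels.get(k) == v for k, v in labels.items())
    ]


samples = parse_samples(SAMPLE)

# Row 04 of Table 1: idle CPU percentage. Busy time is its complement.
idle = select(samples, "collectd_cpu_percent", cpu="idle")[0]
busy = 100.0 - idle

# Row 14 of Table 1: used disk space on the root mount point.
root_used = select(samples, "collectd_df_percent_bytes", df="root", type="used")

print(f"CPU busy: {busy}%")
print(f"Root file system used: {root_used[0]}%")
```

Selecting by both the metric name and the plugin-specific label matters because several rows in the table share one metric name (for example, all eight CPU states use `collectd_cpu_percent`) and differ only in the label value.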