Appliance software alerts

This topic presents a list of default software related issues and events for Integrated Analytics System.

Software related issues

Issues are stateful alerts, they are ongoing until the problem is fixed.

Table 1. Software issues
Group Type Reason Code Title Severity
SW SW_SERVICE_REQUESTED 151 Cannot activate tuned.service MAJOR
SW SW_SERVICE_REQUESTED 154 Soft power off action for node failed, could not recover node MAJOR
SW SW_SERVICE_REQUESTED 155 RAID5 array needs attention WARNING
SW SW_SERVICE_REQUESTED 156 Mounts count exceeds threshold WARNING, MAJOR
SW SW_SERVICE_REQUESTED 198 Test SW alert INFORMATION
SW HW_NEEDS_ATTENTION 205 Low fibre channel path count MAJOR
SW HW_NEEDS_ATTENTION 207 SEL is full and cannot be cleaned MINOR
SW DB_SERVICE_REQUESTED 251 Invalid log path INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 351 Database availability issue INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 352 Physical memory usage threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 353 Virtual memory usage threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 354 File system utilization threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 355 Maximum log space exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 356 Table space container utilization threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 357 Statement performance issue INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 399 Other database issue INFORMATION, WARNING, MAJOR
SW SW_NEEDS_ATTENTION 401 GPFS node failed to start MAJOR
SW SW_NEEDS_ATTENTION 402 GPFS nsd failed to start MINOR
SW SW_NEEDS_ATTENTION 403 Application container cannot be started on a node MAJOR
SW SW_NEEDS_ATTENTION 404 GPFS local partition failed to be mounted MAJOR, CRITICAL
SW SW_NEEDS_ATTENTION 405 GPFS filesystem failed to be mounted MAJOR, CRITICAL
SW SW_NEEDS_ATTENTION 406 NTP Daemon not sync on external WARNING
SW SW_NEEDS_ATTENTION 407 Application component is not healthy CRITICAL, MAJOR
SW SW_NEEDS_ATTENTION 408 NTP Daemon is down WARNING
SW SW_NEEDS_ATTENTION 409 Unable to start Call Home Daemon MAJOR
SW SW_NEEDS_ATTENTION 410 Unable to stop Call Home Daemon MINOR
SW SW_NEEDS_ATTENTION 411 Heavy swap usage CRITICAL
SW SW_NEEDS_ATTENTION 412 Node time not in sync MAJOR
SW SW_NEEDS_ATTENTION 413 Directory service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 414 Security daemon cannot be started MAJOR
SW SW_NEEDS_ATTENTION 415 Docker service failed MAJOR
SW SW_NEEDS_ATTENTION 416 Unable to start WebConsole container MAJOR
SW SW_NEEDS_ATTENTION 417 Unable to stop WebConsole container MAJOR
SW SW_NEEDS_ATTENTION 418 WebConsole is down MAJOR
SW SW_NEEDS_ATTENTION 419 Grow on demand limit not satisfied CRITICAL
SW SW_NEEDS_ATTENTION 421 Unable to start Lift container MAJOR
SW SW_NEEDS_ATTENTION 422 Unable to stop Lift container MAJOR
SW SW_NEEDS_ATTENTION 423 Lift down MAJOR
SW SW_NEEDS_ATTENTION 424 Token and auth service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 425 DR management service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 426 Firewall service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 427 Unable to start IDAA Gateway container MAJOR
SW SW_NEEDS_ATTENTION 428 Unable to stop IDAA Gateway container MAJOR
SW SW_NEEDS_ATTENTION 429 IDAA Gateway down MAJOR
SW SW_NEEDS_ATTENTION 432 Application VMs cannot be started on a node
SW SW_NEEDS_ATTENTION 433 Primary SKLM proxy service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 434 Secondary SKLM proxy service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 435 Gateway not in routing table WARNING
SW SW_NEEDS_ATTENTION 436 Failed to collect status from resource manager MAJOR
SW SW_NEEDS_ATTENTION 437 Duplicate containers running MAJOR
SW SW_NEEDS_ATTENTION 438 Service cannot be (re)started MAJOR
SW SW_NEEDS_ATTENTION 439 Openshift node is not ready
SW SW_NEEDS_ATTENTION 440 Openshift service is not ready
SW SW_NEEDS_ATTENTION 441 Incorrect node kernel params
SW SW_NEEDS_ATTENTION 442 Timezones mismatch between nodes WARNING
SW SW_NEEDS_ATTENTION 443 Cannot mount NFS partition on virtual machine
SW SW_NEEDS_ATTENTION 444 Unable to start VDB container
SW SW_NEEDS_ATTENTION 445 Unable to stop VDB container
SW SW_NEEDS_ATTENTION 446 ICP4D service is not ready
SW SW_NEEDS_ATTENTION 447 GlusterFS component is not healthy MAJOR
SW SW_NEEDS_ATTENTION 448 NFS partition requires container restart WARNING
SW SW_NEEDS_ATTENTION 449 NFS server does not respond WARNING
SW SW_NEEDS_ATTENTION 450 NFS partition cannot be mounted properly WARNING
SW SW_NEEDS_ATTENTION 451 Webconsole service is not ready WARNING
SW SW_NEEDS_ATTENTION 452 NFS partition cannot be exported WARNING
SW SW_NEEDS_ATTENTION 453 Unreacheable or missing device for NSD WARNING
SW SW_NEEDS_ATTENTION 454 Failed to restart a process with high number of mounts MAJOR
SW SW_NEEDS_ATTENTION 455 Failed to cordon a node
SW SW_NEEDS_ATTENTION 456 Failed to uncordon a node
SW SW_NEEDS_ATTENTION 457 GPFS node is not CCR-based
SW FLOATING_IP_ISSUE 601 Unable to bring-up floating IP MAJOR
SW FLOATING_IP_ISSUE 602 Unable to bring-down floating IP MAJOR
SW FLOATING_IP_ISSUE 603 Unable to bring-up floating IP - cannot connect to server MAJOR
SW FLOATING_IP_ISSUE 604 Unable to bring-down floating IP - cannot connect to server MAJOR
SW VM_FLOATING_IP_ISSUE 605 Unable to bring-up floating IP
SW VM_FLOATING_IP_ISSUE 606 Unable to bring-down floating IP
SW APPLIANCE_APPLICATION_DOWN 701 Appliance application went down due to disabled node CRITICAL
SW APPLIANCE_APPLICATION_DOWN 703 Appliance application can't start CRITICAL
SW APPLIANCE_APPLICATION_DOWN 704 Appliance application went down CRITICAL
SW STORAGE_UTILIZATION 901 Storage utilization above threshold CRITICAL
SW STORAGE_UTILIZATION 902 Present snapshots reduce available storage WARNING
SW SW_NEEDS_ATTENTION 903 Certificate is about to expire WARNING
SW SW_NEEDS_ATTENTION 904 Certificate is expired MAJOR

Software events

Events are stateless alerts, that is, they are related to a point-in-time event.
Table 2. Software events
Group Type Reason Code Title Severity
SW SW_SERVICE_REQUESTED 152 FODC dump collected INFORMATION
SW SW_SERVICE_REQUESTED 153 Kernel panic(s) occurred WARNING
SW ACTION_FAILED 301 Action to restore a GPFS component failed WARNING
SW ACTION_FAILED 302 Container start-up action failed MAJOR
SW ACTION_FAILED 303 Container stop action failed MINOR
SW ACTION_FAILED 304 Action to restore NTP synchronization failed WARNING
SW ACTION_FAILED 305 Failed to enable a node WARNING
SW ACTION_FAILED 306 NTP cannot sync with external MINOR
SW ACTION_FAILED 307 Application disabling failed MAJOR
SW ACTION_FAILED 308 Application enabling failed MAJOR
SW ACTION_FAILED 309 WebConsole container stop action failed MAJOR
SW ACTION_FAILED 310 VM start-up action failed
SW ACTION_FAILED 311 VM stop action failed
SW ACTION_FAILED 312 Failed to enable a storage drive
SW ACTION_FAILED 313 Failed to disable a storage drive
SW ACTION_FAILED 314 Soft power off failed for node WARNING
SW ACTION_FAILED 315 Failed to set node personality
SW ACTION_FAILED 316 Failed to mount external NFS partition WARNING
SW ACTION_FAILED 317 Application initialization has failed MAJOR
SW ACTION_FAILED 318 Failed to reduce process's mounts number MAJOR
SW ACTION_FAILED 319 Failed to disable a node
SW SW_NEEDS_ATTENTION 420 Detection of change GoD MINOR
SW SW_NEEDS_ATTENTION 458 Node was disabled but vm did not reported shutdown
SW STARTUP_FAILED 501 Start-up failed due to container start error CRITICAL
SW STARTUP_FAILED 502 Application start-up timeout CRITICAL
SW STARTUP_FAILED 503 Start-up timeout on waiting for healthy nodes CRITICAL
SW STARTUP_FAILED 504 Start-up failed (db2 HA failed) CRITICAL
SW STARTUP_FAILED 505 Application start-up aborted due to FSN thermal issues CRITICAL
SW STARTUP_FAILED 506 Application start-up aborted due to VM stop error
SW APPLIANCE_APPLICATION_DOWN 702 Appliance application went down due to FSN thermal issues CRITICAL
SW APPLIANCE_EVENT 801 Node disabled by user INFORMATION
SW APPLIANCE_EVENT 802 Node disabled by system INFORMATION
SW APPLIANCE_EVENT 803 Node enabled by user INFORMATION
SW APPLIANCE_EVENT 804 Node enabled by system INFORMATION
SW APPLIANCE_EVENT 805 Node rebalance requested INFORMATION
SW APPLIANCE_EVENT 806 Node init requested INFORMATION
SW APPLIANCE_EVENT 807 Application start requested INFORMATION
SW APPLIANCE_EVENT 808 Application stop requested INFORMATION
SW APPLIANCE_EVENT 809 Unreachable node restart requested INFORMATION
SW APPLIANCE_EVENT 810 Docker service restarted INFORMATION
SW APPLIANCE_EVENT 811 NTPD service recovered INFORMATION
SW APPLIANCE_EVENT 812 GPFS issue recovered INFORMATION
SW APPLIANCE_EVENT 813 Application container restarted INFORMATION
SW APPLIANCE_EVENT 814 Application recovered INFORMATION
SW APPLIANCE_EVENT 815 FC port retrained INFORMATION
SW APPLIANCE_EVENT 816 Directory service restarted INFORMATION
SW APPLIANCE_EVENT 817 Security daemon restarted INFORMATION
SW APPLIANCE_EVENT 818 Node time synchronized INFORMATION
SW APPLIANCE_EVENT 819 WebConsole container restarted INFORMATION
SW APPLIANCE_EVENT 820 Application disabled by user INFORMATION
SW APPLIANCE_EVENT 821 Application enabled by user INFORMATION
SW APPLIANCE_EVENT 822 CPU clock tuned successfully INFORMATION
SW APPLIANCE_EVENT 823 Successfully activated tuned.service INFORMATION
SW APPLIANCE_EVENT 824 Lift container restarted INFORMATION
SW APPLIANCE_EVENT 825 Firewall service restarted INFORMATION
SW APPLIANCE_EVENT 826 Application container(s) restart requested by db2 HA INFORMATION
SW APPLIANCE_EVENT 827 Node suspended INFORMATION
SW APPLIANCE_EVENT 828 Node resumed INFORMATION
SW APPLIANCE_EVENT 829 Node is ready to be resumed INFORMATION
SW APPLIANCE_EVENT 830 CPU configuration updated INFORMATION
SW APPLIANCE_EVENT 831 Db2 crash recovery in progress INFORMATION
SW APPLIANCE_EVENT 832 Maintenance mode enabled INFORMATION
SW APPLIANCE_EVENT 833 Maintenance mode disabled INFORMATION
SW APPLIANCE_EVENT 834 Node restart requested due to docker issues INFORMATION
SW APPLIANCE_EVENT 835 Application VM restarted
SW APPLIANCE_EVENT 836 Application container(s) restart requested by user INFORMATION
SW APPLIANCE_EVENT 837 Storage drive disabled by user
SW APPLIANCE_EVENT 838 Storage drive disabled by system
SW APPLIANCE_EVENT 839 Storage drive enabled by user
SW APPLIANCE_EVENT 840 Storage drive enabled by system
SW APPLIANCE_EVENT 841 Node personality changed by user
SW APPLIANCE_EVENT 842 Successfully fixed kernel parameters
SW APPLIANCE_EVENT 843 YDB disk activated
SW APPLIANCE_EVENT 844 YDB disk deactivated
SW APPLIANCE_EVENT 845 YDB node activation requested
SW APPLIANCE_EVENT 846 YDB node deactivation requested
SW APPLIANCE_EVENT 847 YDB node rebalance requested
SW APPLIANCE_EVENT 848 Application initialization has succeeded INFORMATION
SW APPLIANCE_EVENT 852 SEL cleaned successfully
SW APPLIANCE_EVENT 853 Storage drive disabling requested by user INFORMATION
SW APPLIANCE_EVENT 854 Storage drive enabling requested by user INFORMATION
SW APPLIANCE_EVENT 855 Appliance upgrade enabled
SW APPLIANCE_EVENT 856 Appliance upgrade disabled
SW APPLIANCE_EVENT 857 Application stop action timed out