Appliance software alerts

This topic presents a list of default software related issues and events for IBM® Integrated Analytics System.

Software related issues

Issues are stateful alerts, they are ongoing until the problem is fixed.

Table 1. Software issues
Group Type Reason Code Title Severity
SW SW_SERVICE_REQUESTED 151 Cannot activate tuned.service MAJOR
SW SW_SERVICE_REQUESTED 154 Soft power off action for node failed, could not recover node MAJOR
SW SW_SERVICE_REQUESTED 198 Test SW alert INFORMATION
SW DB_SERVICE_REQUESTED 251 Invalid log path INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 351 Database availability issue INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 352 Physical memory usage threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 353 Virtual memory usage threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 354 File system utilization threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 355 Maximum log space exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 356 Table space container utilization threshold exceeded INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 357 Statement performance issue INFORMATION, WARNING, MAJOR
SW DB_NEEDS_ATTENTION 399 Other database issue INFORMATION, WARNING, MAJOR
SW SW_NEEDS_ATTENTION 401 GPFS node failed to start MAJOR
SW SW_NEEDS_ATTENTION 402 GPFS nsd failed to start MINOR
SW SW_NEEDS_ATTENTION 403 Application container cannot be started on a node MAJOR
SW SW_NEEDS_ATTENTION 404 GPFS local partition failed to be mounted MAJOR, CRITICAL
SW SW_NEEDS_ATTENTION 405 GPFS filesystem failed to be mounted MAJOR, CRITICAL
SW SW_NEEDS_ATTENTION 406 Time on node is not synchronized WARNING
SW SW_NEEDS_ATTENTION 407 Appliance application component is not healthy CRITICAL, MAJOR
SW SW_NEEDS_ATTENTION 408 The NTP daemon is down WARNING
SW SW_NEEDS_ATTENTION 409 Unable to start Call Home Daemon MAJOR
SW SW_NEEDS_ATTENTION 410 Unable to stop Call Home Daemon MINOR
SW SW_NEEDS_ATTENTION 411 Heavy swap usage CRITICAL
SW SW_NEEDS_ATTENTION 412 Node time not in sync MAJOR
SW SW_NEEDS_ATTENTION 413 Directory service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 414 Security daemon cannot be started MAJOR
SW SW_NEEDS_ATTENTION 415 Docker service failed MAJOR
SW SW_NEEDS_ATTENTION 416 Unable to start console container MAJOR
SW SW_NEEDS_ATTENTION 417 Unable to stop console container MAJOR
SW SW_NEEDS_ATTENTION 418 Console is down MAJOR
SW SW_NEEDS_ATTENTION 419 Grow on demand limit not satisfied CRITICAL
SW SW_NEEDS_ATTENTION 421 Unable to start Lift container MAJOR
SW SW_NEEDS_ATTENTION 422 Unable to stop Lift container MAJOR
SW SW_NEEDS_ATTENTION 423 Lift down MAJOR
SW SW_NEEDS_ATTENTION 424 Token and auth service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 425 DR management service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 426 IPv4 firewall with iptables cannot be started MAJOR
SW SW_NEEDS_ATTENTION 427 Unable to start IDAA Gateway container MAJOR
SW SW_NEEDS_ATTENTION 428 Unable to stop IDAA Gateway container MAJOR
SW SW_NEEDS_ATTENTION 429 IDA Gateway down MAJOR
SW SW_NEEDS_ATTENTION 433 Primary SKLM proxy service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 434 Secondary SKLM proxy service cannot be started MAJOR
SW SW_NEEDS_ATTENTION 435 Gateway not in routing table WARNING
SW SW_NEEDS_ATTENTION 436 Failed to collect status from resource manager MAJOR
SW SW_NEEDS_ATTENTION 437 Duplicate containers running MAJOR
SW SW_NEEDS_ATTENTION 438 Service cannot be (re)started MAJOR
SW SW_NEEDS_ATTENTION 442 Timezones mismatch between nodes WARNING
SW FLOATING_IP_ISSUE 601 Unable to bring-up floating IP MAJOR
SW FLOATING_IP_ISSUE 602 Unable to bring-down floating IP MAJOR
SW FLOATING_IP_ISSUE 603 Unable to bring-up floating IP – cannot connect to server MAJOR
SW FLOATING_IP_ISSUE 604 Unable to bring-down floating IP - cannot connect to server MAJOR
SW APPLIANCE_APPLICATION_DOWN 701 Appliance application went down due to disabled node CRITICAL
SW APPLIANCE_APPLICATION_DOWN 703 Appliance application can't start CRITICAL
SW APPLIANCE_APPLICATION_DOWN 704 Appliance application went down (db2 HA) CRITICAL
SW STORAGE_UTILIZATION 901 Storage utilization above threshold CRITICAL

Software events

Events are stateless alerts, that is, they are related to a point-in-time event.
Table 2. Software events
Group Type Reason Code Title Severity
SW SW_SERVICE_REQUESTED 152 FODC dump collected INFORMATION
SW SW_SERVICE_REQUESTED 153 Kernel panic(s) occurred WARNING
SW ACTION_FAILED 301 Action to restore a GPFS component failed WARNING
SW ACTION_FAILED 302 Container start-up action failed MAJOR
SW ACTION_FAILED 303 Container stop action failed MINOR
SW ACTION_FAILED 304 Action to restore NTP synchronization failed WARNING
SW ACTION_FAILED 305 Failed to enable a node WARNING
SW ACTION_FAILED 306 NTP cannot sync with external
Note: This event is currently not in use.
MINOR
SW ACTION_FAILED 307 Application disabling failed MAJOR
SW ACTION_FAILED 308 Application enabling failed MAJOR
SW ACTION_FAILED 309 WebConsole container stop action failed MAJOR
SW ACTION_FAILED 314 Soft power off failed for node WARNING
SW SW_NEEDS_ATTENTION 420 Detection of change GoD MINOR
SW STARTUP_FAILED 501 Start-up failed due to container start error CRITICAL
SW STARTUP_FAILED 502 Application start-up timeout CRITICAL
SW STARTUP_FAILED 503 Start-up timeout on waiting for healthy nodes CRITICAL
SW STARTUP_FAILED 504 Start-up failed (Db2 Warehouse HA failed) CRITICAL
SW STARTUP_FAILED 505 Application start-up aborted due to FSN thermal issues CRITICAL
SW APPLIANCE_APPLICATION_DOWN 702 Appliance application went down due to FSN thermal issues CRITICAL
SW APPLIANCE_EVENT 801 Node disabled by user INFORMATION
SW APPLIANCE_EVENT 802 Node disabled by system INFORMATION
SW APPLIANCE_EVENT 803 Node enabled by user INFORMATION
SW APPLIANCE_EVENT 804 Node enabled by system INFORMATION
SW APPLIANCE_EVENT 805 Node rebalance requested INFORMATION
SW APPLIANCE_EVENT 806 Node init requested INFORMATION
SW APPLIANCE_EVENT 807 Application start requested INFORMATION
SW APPLIANCE_EVENT 808 Application stop requested INFORMATION
SW APPLIANCE_EVENT 809 Unreachable node restart requested INFORMATION
SW APPLIANCE_EVENT 810 Docker service restart INFORMATION
SW APPLIANCE_EVENT 811 NTPD service restart INFORMATION
SW APPLIANCE_EVENT 812 GPFS issue recovered INFORMATION
SW APPLIANCE_EVENT 813 Application container restarted INFORMATION
SW APPLIANCE_EVENT 814 Application recovered by Db2 Warehouse HA INFORMATION
SW APPLIANCE_EVENT 815 FC port retrained INFORMATION
SW APPLIANCE_EVENT 816 Directory service restarted INFORMATION
SW APPLIANCE_EVENT 817 Security daemon restarted INFORMATION
SW APPLIANCE_EVENT 818 Node time synchronized INFORMATION
SW APPLIANCE_EVENT 819 Console container restarted INFORMATION
SW APPLIANCE_EVENT 820 Application disabled by user INFORMATION
SW APPLIANCE_EVENT 821 Application enabled by user INFORMATION
SW APPLIANCE_EVENT 822 CPU clock tuned successfully INFORMATION
SW APPLIANCE_EVENT 823 Successfully activated tuned.service INFORMATION
SW APPLIANCE_EVENT 824 Lift container restarted INFORMATION
SW APPLIANCE_EVENT 825 IPv4 firewall with iptables restarted INFORMATION
SW APPLIANCE_EVENT 827 Node suspended INFORMATION
SW APPLIANCE_EVENT 828 Node resumed INFORMATION
SW APPLIANCE_EVENT 829 Node is ready to be resumed INFORMATION
SW APPLIANCE_EVENT 830 CPU configuration updated INFORMATION
SW APPLIANCE_EVENT 831 Db2 crash recovery in progress INFORMATION
SW APPLIANCE_EVENT 832 Maintenance mode enabled INFORMATION
SW APPLIANCE_EVENT 833 Maintenance mode disabled INFORMATION
SW APPLIANCE_EVENT 834 Node restart requested due to docker issues INFORMATION
SW APPLIANCE_EVENT 836 Application container(s) restart requested by user INFORMATION