Proactive health checks and monitoring

To maintain your system as you complete Day 2 operations, review the following guidance for completing routine health checks and for monitoring the ongoing status and health of your IBM Cloud Pak for AIOps deployment and your Red Hat OpenShift Container Platform cluster.

Monitor Red Hat OpenShift Container Platform

Routinely check the Red Hat OpenShift Container Platform Monitoring dashboards in the web console. Red Hat OpenShift Container Platform includes a preconfigured, preinstalled, and self-updating monitoring stack that provides monitoring for core platform components. Red Hat OpenShift Container Platform also provides a comprehensive set of monitoring dashboards that help you understand the state of cluster components and user-defined workloads.

  • Use the Administrator perspective to access the monitoring dashboards to monitor the following components:
    • API performance
    • etcd
    • Kubernetes compute resources
    • Kubernetes network resources
    • Prometheus
    • Cluster and node performance (USE method dashboards)

  • Use the Developer perspective to access Kubernetes compute resources dashboards to view application metrics for monitoring the following status:
    • CPU usage
    • Memory usage
    • Bandwidth information
    • Packet rate information

For more information about Red Hat OpenShift Container Platform Monitoring and the monitoring dashboards, see:

Monitor IBM Cloud Pak for AIOps

For more information about self-monitoring with alerts, see Self-monitoring with alerts.

Run the MustGather health check

Run the MustGather tool with the healthcheck option to check the current health of your deployment and Red Hat OpenShift Container Platform cluster.

For more information about installing and running this tool, see the following topics: