Proactive health checks and monitoring
To maintain your system as you complete Day 2 operations, review the following guidance for completing routine health checks and for monitoring the ongoing status and health of your IBM Cloud Pak for AIOps deployment and your Red Hat OpenShift Container Platform cluster.
Monitor Red Hat OpenShift Container Platform
Routinely check the Red Hat OpenShift Container Platform Monitoring dashboards in the web console. Red Hat OpenShift Container Platform includes a preconfigured, preinstalled, and self-updating monitoring stack that provides monitoring for core platform components. Red Hat OpenShift Container Platform also provides a comprehensive set of monitoring dashboards that help you understand the state of cluster components and user-defined workloads.
- Use the Administrator perspective to access the monitoring dashboards to monitor the following components:
- API performance
- etcd
- Kubernetes compute resources
- Kubernetes network resources
- Prometheus
- Cluster and node performance (USE method dashboards)
- Use the Developer perspective to access Kubernetes compute resources dashboards to view application metrics for monitoring the following status:
- CPU usage
- Memory usage
- Bandwidth information
- Packet rate information
For more information about Red Hat OpenShift Container Platform Monitoring and the monitoring dashboards, see:
-
Configure and use the Red Hat OpenShift Container Platform Alerting capabilities to monitor your cluster and identify potential problems. With the web console Alerting UI you can manage alerts, silences, and alerting rules to monitor the health of your cluster. For more information, see Managing alerts.
Monitor IBM Cloud Pak for AIOps
For more information about self-monitoring with alerts, see Self-monitoring with alerts.
Run the MustGather health check
Run the MustGather tool with the healthcheck option to check the current health of your deployment and Red Hat OpenShift Container Platform cluster.
For more information about installing and running this tool, see the following topics:
-
Installing the MustGather tool (OpenShift)
Important: If you are using the (OpenShift) version, ensure that your installed version of the tool is
v1.17.6or newer. Some healthcheck data is only gathered with versionv1.17.6and newer. -
Running the MustGather tool (healthcheck)
When running the tool, use the following command:
waiops-mustgather.sh -DO healthcheck