Self-monitoring dashboard for self-hosted environments

The self-monitoring dashboard provides a comprehensive, real-time view into the health and performance of backend infrastructure components, enabling DevOps and SRE teams to proactively manage system reliability and efficiency.

With this dashboard, you can detect issues in these components, such as resource bottlenecks or abnormal usage patterns, as they occur.

The dashboard helps you perform the following actions:

  • Anomaly detection: Identify spikes, slow queries, or resource exhaustion.
  • Capacity planning: Forecast infrastructure needs based on usage trends.
  • Incident response: Quickly pinpoint and resolve critical issues.
  • Performance optimization: Tune queries, scale services, and manage resources effectively.

For self-monitoring to work, an Instana agent must be deployed in the self-hosted environment. The agent must run in INFRASTRUCTURE mode, and its zone name must be set to instana.
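
As a minimal sketch of these two prerequisites, the following Python check validates a mapping of agent settings before deployment. It assumes that the agent mode and zone are supplied through settings named INSTANA_AGENT_MODE and INSTANA_ZONE; confirm these names against the agent documentation for your installation method. The function name check_agent_settings is hypothetical.

```python
REQUIRED_MODE = "INFRASTRUCTURE"
REQUIRED_ZONE = "instana"


def check_agent_settings(settings):
    """Return a list of problems found in the given agent settings mapping."""
    problems = []

    # Assumed setting name for the agent mode.
    mode = settings.get("INSTANA_AGENT_MODE")
    if mode != REQUIRED_MODE:
        problems.append(f"agent mode is {mode!r}, expected {REQUIRED_MODE!r}")

    # Assumed setting name for the agent zone.
    zone = settings.get("INSTANA_ZONE")
    if zone != REQUIRED_ZONE:
        problems.append(f"zone name is {zone!r}, expected {REQUIRED_ZONE!r}")

    return problems
```

An empty list means that both prerequisites are met. For example, you might pass dict(os.environ) on the agent host if the settings are provided as environment variables.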

For reliable self-monitoring, use a second Instana backend. For Standard Edition, it is sufficient to use the demo installation type on the second backend host. On the system to be monitored, you must assign the agent acceptor endpoint to the agent by using the --agent-endpoint-host flag.

To deploy the self-monitoring dashboard, see Self-monitoring dashboard for self-hosted environments.

Viewing the self-monitoring dashboard

To view the self-monitoring dashboard, complete the following steps:

  1. In the Instana UI, click Custom dashboards. The "Custom dashboards" page is displayed.
  2. Click SelfMonitoring. The self-monitoring dashboard is displayed.

You can view the following tiles on the self-monitoring dashboard.

Table 1. Tiles on the self-monitoring dashboard
Tile                                  Description
Open Critical Events                  Critical events that are open
Total count of Restarting pods        Count of pods that are restarting
Critical Event Timeline               Timeline of when critical events occurred
CPU Usage - By Nodes                  Overall CPU utilization of the nodes
Memory Used Percentage - By Nodes     Overall memory utilization of the nodes, as a percentage
CPU Usage by Instana Backend Pods     CPU usage of backend pods, as a percentage
Memory Usage by Instana Backend Pods  Memory usage of backend pods, as a percentage
Disk Usage -                          Tracks how much disk space the data store uses over time
Memory Usage -                        Shows the memory consumption trend of the data store
CPU Usage -                           Displays the CPU utilization of the data store

You can view the following metrics on the dashboard.

Open Critical Events

You can view the critical events that are open.

Table 2. Open Critical Events
Parameter  Description
Title      Title of the event
On         Name of the Instana instance on which the event is open
Started    Time at which the event started
End        Time at which the event ended
Duration   Total duration for which the event was open

Pod Restart Delta

You can view the total count of restarting pods and the percentage of pods that are restarting.
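
The tile name suggests a change in restart counts over a time window. The following is a minimal sketch of that idea, assuming that you have per-pod restart counters sampled at the start and at the end of a window. The function name and the input shape are hypothetical, and this is not necessarily how the dashboard computes its value.

```python
def restart_delta(start_counts, end_counts):
    """Return per-pod restart increases between two samples of restart counters.

    Both arguments map a pod name to its cumulative restart count.
    Pods without any increase are left out of the result.
    """
    deltas = {}
    for pod, end in end_counts.items():
        start = start_counts.get(pod, 0)
        # A counter lower than before suggests the pod was recreated,
        # so count its restarts from zero.
        increase = end - start if end >= start else end
        if increase > 0:
            deltas[pod] = increase
    return deltas
```

The number of keys in the result is the count of pods that restarted in the window, and dividing it by the number of pods in the later sample gives the restarting share.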

Critical Event Timeline

You can view the timeline of critical events.

Table 3. Critical Event Timeline
Parameter                  Description
Critical Events            The timeline of the critical events that occurred today
Critical Events Yesterday  The timeline of the critical events that occurred yesterday

CPU Usage - By Nodes

You can view the CPU usage of the nodes as a percentage.

Memory Used Percentage - By Nodes

You can view the memory usage of the nodes as a percentage.

CPU Usage by Instana Backend Pods

You can view the CPU usage of the Instana backend pods as a percentage. This tile tracks CPU consumption across the microservices that power the Instana monitoring platform, and it visualizes usage trends so that you can detect anomalies and optimize resource allocation.

Memory Usage by Instana Backend Pods

You can view the memory usage of the Instana backend pods as a percentage. This tile tracks memory consumption across the microservices that power the Instana monitoring platform, and it visualizes usage trends so that you can detect anomalies and optimize resource allocation.

Disk Usage - >

You can view the storage that is consumed by the data stores. The disk usage is shown as an absolute amount of storage. To determine the percentage of storage consumed, compare this amount with your allocated storage capacity.

  • For Standard Edition, the utilization of mount points is shown.
  • For Custom Edition, the utilization of the Persistent Volume Claim (PVC) is shown.
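
Because the tile reports consumed storage and not a percentage, you need the allocated capacity to work out utilization. The following is a minimal sketch of that arithmetic. The function name and the example values are hypothetical; substitute the value from the tile and the capacity of your mount point or PVC.

```python
def storage_used_percent(used_bytes, allocated_bytes):
    """Return consumed storage as a percentage of the allocated capacity."""
    if allocated_bytes <= 0:
        raise ValueError("allocated capacity must be positive")
    return 100.0 * used_bytes / allocated_bytes


GIB = 1024 ** 3

# Example: 150 GiB consumed on a volume with 200 GiB allocated.
print(round(storage_used_percent(150 * GIB, 200 * GIB), 1))  # prints 75.0
```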

Memory Usage - >

You can view the memory usage of the data stores. This tile tracks RAM consumption, including buffers and caches, to help you prevent memory pressure.

CPU Usage - >

You can view the CPU usage of the data stores. This tile monitors the processing load to help you identify heavy or inefficient queries.