Self-monitoring dashboard for self-hosted environments
The self-monitoring dashboard provides a comprehensive, real-time view into the health and performance of backend infrastructure components, enabling DevOps and SRE teams to proactively manage system reliability and efficiency.
By using this dashboard, you can monitor the health and performance of these services in real-time and detect issues such as resource bottlenecks or abnormal usage patterns.
The dashboard helps to you perform the following actions:
- Anomaly detection: Identify spikes, slow queries, or resource exhaustion.
- Capacity planning: Forecast infrastructure needs based on usage trends.
- Incident response: Quickly pinpoint and resolve critical issues.
- Performance optimization: Tune queries, scale services, and manage resources effectively.
For self-monitoring to work Instana agent must be deployed in the self-hosted environment with agent running in INFRASTRUCTURE mode and zone name is set instana.
For reliable self-monitoring, use a second Instana backend. For Standard Edition, it is sufficient to use the demo installation type on the second backend host. On the system to be monitored, you must assign the agent acceptor endpoint to the
agent by using the --agent-endpoint-host flag.
To deploy self-monitoring dashboard, see Self-monitoring dashboard for self-hosted environments.
Viewing self-monitoring dashboard
To view self-monitoring dashboard, complete the following steps:
- On the Instana UI, click Custom dashboards. The "Custom dashboards" page is displayed.
- Click SelfMonitoring. The self-monitoring dashboard is displayed.
You can view the following tiles on self-monitoring dashboard.
| Tile | Description |
|---|---|
| Open Critical Events | Critical events that are open |
| Total count of Restarting pods | Count of pods that are restarting |
| Critical Event Timeline | Timeline at which critical events occurred |
| CPU Usage - By Nodes | Overall CPU utilization of the nodes |
| Memory Used Percentage - By Nodes | Overall memory utilization of the nodes in percentage |
| CPU Usage by Instana Backend Pods | CPU usage of backend pods in percentage |
| Memory Usage by Instana Backend Pods | Memory usage of backend pods in percentage |
| Disk Usage -
|
Tracks how much disk space is used over time of the data store. |
| Memory Usage -
|
Shows the memory consumption trend of the data store |
| CPU Usage -
|
Displays the CPU utilization of the data store |
You can view the following metrics on the dashboard.
Open Critical Events
You can view critical event that are open.
| Parameter | Description |
|---|---|
| Title | Title of the event |
| On | Name of the Instana instance on which the event is open |
| Started | Time at which the event started |
| End | Time at which the event ended |
| Duration | Total duration for which the event open |
Pod Restart Delta
You can view the total count of restarting pods and the percentage of pods that are open.
Critical Event Timeline
You can view the timeline of critical events.
| Parameter | Description |
|---|---|
| Critical Events | The timeline of the critical events that occurred today |
| Critical Events Yesterday | The timeline of the critical events that occurred yesterday |
CPU Usage - By Nodes
You can view CPU usage of nodes in percentage.
Memory Used Percentage - By Nodes
You can view memory usage of nodes in percentage.
CPU Usage by Instana Backend Pods
You can view CPU usage of Instana backend pods in percentage. Tracks CPU consumption across microservices that power the Instana monitoring platform. Visualizes usage trends to detect anomalies and optimize resource allocation.
Memory Usage by Instana Backend Pods
You can view memory usage of Instana backend pods in percentage. Tracks memory consumption across microservices that power the Instana monitoring platform. Visualizes usage trends to detect anomalies and optimize resource allocation.
Disk Usage - >
You can view storage consumed by data stores. The disk usage shows the amount of storage consumed. To determine the percentage of storage consumed, check your allocated storage capacity.
- For Standard Edition, the utilization of mount points is shown.
- For Custom Edition, utilization of Persistent Volume Claim (PVC) is shown.
Memory Usage - >
You can view memory usage of data stores. Tracks RAM consumption, including buffers and caches, to prevent memory pressure.
CPU Usage - >
You can view CPU usage of data stores. Monitors processing load to identify heavy or inefficient queries.