Observability issues

Diagnose and resolve common issues encountered while deploying, configuring, and operating observability and logging components. The topic includes several scenarios for missing or misconfigured operators, deployment and readiness issues with observability stacks, and failures in logging, tracing, and network observability components.

Identify issues related to ODF external storage configuration, network connectivity, and associated error conditions. Resolve issues when Network Observability components, including netobserv-ebpf agent pods and flowlogs-pipeline pods, remain in the ContainerCreating state without clear error messages. Common LokiStack deployment problems are covered, such as pod readiness failures, missing or incorrect configurations, storage setup issues, sizing considerations, and resource constraints. In addition, the section provides guidance on configuring and troubleshooting namespace filters in ClusterLogForwarder, resolving observability stack deployment issues on managed clusters, and identifying and fixing common Tempo and OpenTelemetry tracing problems using recommended resolutions.

These troubleshooting topics provide guidance and best practices to help restore functionality, ensure secure and reliable observability data flow, and maintain the overall health of your observability and logging deployments.