A disk device is not accessible on one of the hosts and its corresponding Object Storage Device (OSD) is marked out by the Ceph cluster. This alert is raised when a Ceph node fails to recover within 10 minutes.
- Determine the failed node
- Get the list of worker nodes, and check for the node
oc get nodes --selector='node-role.kubernetes.io/worker','!node-role.kubernetes.io/infra'
- Describe the node which is of NotReady status to get more information on the
failure, using the following
oc describe node <node_name>
- Get the list of worker nodes, and check for the node status: