Fusion Data Foundation service issues

Use this troubleshooting information to resolve install and upgrade problems that are related to Fusion Data Foundation service.

ceph-csi-controller-manager pod in Crashloopbackoff state

Problem statement: After the upgrade of OpenShift® Container Platform 4.16 and Fusion Data Foundation 4.16 to 4.17, the Fusion Data Foundation goes into a degraded state.

Cause: It error occurs because of the Crashloopbackoff state of the ceph-csi-controller-manager pod.

Resolution

Do the following steps to resolve and validate this error:

Run the following example command to remove the limit from CSV:

oc patch csv cephcsi-operator.v4.17.1 --type json -p '[ {"op": "remove", "path": "/spec/install/spec/deployments/0/spec/template/spec/containers/1/resources/limits"} ]' 
clusterserviceversion.operators.coreos.com/cephcsi-operator.v4.17.1 patched

Run the following command to verify the pods status:

oc get pods

Example output:

NAME                                                              READY   STATUS    RESTARTS      AGE
ceph-csi-controller-manager-675dc67f7b-f64pb                      2/2     Running   0             3h39m
ceph-csi-controller-manager-8854c7f5-xdrvx                        1/2     Running   0             6s