Fusion Data Foundation service issues

Use this troubleshooting information to resolve install and upgrade problems that are related to Fusion Data Foundation service.

ceph-csi-controller-manager pod in Crashloopbackoff state

Problem statement

After the upgrade of OpenShift® Container Platform 4.16 and Fusion Data Foundation 4.16 to 4.17, the Fusion Data Foundation goes into a degraded state.

Cause
It error occurs because of the Crashloopbackoff state of the ceph-csi-controller-manager pod.
Resolution
Do the following steps to resolve and validate this error:
  1. Run the following example command to remove the limit from CSV:
    oc patch csv cephcsi-operator.v4.17.1 --type json -p '[ {"op": "remove", "path": "/spec/install/spec/deployments/0/spec/template/spec/containers/1/resources/limits"} ]' 
    clusterserviceversion.operators.coreos.com/cephcsi-operator.v4.17.1 patched
  2. Run the following command to verify the pods status:
    oc get pods
    
    Example output:
    NAME                                                              READY   STATUS    RESTARTS      AGE
    ceph-csi-controller-manager-675dc67f7b-f64pb                      2/2     Running   0             3h39m
    ceph-csi-controller-manager-8854c7f5-xdrvx                        1/2     Running   0             6s

Known issues in upgrade

  • You cannot upgrade Fusion Data Foundation from 4.14 or 4.15 to 4.16 instead do a fresh installation of Fusion Data Foundation 4.16.