Common service installation issues
Use this troubleshooting information to resolve common install and upgrade problems related to IBM Storage Fusion services.
ImagePull failure during installation or upgrade of any service
If an ImagePull failure occurs during the installation or upgrade of any service, then restart the pod and retry. If the issue persists, contact IBM support.
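As a minimal sketch of the restart step, the following shell function picks out the pods whose status shows an image pull failure, so that they can be deleted and recreated by their controller. The function name image_pull_failed_pods and the variable SERVICE_NS are hypothetical, and the sketch assumes that the status is the third column of the `oc get pods --no-headers` output.

```shell
# Hypothetical helper: reads `oc get pods --no-headers` output on stdin and
# prints the names of pods whose STATUS column shows an image pull failure
# (also matches init-container variants such as Init:ImagePullBackOff).
image_pull_failed_pods() {
  awk '$3 ~ /ImagePullBackOff|ErrImagePull/ {print $1}'
}

# Example usage (SERVICE_NS is an assumed variable that holds the service namespace):
#   oc get pods -n "$SERVICE_NS" --no-headers | image_pull_failed_pods \
#     | xargs oc delete pod -n "$SERVICE_NS"
```

Deleting a pod that is managed by a deployment or stateful set causes it to be recreated, which retries the image pull.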
Rook-cephfs pods are in CrashLoopBackOff state
- Problem statement
- The installed Data Cataloging and Backup & Restore services go into a degraded state, and the rook-ceph-mds-ocs-storagecluster pods are in the CrashLoopBackOff state.
- Resolution
- Follow the steps to resolve the issue:
- If IBM Storage Fusion is installed with OpenShift® Container Platform or Data Foundation v4.10.x, then you can upgrade it to v4.11.x.
- The upgrade of OpenShift Container Platform or Data Foundation from v4.10.x to v4.11.x resolves the CrashLoopBackOff error in the rook-ceph-mds-ocs-storagecluster pods and brings them to the running state. Eventually, the services also return to a healthy state.
Note: You can also upgrade OpenShift Container Platform or Data Foundation to later versions that are supported by IBM Storage Fusion.
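Before planning the upgrade, it can help to confirm that the cluster is actually on a v4.10.x release. The following is a minimal sketch: the function name is_v4_10 is hypothetical, and the openshift-storage namespace in the usage comment is an assumption (it is the usual default for Data Foundation, but your installation might differ).

```shell
# Hypothetical helper: succeeds (exit status 0) when the given version string
# is a v4.10.x release, and fails (exit status 1) otherwise.
is_v4_10() {
  case "$1" in
    4.10.*) return 0 ;;
    *)      return 1 ;;
  esac
}

# Example usage:
#   ocp_version=$(oc get clusterversion version -o jsonpath='{.status.desired.version}')
#   if is_v4_10 "$ocp_version"; then echo "Cluster is on v4.10.x; plan the upgrade to v4.11.x"; fi
#
# To see the state of the affected pods (namespace is an assumption):
#   oc get pods -n openshift-storage | grep rook-ceph-mds-ocs-storagecluster
```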
Service upgrade option unavailable
- Problem statement
- The upgrade option for IBM Storage Fusion services is unavailable because the FusionServiceDefinition CR is not updated.
- Resolution
- Follow the steps to resolve the issue:
- Back up the FusionServiceDefinition CR of each service that has this issue.
- Run the following command to export the IBM Storage Fusion namespace as an environment variable:
export FUSION_NS="namespace-where-fusion-is-installed"
- Run the following command to delete the FusionServiceDefinition CR:
oc delete fsd <fsd-name> -n $FUSION_NS
- Run the following command to delete the isf-prereq-operator pod in the namespace where IBM Storage Fusion is installed:
oc get pods -n $FUSION_NS --no-headers | grep "isf-prereq-operator-controller-manager" | awk '{print $1}' | xargs oc delete pod -n $FUSION_NS
The isf-prereq-operator pod restarts, and the FusionServiceDefinition CR gets updated with the new values.
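The backup step in this procedure does not name a command. One way to do it, as a sketch under stated assumptions, is to save the CR as YAML before deleting it. The function name backup_fsd and the default file name are hypothetical; the fsd short name is the one already used in the delete command of this procedure.

```shell
# Hypothetical helper: saves one FusionServiceDefinition CR as YAML.
# Arguments: <fsd-name> <namespace> [output-file]
# The body runs in a subshell so that its variables do not leak into the caller.
backup_fsd() (
  name="$1"
  ns="$2"
  out="${3:-${name}-backup.yaml}"   # assumed default file name
  oc get fsd "$name" -n "$ns" -o yaml > "$out"
)

# Example usage:
#   backup_fsd <fsd-name> "$FUSION_NS" && oc delete fsd <fsd-name> -n "$FUSION_NS"
```

Chaining the delete with `&&` means that the CR is deleted only if the backup command succeeded.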
Troubleshooting common installation issues
- If you find an error saying Configmap fusionplatform not found in fusion namespace in the preparer operator logs (the isf-prereq-operator-controller-manager-xxxx pod logs), then the error does not have any impact and can be ignored.
- Problem statement
- The upgrade button is unavailable for a service.
- Resolution
- During an offline upgrade, if you do not see the upgrade button for any of the services after upgrading the IBM Storage Fusion operator, then check that the catalogsource pod is in a running state. For any ImagePullBackOff error, ensure that you have completed mirroring and updated the imagecontentsourcepolicy.
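The check described above can be sketched as a small filter over the pod list. The function name not_running_pods is hypothetical, and the text "catalog" in the usage comment is only an assumed fragment of the catalog source pod name; substitute the name used in your environment. The sketch assumes the column layout of `oc get pods -A --no-headers` (namespace, name, ready, status).

```shell
# Hypothetical helper: reads `oc get pods -A --no-headers` output on stdin and
# prints "<namespace>/<pod> <status>" for every pod whose name contains the
# given text and whose STATUS column is not Running.
not_running_pods() {
  awk -v pat="$1" 'index($2, pat) && $4 != "Running" {print $1 "/" $2, $4}'
}

# Example usage:
#   oc get pods -A --no-headers | not_running_pods catalog
#
# Related checks for the catalog source and the mirror configuration:
#   oc get catalogsource -A
#   oc get imagecontentsourcepolicy
```

No output from the filter means that every matching pod reports the Running status.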