Known issues and limitations for Watson Discovery
The following known issues and limitations apply to the Watson Discovery service.
- Default noobaa resources might cause the OOMKilled error
- Error is displayed after applying the temporary patch
- During shutdown the DATASTOREQUIESCE field does not update
- Upgrade fails due to existing Elasticsearch 6.x indices
- UpgradeError is shown after resizing PVC
- Disruption of service after upgrading, restarting, or scaling by updating scaleConfig
Default noobaa resources might cause the OOMKilled error
Applies to: 5.4.0
- Error
-
The noobaa resources might cause the
OOMKillederror due to insufficient memory. This error is triggered especially during the Watson Discovery installation or upgrade, as these operations require significant access to the noobaa storage.If you are using the noobaa backing store, run the following commands to verify and patch its resources. For other types of backing stores, refer to their respective documentation to adjust resource sizes.
oc get pods -n openshift-storagenoobaa-default-backing-store-noobaa-pod-2a4a1886 0/1 CrashLoopBackOff 104 (62s ago) 32hoc get pods -n openshift-storage noobaa-default-backing-store-noobaa-pod-2a4a1886 -o yaml... status: ... lastState: terminated: containerID: ... exitCode: 137 finishedAt: "2026-01-10T23:44:45Z" reason: OOMKilled startedAt: "2026-01-10T23:41:42Z" name: noobaa-agent ready: false restartCount: 102 started: false state: waiting: message: back-off 1m20s restarting failed container=noobaa-agent pod=noobaa-default-backing-store-noobaa-pod-2a4a1886_openshift-storage(68abc3f6-25d9-4ac6-87fa-c39f80f6e1af) reason: CrashLoopBackOffAs a result, the Watson Discovery pods that check noobaa contents might encounter download or access errors.oc logs wd-discovery-orchestrator-setup-r9brl -c verify-resourcesVerifying common-zen-wd/mt/__built-in-tenant__/fileResource/701db916-fc83-57ab-0000-000000000010.zip Check if common-zen-wd exists ... Check if object exists Read timeout on endpoint URL: "https://s3.openshift-storage.svc:443/common-zen-wd?list-type=2&prefix=mt%2F__built-in-tenant__%2FfileResource%2F701db916-fc83-57ab-0000-000000000010.zip&delimiter=%2F&encoding-type=url" Object does not exist Retry after 60 seconds - Cause
-
Not enough resource requests for noobaa.
- Solution
- Increase memory size by patching the following
resources.
oc patch -n openshift-storage backingStore/noobaa-default-backing-store --type merge --patch '{ "spec": { "pvPool": { "resources": { "requests": { "memory": "1Gi" }, "limits": { "memory": "1Gi" } } } } }' oc patch -n openshift-storage storagecluster ocs-storagecluster --type merge --patch '{ "spec": { "resources": { "noobaa-endpoint": { "limits": { "memory": "4Gi" }, "requests": { "memory": "4Gi" } } } } }'
Error is displayed after applying the temporary patch
Applies to: 5.4.0
- Error
-
After applying the temporary patch, an error is displayed.
oc get temporarypatches.oppy.ibm.comNAME READY READYREASON UPDATING UPDATINGREASON DEPLOYED VERIFIED AGE temporary-patch False Errored False Errored 1/1 1/1 4sAfter a while, an error is displayed in the status as well.oc get wdNAME VERSION READY READYREASON UPDATING UPDATINGREASON DEPLOYED VERIFIED QUIESCE DATASTOREQUIESCE AGE wd 5.2.1 False ConfigError False Errored 24/24 24/24 NOT_QUIESCED NOT_QUIESCED 11h - Cause
- This issue is caused when the Watson Discovery operator deletes the spec field of the temporary patch.
- Solution
- Apply the same patch again to fill the spec field. Then, delete the Watson Discovery operator to restart
it.
oc delete pod -l icpdsupport/addOnId=discovery,icpdsupport/app=operator -n ${PROJECT_CPD_INST_OPERATORS}
During shutdown the DATASTOREQUIESCE field does not update
- Error
- After you run the
cpd-cli manage shutdowncommand, theDATASTOREQUIESCEstate in the Watson Discovery resource is stuck inQUIESCING.Theshutdowncommand completes successfully. However, when you check the status of theWatsonDiscovery wdcustom resource (oc get WatsonDiscovery wd -n "${PROJECT_CPD_INST_OPERANDS}"), the command returns:NAME VERSION READY READYREASON UPDATING UPDATINGREASON DEPLOYED VERIFIED QUIESCE DATASTOREQUIESCE AGE wd 4.7.3 True Stable False Stable 24/24 24/24 QUIESCED QUIESCING 16h
- Cause
- Due to the way quiescing Postgres works, the Postgres pods are still running in background. This results in the metadata not updating in the Watson Discovery resource.
- Solution
- There is no fix for this. However, the state being stuck in
QUIESCINGdoes not affect the Watson Discovery operator.
Upgrade fails due to existing Elasticsearch 6.x indices
Applies to: 5.4.0
- Error
- If the existing Elasticsearch cluster has indices created with Elasticsearch 6.x, then upgrading
Watson
Discovery to Version 5.0.0 and later
fails.
> oc get wd wd NAME VERSION READY READYREASON UPDATING UPDATINGREASON DEPLOYED VERIFIED QUIESCE DATASTOREQUIESCE AGE wd 4.8.0 False InProgress True VerifyWait 2/24 1/24 NOT_QUIESCED NOT_QUIESCED 63m - Cause
- Watson Discovery checks for existence of deprecated version of indices in the Elasticsearch cluster when upgrading to Version 5.0.0 and later.
- Solution
- To determine whether existing Elasticsearch 6.x indices are the cause of the upgrade failure,
verify the log of the
wd-discovery-es-detect-indexpod using the following command:
UpgradeError is shown after resizing PVC
Applies to: 5.4.0
- Error
- After you edit the custom resource to change the size of a persistent volume claim for a data store, an error is shown.
- Cause
- You cannot change the persistent volume claim size of a component by updating the custom resource. Instead, you must change the size of the PVC on the persistent volume claim node after it is created.
- Solution
- To prevent the error, undo the changes that were made to the YAML file. For more information about the steps to follow to change the persistent volume claim size successfully, see Scaling an existing persistent volume claim size.
Disruption of service after upgrading, restarting, or scaling by
updating scaleConfig
Applies to: 5.4.0
- Error
- After upgrading, restarting, or scaling Watson
Discovery by updating
the
scaleConfigparameter, the Elasticsearch component might become non-functional, resulting in disruption of service and data loss. - Cause
- The Elasticsearch component uses a quorum of pods to ensure availability when it completes search operations. However, each pod in the quorum must recognize the same pod as the leader of the quorum. The system can run into issues when more than one leader pod is identified.
- Solution
- To determine if confusion about the quorum leader pod is the cause of the issue, complete the
following steps:
- Log in to the cluster, and then set the namespace to the project where the Discovery resources are installed.
- Check each of the Elasticsearch pod with the role of
masterto see which pod it identifies as the quorum leader.
Each pod must list the same pod as the leader.oc get pod -l icpdsupport/addOnId=discovery,app=elastic,role=master,tenant=wd \ -o jsonpath='{range .items[*]}{.metadata.name}{"\n"}{end}' | while read i; do echo $i; oc exec $i \ -c elasticsearch -- bash -c 'curl -ksS "localhost:19200/_cat/master?v"'; echo; doneFor example, in the following result, two different leaders are identified. Pods1and2identify pod2as the leader. However, pod0identifies itself as the leader.wd-ibm-elasticsearch-es-server-master-0 id host ip node 7q0kyXJkSJirUMTDPIuOHA 127.0.0.1 127.0.0.1 wd-ibm-elasticsearch-es-server-master-0 wd-ibm-elasticsearch-es-server-master-1 id host ip node L0mqDts7Rh6HiB0aQ4LLtg 127.0.0.1 127.0.0.1 wd-ibm-elasticsearch-es-server-master-2 wd-ibm-elasticsearch-es-server-master-2 id host ip node L0mqDts7Rh6HiB0aQ4LLtg 127.0.0.1 127.0.0.1 wd-ibm-elasticsearch-es-server-master-2
If you find that more than one pod is identified as the leader, contact IBM Support.
Limitations
- Formulas that are embedded as images, especially those containing division bars (horizontal fractions) or other complex notations, are not reliably recognized or extracted by Watson Discovery. As a result, these formulas might be omitted, misinterpreted, or rendered incorrectly in the extracted output. This limitation stems from how the SDU pipeline handles embedded images, and currently affects all versions of Watson Discovery that use SDU.
- The service supports single-zone deployments; it does not support multi-zone deployments.
- You cannot upgrade the Watson
Discovery service by using the
service-instance upgradecommand from the Cloud Pak for Data command-line interface. - You cannot use the Cloud Pak for Data OpenShift® APIs for Data Protection (OADP) backup and restore utility to do an offline backup and restore the Watson Discovery service. Online backup and restore with OADP is available.