Known issues and limitations for Watson Speech services
The following known issues and limitations apply to the Watson Speech services.
- Watson Speech CR does not recover after PostgreSQL disk-full condition
- Enabling Instana metrics causes degraded speech performance and increased latency
- Model upload jobs fail when Amazon S3 bucket name exceeds length limits
- cpd-cli export-import command fails when EXPORT_NAME exceeds 14 characters
- Language detection with hybrid models and rnn-t models ignores lid_confidence parameter
- Language model customization fails on en-US_Telephony or en-US_Multimedia models with 500 Internal Server Error
Note: Known issues are cumulative. Issues from previous releases persist in later releases unless removed
from the known issues or otherwise noted.
Watson Speech CR does not recover after PostgreSQL disk-full condition
Applies to: 5.4.0
- Problem
- After a storage full condition, the
watson_speechcustom resource (CR) does not recover and remains in an unhealthy state. The CR shows that it is waiting for the PostgreSQL cluster to enter a healthy state, but the PostgreSQL cluster fails to recover within the expected timeframe.The
watson_speechCR status shows:watson_speech WatsonSpeech speech-cr 2026-05-24T18:51:00Z zen 5.4.0 5.4.0 Watson Speech operator 12.0.0 build 25% Waiting for Postgres cluster to be in a healthy state 2026-05-26T20:26:27.91567Z PostgreSQL cluster failed to enter a healthy state within 15 minutes N/A InProgress edb_cp4d CPDEdbService cpd-edb-service 2026-05-24T18:24:26Z z - Solution
- Contact IBM Support for assistance.
Enabling Instana metrics causes degraded speech performance and increased latency
Applies to:5.4
- Problem
-
When Instana® metrics are enabled, you might experience slow responses or intermittent timeouts for WebSocket‑based synthesize requests. Increased latency has also been observed across Watson Speech to Text (STT) and Watson Text to Speech (TTS) responses, indicating that Instana support for Watson Speech services is currently partial and can result in degraded performance.
Model upload jobs fail when Amazon S3 bucket name exceeds length limits
Applies to: 5.4
- Problem
- Speech model upload jobs can fail with
NoSuchBucketerrors during execution, and the associated job pods remain in theErrorstate.
Pod logs show that the upload process attempts to create or access an Amazon S3 bucket with a generated name that exceeds the S3 maximum bucket name length of 63 characters. When the bucket name exceeds this limit, Amazon S3 rejects the request and the job fails with errors such asoc get pods -n ibm-software-hub-operands -w | grep -iE 'model|upload' speech-cr-stt-models-ce8c5d75ee74b3e0ead636d75b953a8feecda2dvqv 5/6 Error 0 60sbotocore.errorfactory.NoSuchBucket: An error occurred (NoSuchBucket) when calling the PutObject operation: The specified bucket does not exist.
By default, the generated bucket name follows the patternspeech-service-base-models-ibm-<release-name>-<namespace>. If the<namespace>value is longer than 21 characters, the resulting bucket name exceeds the Amazon S3 naming limit and cannot be created.
- Solution
- Edit the Speech custom resource (CR) and explicitly override the
bucketSuffixvalue to ensure that the final bucket name stays within the 63‑character limit.spec: global: datastores: s3: bucketSuffix: "ibm-{{ releaseName }}-{{ speechNamespace }}"
cpd-cli export-import command fails when EXPORT_NAME exceeds 14 characters
Applies to: 5.4
- Problem
EXPORT_NAMEused in the cpd-cli export-import command can contain a maximum of 14 characters. The command fails and returns the following error whenEXPORT_NAMEcontains more than 14 characters.
- Solution
- Define
EXPORT_NAMEusing no more than 14 characters.
Language detection with hybrid models and rnn-t models ignores lid_confidence parameter
Applies to: 5.4
- Problem
- When you use the
detect_languageendpoint with a hybrid model or a rnn-t model (for example,detect_language?model=en-US_BroadbandModel), thelid_confidencequery parameter is ignored. In this case, the request is processed as a standardrecognize/request, which counts against the client’s license quota. This behavior differs from the intended use ofdetect_languageas a free API.
- Solution
- Use only Large Speech Models (LSMs) for language detection.
Language model customization fails on en-US_Telephony or en-US_Multimedia models with 500 Internal Server Error
Applies to: 5.4
- Problem
- When you train language model customizations by using
en-US_Telephonyoren-US_Multimediamodels, it might fail with a similar error.
- Solution
- Use the English Large Speech Model (
en-US). If the model does not work, contact IBM Support.