What's new and changed in Unstructured Data Integration
Unstructured Data Integration updates can include new features and fixes. Releases are listed in reverse chronological order so that the latest release is at the beginning of the topic.
You can see a list of the new features for the platform and all of the services at What's new in IBM Software Hub.
Installing or upgrading Unstructured Data Integration
Ready to install or upgrade Unstructured Data Integration?
- To install or upgrade Unstructured Data Integration as part of the watsonx.data™ intelligence, see watsonx.data intelligence.
- To install or upgrade Unstructured Data Integration as part
of the
watsonx.data integration, see watsonx.data
integration.Remember: All of the IBM® Software Hub components associated with an instance of IBM Software Hub must be installed at the same version.
IBM Software Hub Version 5.4.0
A new version of Unstructured Data Integration was released in June 2026 with IBM Software Hub 5.4.0.
Operand version: 5.4.0
This release includes the following changes:
- New features
-
This release of Unstructured Data Integration includes the following features:
- Process unstructured documents in multiple languages
-
You can now ingest and curate unstructured data documents in the following languages:
- French
- German
- Italian
- Japanese
- Korean
- Polish
- Spanish
- Use semantic chunking in Unstructured Data Integration
-
You can now select semantic chunking in the Chunking operator. This option produces chunks that follow natural topic and meaning boundaries rather than arbitrary size limits, resulting in more coherent context units, higher‑quality embeddings, more accurate retrieval, and reduced noise during downstream question‑answering.
- Summarize chunks with AI in Unstructured Data Integration
-
Generate AI-powered summaries for each document chunk to improve context understanding and retrieval accuracy.
- Ingest and store unstructured data by using more supported connectors
- You can now ingest data from the following sources:
- Confluence
- Google Drive
You can also use the following target databases for vector store:Unstructured data curation supports a subset of these connectors.- OpenSearch
- DataStax Astra DB
- Microsoft Azure Databricks
- PostgreSQL
- Db2
- Oracle
- Work with more file types in Unstructured Data Integration
-
You can now process the following file types:
- HTML
- XLSX
- BMP
- GIF
- JFIF
- JPG
- JPEG
- PNG
- TIFF
- TIF