What's new and changed in Unstructured Data Integration

Unstructured Data Integration updates can include new features and fixes. Releases are listed in reverse chronological order so that the latest release is at the beginning of the topic.

You can see a list of the new features for the platform and all of the services at What's new in IBM Software Hub.

Installing or upgrading Unstructured Data Integration

Ready to install or upgrade Unstructured Data Integration?

To install or upgrade Unstructured Data Integration as part of watsonx.data™ intelligence, see watsonx.data intelligence.
To install or upgrade Unstructured Data Integration as part of watsonx.data integration, see watsonx.data integration.
Remember: All of the IBM® Software Hub components associated with an instance of IBM Software Hub must be installed at the same version.

IBM Software Hub Version 5.4.0

A new version of Unstructured Data Integration was released in June 2026 with IBM Software Hub 5.4.0.

Operand version: 5.4.0

This release includes the following changes:

New features

This release of Unstructured Data Integration includes the following features:

Process unstructured documents in multiple languages

You can now ingest and curate unstructured data documents in the following languages:

French
German
Italian
Japanese
Korean
Polish
Spanish

Use semantic chunking in Unstructured Data Integration

You can now select semantic chunking in the Chunking operator. This option produces chunks that follow natural topic and meaning boundaries rather than arbitrary size limits. The result is more coherent context units, higher-quality embeddings, more accurate retrieval, and less noise during downstream question answering.
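To illustrate the general idea of splitting on meaning boundaries rather than size limits, here is a minimal sketch. It is not the Chunking operator's implementation, which these notes do not describe. The function names, the word-count "embedding", and the 0.2 threshold are all assumptions chosen for illustration; a real pipeline would typically use an embedding model instead.

```python
import math
import re
from collections import Counter


def bag_of_words(sentence: str) -> Counter:
    """Toy stand-in for an embedding (assumption): lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_chunks(sentences: list[str], threshold: float = 0.2) -> list[list[str]]:
    """Start a new chunk when a sentence is dissimilar to the previous one.

    The threshold is a hypothetical value for this sketch, not a product setting.
    """
    chunks: list[list[str]] = []
    for sentence in sentences:
        previous = chunks[-1][-1] if chunks else None
        if previous is not None and cosine(bag_of_words(previous), bag_of_words(sentence)) >= threshold:
            chunks[-1].append(sentence)  # same topic: extend the current chunk
        else:
            chunks.append([sentence])  # topic shift (or first sentence): new chunk
    return chunks


if __name__ == "__main__":
    sample = [
        "The cat sat on the mat.",
        "The cat chased the mouse.",
        "Stock prices fell sharply today.",
        "Stock markets and prices recovered later.",
    ]
    for chunk in semantic_chunks(sample):
        print(chunk)
```

In this sketch the chunk boundary falls where the topic shifts from the cat to the stock market, regardless of how long each chunk is. A fixed-size splitter could instead cut in the middle of either topic.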

Summarize chunks with AI in Unstructured Data Integration

You can now generate an AI-powered summary for each document chunk to improve context understanding and retrieval accuracy.

Ingest and store unstructured data by using more supported connectors

You can now ingest data from the following sources:

Confluence
Google Drive

You can also use the following target databases as the vector store:

OpenSearch
DataStax Astra DB

You can use the following databases to store document sets and as the entity store:

Microsoft Azure Databricks
PostgreSQL
Db2
Oracle

Unstructured data curation supports a subset of these connectors.

Work with more file types in Unstructured Data Integration

You can now process the following file types:

HTML
XLSX
BMP
GIF
JFIF
JPG
JPEG
PNG
TIFF
TIF

Unstructured data curation supports a subset of these file types.
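If you stage files before ingestion, a small pre-check can flag which ones fall into the file types listed above. The following is a minimal sketch, not part of the product. The helper name `is_newly_supported` is hypothetical, the extension set is copied from the list in this topic, and the sketch assumes that the file extension reflects the actual file type. It covers only the types added in this release, and curation supports only a subset of them.

```python
from pathlib import Path

# Extensions taken from the file types listed in this topic (assumption:
# the extension matches the real file type; matching is case-insensitive).
NEW_EXTENSIONS = {
    ".html", ".xlsx", ".bmp", ".gif", ".jfif",
    ".jpg", ".jpeg", ".png", ".tiff", ".tif",
}


def is_newly_supported(filename: str) -> bool:
    """Return True if the file extension is one of the types added in this release."""
    return Path(filename).suffix.lower() in NEW_EXTENSIONS


if __name__ == "__main__":
    for name in ["report.xlsx", "scan.TIFF", "README"]:
        print(name, is_newly_supported(name))
```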