What's new and changed in Unstructured Data Integration

Unstructured Data Integration updates can include new features and fixes. Releases are listed in reverse chronological order so that the latest release is at the beginning of the topic.

You can see a list of the new features for the platform and all of the services at What's new in IBM Software Hub.

Installing or upgrading Unstructured Data Integration

Ready to install or upgrade Unstructured Data Integration?

  • To install or upgrade Unstructured Data Integration as part of watsonx.data™ intelligence, see watsonx.data intelligence.
  • To install or upgrade Unstructured Data Integration as part of watsonx.data integration, see watsonx.data integration.
    Remember: All of the IBM® Software Hub components associated with an instance of IBM Software Hub must be installed at the same version.

IBM Software Hub Version 5.4.0

A new version of Unstructured Data Integration was released in June 2026 with IBM Software Hub 5.4.0.

Operand version: 5.4.0

This release includes the following changes:

New features
This release of Unstructured Data Integration includes the following features:
Process unstructured documents in multiple languages
You can now ingest and curate unstructured documents in the following languages:
  • French
  • German
  • Italian
  • Japanese
  • Korean
  • Polish
  • Spanish
Use semantic chunking in Unstructured Data Integration

You can now select semantic chunking in the Chunking operator. This option produces chunks that follow natural topic and meaning boundaries rather than arbitrary size limits. The result is more coherent context units, higher-quality embeddings, more accurate retrieval, and less noise in downstream question answering.
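
To illustrate the general idea of splitting on meaning boundaries rather than size limits, here is a minimal Python sketch. It is not the Chunking operator's implementation. The function names (`sentence_vector`, `cosine`, `semantic_chunks`), the `threshold` parameter, and the bag-of-words similarity are all assumptions made for illustration; a production system would more likely compare sentence embeddings from a language model.

```python
import math
import re
from collections import Counter


def sentence_vector(sentence):
    """Toy stand-in for an embedding: a bag-of-words count vector (assumption)."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_chunks(text, threshold=0.2):
    """Group consecutive sentences into chunks.

    A new chunk starts whenever the similarity between a sentence and the
    one before it falls below `threshold` (a hypothetical tuning value).
    """
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    chunks, current = [], []
    for sentence in sentences:
        if current:
            similarity = cosine(sentence_vector(current[-1]), sentence_vector(sentence))
            if similarity < threshold:
                # Topic shift detected: close the current chunk.
                chunks.append(" ".join(current))
                current = []
        current.append(sentence)
    if current:
        chunks.append(" ".join(current))
    return chunks


if __name__ == "__main__":
    sample = (
        "The cat sat on the mat. The cat chased the mouse. "
        "Quarterly revenue grew ten percent. Revenue grew faster than the forecast."
    )
    for chunk in semantic_chunks(sample):
        print(chunk)
```

In this sketch the chunk boundary falls where the vocabulary changes from one topic to the other, regardless of how long each chunk is. A fixed-size splitter would instead cut at a character or token count and could split a topic in half.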

Summarize chunks with AI in Unstructured Data Integration

You can now generate an AI-powered summary for each document chunk to improve context understanding and retrieval accuracy.

Ingest and store unstructured data by using more supported connectors
You can now ingest data from the following sources:
  • Confluence
  • Google Drive
You can also use the following databases as the target vector store:
  • OpenSearch
  • DataStax Astra DB
You can use the following databases to store document sets and as the entity store:
  • Microsoft Azure Databricks
  • PostgreSQL
  • Db2
  • Oracle
Unstructured data curation supports a subset of these connectors.
Work with more file types in Unstructured Data Integration
You can now process the following file types:
  • HTML
  • XLSX
  • BMP
  • GIF
  • JFIF
  • JPG
  • JPEG
  • PNG
  • TIFF
  • TIF
Unstructured data curation supports a subset of these file types.