IBM Unstructured Data Integration

Version: 

Experience: watsonx™

Description

Use IBM® Unstructured Data Integration to ingest, cleanse, transform, and enrich unstructured data. You can then use the transformed data for Retrieval-Augmented Generation (RAG) and agentic workflows.
  • Use the intuitive, drag-and-drop user interface with pre-built modules for tasks such as text data extraction, filtering, or PII and HAP redaction to process your data.
  • Ingest data from various sources: connected data from S3 or Box, local files, document sets.
  • Generate entities and embeddings for easier retrieval of data.
You can build repeatable visual data flows that help to continuously process new changes and updates in the source and ensure the latest available data is used in your projects.

Licensing information

This service is included in the following licenses:

  • IBM watsonx.data™ integration
  • IBM watsonx.data intelligence
  • IBM watsonx.data Premium Edition

For more information, see Licenses and entitlements.

Quick links

Integrated services

Table 1. Prerequisite services. This service requires the following prerequisite services to be installed.
Service Capability
watsonx.ai™ Train, validate, and deploy AI models.
watsonx.data Collect, store, query, and analyze varied data in a scalable, reliable, and highly efficient single unified open data platform.
IBM watsonx.data intelligence Create catalogs of curated assets with this secure enterprise catalog management platform that is supported by a data governance framework.