IBM watsonx.data integration

Version: 2.3.1

Experience: Data Fabric

Description

Watsonx.data integration provides unified tools that you can use to transform, integrate, and observe your data. You can use a range of data integration styles, such as streaming, replication, observability, and bulk or batch processing.

Transform batch data
Transform batch data with DataStage to create batch data flows that extract data from multiple source systems, transform the data as required, and deliver the data to target systems. Batch data flows support extract, transform, and load (ETL) and extract, load, and transform (ELT) patterns.
Stream real-time data
Stream real-time data with StreamSets to create streaming data flows that act on time-sensitive data. A streaming data flow runs continuously to read, process, and write data as soon as the data becomes available. Streaming data flows support light in-flight transformations.
Replicate data
Replicate data with Data Replication to build a replication pipeline that synchronizes data between a source and target data store. Data Replication provides near-real-time data delivery with low impact to sources.
Prepare unstructured data
Prepare unstructured data with Unstructured Data Integration to ingest, transform, and enrich unstructured data from diverse sources.
Observe data
Create alerts with Data Observability to notify you when a data integration process encounters errors or behaves differently than you expect. For example, you can set up alerts that monitor DataStage job status. Investigate data incidents with Data Observability to resolve issues with data quality, integrity, and access.
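
As a generic illustration of the ETL and ELT patterns that batch data flows support, the following minimal sketch contrasts the two by using Python's built-in sqlite3 module as a stand-in target system. It is not DataStage code and does not use any watsonx.data integration API. The sample rows, table names, and function names (SOURCE_ROWS, run_etl, run_elt) are assumptions made for this example only.

```python
import sqlite3

# Hypothetical source rows: (customer_id, name with inconsistent formatting)
SOURCE_ROWS = [(1, "  alice "), (2, "Bob")]

QUERY = "SELECT customer_id, name FROM customers ORDER BY customer_id"


def run_etl(rows):
    """ETL: transform the rows before loading them into the target."""
    target = sqlite3.connect(":memory:")
    target.execute("CREATE TABLE customers (customer_id INTEGER, name TEXT)")
    # The transform step runs outside the target, in the integration layer.
    transformed = [(cid, name.strip().upper()) for cid, name in rows]
    target.executemany("INSERT INTO customers VALUES (?, ?)", transformed)
    result = target.execute(QUERY).fetchall()
    target.close()
    return result


def run_elt(rows):
    """ELT: load the raw rows first, then transform inside the target."""
    target = sqlite3.connect(":memory:")
    target.execute("CREATE TABLE raw_customers (customer_id INTEGER, name TEXT)")
    target.executemany("INSERT INTO raw_customers VALUES (?, ?)", rows)
    # The transform step runs as SQL in the target system itself.
    target.execute(
        "CREATE TABLE customers AS "
        "SELECT customer_id, UPPER(TRIM(name)) AS name FROM raw_customers"
    )
    result = target.execute(QUERY).fetchall()
    target.close()
    return result


if __name__ == "__main__":
    print(run_etl(SOURCE_ROWS))
    print(run_elt(SOURCE_ROWS))
```

Both functions deliver the same cleaned rows. The difference is where the transformation runs: in the integration layer before the load (ETL), or in the target system after the load (ELT).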

Licensing information

This service is included in the IBM® watsonx.data™ integration license. For more information, see Licenses and entitlements.

Integrated services

Table 1. Prerequisite services
If you plan to install Unstructured Data Integration as part of watsonx.data integration, ensure that the following services are installed and running:

| Service | Capability |
| --- | --- |
| watsonx.ai™ | Train, validate, and deploy AI models. |
| watsonx.data | Collect, store, query, and analyze varied data in a scalable, reliable, and highly efficient single unified open data platform. |
| IBM watsonx.data intelligence | Create catalogs of curated assets with this secure enterprise catalog management platform that is supported by a data governance framework. |

Table 2. Supplemental services
The following related services are often used with this service and provide complementary features, but they are not required.

| Service | Capability |
| --- | --- |
| Orchestration Pipelines | Use Orchestration Pipelines to create end-to-end flows of machine learning pipelines that create models and customize various functions. |