IBM BigIntegrate

Overview

Integrate Hadoop big data

IBM BigIntegrate is a big data integration solution that provides superior connectivity, fast transformation and reliable, easy-to-use data delivery features that execute on the data nodes of an Apache Hadoop cluster. IBM BigIntegrate provides a flexible and scalable platform to extract, transform and integrate your Hadoop data.

Part of the IBM InfoSphere Information Server product family built specifically to run on Hadoop clusters, BigIntegrate and IBM BigQuality offer end-to-end integration and governance capabilities for your Hadoop data.

Features

Power smarter data integration at scale

Provides a massively scalable, shared-nothing, in-memory data integration engine running natively in a Hadoop cluster to help bring enterprise big data analytics capabilities to the data lake.

Uses metadata management to help make sense of the enormous quantities of information in the data lake.

Delivers big data-related governance features such as impact analysis and data lineage on virtually any integration point, enabling scalable analytics without sacrificing organizational insight.

Transforms big data projects with real-time analytical processing. Integrates with IBM Streams. Uses standard data integration conventions to gather and pass data to powerful big data analytics.