Provides Hadoop data quality

IBM® BigQuality® is a data quality solution that provides a rich set of data profiling, cleansing and monitoring capabilities that execute on the data nodes of a Hadoop cluster. IBM BigQuality helps ensure information quality and lets you adapt quickly to strategic business changes through data stewardship, data monitoring and the application of data quality rules to your Hadoop data.

Delivers robust data capabilities

Provides a massively scalable, shared-nothing, in-memory data integration and quality platform. Runs natively in a Hadoop cluster to help bring robust enterprise capabilities to the data lake.

Enables deep data profiling

Delivers a rich set of data profiling capabilities to understand the assets that are moved into Hadoop distributed data storage clusters.

Supports data privacy

Supports data privacy, data masking and test data management initiatives by identifying where personally identifiable information (PII), sensitive data and other classes of data are stored.

Improves time to value

Speeds time to value by identifying the data contained within a column using three dozen predefined, out-of-the-box data classes, including credit card numbers, taxpayer IDs and US phone numbers.
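
To illustrate the general idea of classifying a column against predefined data classes, here is a minimal sketch in Python. It is not IBM BigQuality code: the class names, the regular expressions, the `classify_column` function and the 80 percent threshold are all assumptions chosen for illustration, and a US Social Security number format stands in for a taxpayer ID.

```python
import re

# Hypothetical data classes. These patterns are illustrative only and are
# not the definitions shipped with any product.
DATA_CLASSES = {
    "credit_card": re.compile(r"^\d{4}[ -]?\d{4}[ -]?\d{4}[ -]?\d{4}$"),
    "us_phone": re.compile(r"^\(?\d{3}\)?[ .-]?\d{3}[ .-]?\d{4}$"),
    "us_ssn": re.compile(r"^\d{3}-\d{2}-\d{4}$"),  # stand-in for a taxpayer ID
}


def classify_column(values, threshold=0.8):
    """Return the best-matching data class name for a column, or None.

    A class is assigned only if at least `threshold` of the non-empty
    values match its pattern (an assumed rule for this sketch).
    """
    non_empty = [v.strip() for v in values if v and v.strip()]
    if not non_empty:
        return None

    best_name, best_ratio = None, 0.0
    for name, pattern in DATA_CLASSES.items():
        matches = sum(1 for v in non_empty if pattern.match(v))
        ratio = matches / len(non_empty)
        if ratio > best_ratio:
            best_name, best_ratio = name, ratio

    return best_name if best_ratio >= threshold else None


if __name__ == "__main__":
    phone_column = ["555-123-4567", "(555) 123-4567", "555.123.4567"]
    print(classify_column(phone_column))
```

Matching on a ratio rather than requiring every value to match lets a column be classified even when it contains a few malformed or placeholder entries.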

Provides powerful data tools

Supports data investigation, standardization, matching, survivorship and address verification, all running directly inside a Hadoop cluster. Provides USAC and AVI address cleansing and validation.