What is Data Refinery?

The Data Refinery tool, available with IBM® Watson® Studio and Watson Knowledge Catalog, saves data preparation time by quickly transforming large amounts of raw data into consumable, high-quality information that’s ready for analytics.

Data Refinery features

Analyze and transform your data

Interactively discover, cleanse, and transform your data with over 100 built-in operations. No coding skills are required.

Profile and visualize data

Understand the quality and distribution of your data using dozens of built-in charts, graphs, and statistics. Automatically detect data types and business classifications.
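As a rough illustration of what column profiling involves, the sketch below infers a column's type and computes a few summary statistics in plain Python. It is not Data Refinery's API or implementation: the function names `infer_type` and `profile_column` are hypothetical, and the type rules are deliberately simplistic assumptions.

```python
import statistics
from collections import Counter


def infer_type(values):
    """Guess a column type from its non-empty string values (illustrative rules only)."""
    non_empty = [v for v in values if v.strip() != ""]
    if not non_empty:
        return "empty"
    # Try the strictest numeric type first, then fall back.
    for name, caster in (("integer", int), ("decimal", float)):
        try:
            for v in non_empty:
                caster(v)
            return name
        except ValueError:
            continue
    return "string"


def profile_column(values):
    """Return basic quality and distribution statistics for one column of strings."""
    non_empty = [v for v in values if v.strip() != ""]
    col_type = infer_type(values)
    summary = {
        "type": col_type,
        "count": len(values),
        "missing": len(values) - len(non_empty),
        "distinct": len(set(non_empty)),
    }
    if col_type in ("integer", "decimal"):
        numbers = [float(v) for v in non_empty]
        summary["min"] = min(numbers)
        summary["max"] = max(numbers)
        summary["mean"] = statistics.mean(numbers)
    else:
        # For non-numeric columns, report the most frequent value and its count.
        summary["most_common"] = Counter(non_empty).most_common(1)
    return summary


ages = profile_column(["34", "27", "", "45"])
cities = profile_column(["Austin", "Boston", "Austin"])
```

Here `ages` reports one missing value and numeric statistics, while `cities` reports its most frequent value. A real profiler would also handle dates, locale-specific number formats, and business classifications.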

Connect to data wherever it resides

Access and explore data in a wide range of data sources, whether within your organization or in the cloud.

Governed self-service data preparation

Automatically enforce policies set by data governance professionals.

Schedule job execution

Schedule data flow executions for repeatable outcomes. Monitor results and receive notifications.

Serverless execution

Scale out with Apache Spark to apply transformation recipes to full data sets. You do not need to manage Apache Spark clusters.

Related products

IBM Watson Studio

Build and train AI models, and prepare and analyze data, all in one integrated environment.

IBM Watson Knowledge Catalog

Intelligently discover, catalog, and govern data and analytic assets to fuel AI apps.

IBM Watson Machine Learning

Leverage an automated, collaborative workflow to build intelligent applications. Use your data to create, train, and deploy self-learning models.
