What is Data Refinery?
The Data Refinery tool, available via Watson Studio and Watson Knowledge Catalog, reduces data preparation time by quickly transforming large amounts of raw data into consumable, high-quality information that is ready for analytics.
Data Refinery features
Analyze and transform your data
Interactively discover, cleanse, and transform your data with over 100 built-in operations. No coding skills are required.
Profile and visualize data
Understand the quality and distribution of your data using dozens of built-in charts, graphs, and statistics. Automatically detect data types and business classifications.
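To make "quality and distribution" concrete, the following is a minimal, generic sketch of what a column profile can contain: missing-value counts, distinct-value counts, and a detected data type. It is not Data Refinery's implementation or API. The function names (infer_type, profile), the type labels, and the sample rows are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def infer_type(value: str) -> str:
    """Guess a simple type label for one non-empty string value (illustrative labels)."""
    for caster, label in ((int, "integer"), (float, "decimal")):
        try:
            caster(value)
            return label
        except ValueError:
            continue
    return "string"


def profile(rows):
    """Return, per column, the missing count, distinct count, and dominant inferred type."""
    columns = {}
    for row in rows:
        for name, raw in row.items():
            stats = columns.setdefault(
                name, {"missing": 0, "values": Counter(), "types": Counter()}
            )
            value = (raw or "").strip()
            if not value:
                # Empty or whitespace-only cells count as missing.
                stats["missing"] += 1
                continue
            stats["values"][value] += 1
            stats["types"][infer_type(value)] += 1

    summary = {}
    for name, stats in columns.items():
        dominant = stats["types"].most_common(1)
        summary[name] = {
            "missing": stats["missing"],
            "distinct": len(stats["values"]),
            "type": dominant[0][0] if dominant else "unknown",
        }
    return summary


# Hypothetical sample data: raw string values as they might arrive from a CSV file.
sample = [
    {"age": "34", "city": "Austin"},
    {"age": "", "city": "Boston"},
    {"age": "41", "city": "Austin"},
]
report = profile(sample)
```

Here report["age"] records one missing value and an integer type, which is the kind of signal a profile surfaces before any cleansing step is chosen.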
Connect to data wherever it resides
Access and explore data residing in a wide spectrum of data sources within your organization or in the cloud.
Governed self-service data preparation
Policies set by data governance professionals are automatically enforced in Data Refinery.
Schedule job execution
Schedule data flow executions for repeatable outcomes. Monitor job runs and receive notifications of the results.
Serverless execution
Easily scale out via Apache Spark to apply transformation recipes to full data sets. You do not need to manage Apache Spark clusters.