Table of contents

Watson Knowledge Catalog on Cloud Pak for Data

Version 3.2.0


Watson Knowledge Catalog provides a secure enterprise catalog management platform that is supported by a data governance framework. A catalog connects people to the data and knowledge that they need. The data governance framework ensures that data access and data quality are compliant with your business rules and standards.

A catalog is how you share assets across your enterprise:

  • Collaborators in a catalog have access to data assets without needing separate credentials or being able to see the credentials. Collaborators have roles that control what activities they can perform in the catalog.
  • An asset in a catalog consists of metadata about data, including how to access the data, the data format, the classification of the asset, which collaborators can access the data and other types of metadata that describe the data. Data assets can include both relational data and unstructured data, such as PDF or Microsoft Office documents.

Data Stewards and Data Quality Analysts govern and curate data to provide high-quality data assets that are easy to find without exposing sensitive information to unauthorized users. They discover, enrich, classify, and protect data sets with a combination of advanced data curation tools and governance artifacts. Then, they publish the data sets as data assets to your enterprise catalog where Data Scientists and Business Analysts can find them. Data Stewards and Data Quality Analysts can create governance artifacts manually, import them from files, or import Knowledge Accelerators.

Data Scientists and Business Analysts can copy catalog assets into analytics projects to analyze data and build models. The Data Refinery tool for preparing and visualizing data is included with the Watson Knowledge Catalog service.

The default catalog is your enterprise catalog. It is created automatically after you install the Watson Knowledge Catalog service and is the only catalog to which advanced data curation tools apply. The default catalog is governed so that data protection rules are enforced. For the default catalog, the information assets view shows additional properties of the assets to aid curation. Any subsequent catalogs that you create can be governed or ungoverned, do not have an information assets view, and supply basic data curation tools.

The following illustration shows the architecture of Watson Knowledge Catalog for the default catalog.

Integrated services

Supplemental services

Supplemental services
Regulatory Accelerator Reduce the amount of manual effort required to comply with new regulations.

Related services

Related services
Cognos Dashboard Identify patterns in your data with sophisticated visualizations. No coding needed.
Data Virtualization Integrate data sources across multiple types and locations into one logical data view.
Watson Studio Prepare, analyze, and model data in a collaborative environment wth tools for data scientists, developers, and domain experts.

Compatible data sources

See Supported data sources for a list of data source services that are compatible.