Watson Knowledge Catalog provides a secure enterprise catalog management
platform that is supported by a data governance framework. A catalog connects
people to the data and knowledge that they need. The data governance
framework ensures that data access and data quality are compliant with your
business rules and standards.

A catalog is how you share assets across your enterprise:
- Collaborators in a catalog can access data assets without needing
separate credentials and without being able to see the credentials.
Each collaborator has a role that controls which activities they can
perform in the catalog.
- An asset in a catalog consists of metadata about data, including how to
access the data, the data format, the classification of the asset, which
collaborators can access the data, and other types of metadata that
describe the data. Data assets can include both relational data and
unstructured data, such as PDF or Microsoft Office documents.
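
The two points above can be sketched as a small data model. This is a minimal illustration, not the Watson Knowledge Catalog API: the `Role`, `Asset`, and `Catalog` names, the role names, the activity names, and the `describe` method are all hypothetical. It shows an asset as metadata plus connection details, and a collaborator whose role determines the permitted activities and who never sees the credentials.

```python
from dataclasses import dataclass, field
from enum import Enum


class Role(Enum):
    # Hypothetical role names, chosen for illustration.
    ADMIN = "admin"
    EDITOR = "editor"
    VIEWER = "viewer"


# Hypothetical mapping of roles to the activities they permit.
ACTIVITIES = {
    Role.ADMIN: {"view", "add_asset", "manage_collaborators"},
    Role.EDITOR: {"view", "add_asset"},
    Role.VIEWER: {"view"},
}


@dataclass
class Asset:
    name: str
    data_format: str
    classification: str
    connection: dict  # how to access the data, including credentials
    members: set = field(default_factory=set)  # collaborators allowed access


@dataclass
class Catalog:
    collaborators: dict = field(default_factory=dict)  # user name -> Role
    assets: dict = field(default_factory=dict)  # asset name -> Asset

    def can(self, user, activity):
        """Return True if the user's catalog role permits the activity."""
        role = self.collaborators.get(user)
        return role is not None and activity in ACTIVITIES[role]

    def describe(self, user, asset_name):
        """Return an asset's metadata with the connection details withheld."""
        asset = self.assets[asset_name]
        if not self.can(user, "view") or user not in asset.members:
            raise PermissionError(f"{user} cannot access {asset_name}")
        return {
            "name": asset.name,
            "data_format": asset.data_format,
            "classification": asset.classification,
        }
```

In this sketch, `describe` returns only descriptive metadata, so a collaborator can work with an asset without the contents of `connection` ever being exposed.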

Data Stewards and Data Quality Analysts govern and curate data to provide
high-quality data assets that are easy to find without exposing sensitive
information to unauthorized users. They discover, enrich, classify, and
protect data sets with a combination of advanced data curation tools and
governance artifacts. Then, they publish the data sets as data assets to
your enterprise catalog where Data Scientists and Business Analysts can find
them. Data Stewards and Data Quality Analysts can create governance artifacts
manually, import them from files, or import Knowledge Accelerators.

Data Scientists and Business Analysts can copy catalog assets into analytics
projects to analyze data and build models. The Data Refinery tool for preparing
and visualizing data is included with the Watson Knowledge Catalog service.

The default catalog is your enterprise catalog. It is created automatically
after you install the Watson Knowledge Catalog service and is the only
catalog to which advanced data curation tools apply. The default catalog is
governed so that data protection rules are enforced. For the default catalog,
the information assets view shows additional properties of the assets to aid
curation. Any subsequent catalogs that you create can be governed or
ungoverned, do not have an information assets view, and supply basic data
curation tools.

The following illustration shows the architecture of Watson Knowledge Catalog
for the default catalog.