Governing and curating data (Watson Knowledge Catalog)
With the Watson Knowledge Catalog service, you can have catalogs of curated assets that are supported by a governance framework.
This service is not available by default. An administrator must install this service on the IBM Cloud Pak for Data platform. To determine whether the service is installed, open the Services catalog and check whether the service is enabled.
How you get started depends on your user role and permissions and your goal.
Role | Goal |
---|---|
Data Scientist | Find data assets in a catalog |
Business Analyst | Find data assets in a catalog View information assets |
Data Steward | Curate data Create governance artifacts |
Data Quality Analyst | Curate data Create governance artifacts Analyze data quality |
Administrator | Set up the first catalog |
Developer | View Watson Knowledge Catalog APIs |
Find data assets in a catalog
You can search for assets across catalogs and projects from the global search field in the application header. See Searching for assets across projects and catalogs.
To open a specific catalog, from the navigation menu, click Organize > All catalogs. The Your catalogs page lists of all the catalogs that you can access. Click a catalog name and then find assets.
Create governance artifacts
You must have the Author governance artifacts permission to create governance artifacts. To start creating governance artifacts, click Organize > Data and AI governance and then the artifact type from the navigation menu. See Governance artifacts.
Curate data
The type of data curation you perform depends on your user role or permissions:
- Basic curation is available with all user roles. Start by creating an analytics project.
- Advanced curation is available with the Data Steward and Data Quality Analyst roles. Start by clicking Organize > Metadata curation and then Metadata import or Data discovery. To analyze data quality, click Organize > Data quality. See Curate data or Analyze data quality.
View Watson Knowledge Catalog APIs
To use Watson Knowledge Catalog APIs in your application, view the API documentation at this URL:
https://<your_base_URL>/data-api/api-explorer/
Your base URL is the IP address or name of your Cloud Pak for Data application server.