Service: This feature is not available by default. An administrator must install the full version of the Watson Knowledge Catalog service on the IBM Cloud Pak for Data platform. To determine whether the service is installed, open the Services catalog and check whether the service is enabled.
Information assets view (Watson Knowledge Catalog)
To access the Information assets view, choose Catalogs > Information assets from the navigation menu.
- Required permission
  - You must have this user permission to view information assets:
    - View information assets
Directly after installation or an upgrade, the default admin user does not have access to the Information assets view. To grant the default admin user access, remove the predefined Data Steward role from the user profile and add it again.
Use the Information assets view to edit synchronized information assets, to create artifact relationships, and to view additional information about data assets that are in the default catalog.
Information assets can have both short and long descriptions. Assets in the default catalog, however, can have only one description. When an information asset is synced to an asset in the default catalog, its description is processed as follows:
- If the information asset contains only a short description, this short description is used as the description of the asset in the default catalog.
- If the information asset contains only a long description, this long description is used as the description of the asset in the default catalog.
- If the information asset contains both short and long descriptions, these descriptions are concatenated and used as the description of the asset in the default catalog.
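The three rules above can be expressed as a small function. The following Python sketch is only an illustrative model and is not part of any Watson Knowledge Catalog API. The function name `catalog_description`, the short-before-long order, and the single-space separator are all assumptions, because the rules above do not state how the two descriptions are joined.

```python
from typing import Optional


def catalog_description(
    short: Optional[str],
    long: Optional[str],
    separator: str = " ",  # assumption: the real separator is not documented here
) -> str:
    """Model how one default-catalog description is derived from an information asset."""
    # Keep only the descriptions that are present and not blank.
    parts = [text.strip() for text in (short, long) if text and text.strip()]
    # With one part, that part is used as is. With two parts, they are
    # concatenated (short first, which is an assumed order).
    return separator.join(parts)
```

Calling the function with only one of the two descriptions returns that description unchanged apart from trimmed whitespace. Calling it with both returns a single combined string.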
- The synchronization process between information assets and default catalog assets
- What is synchronized
- How to create and publish information assets
- How to remove synchronized information assets from the default catalog
- Relationships with other assets
- Quality analysis
- Data lineage
The synchronization process between information assets and default catalog assets
The following diagram shows the synchronization process:
The default catalog has a synced icon next to its name on the Your catalogs page. The assets in the synced default catalog and in the Information assets view are continuously synchronized and can be viewed in both places.
The information assets contained in the default catalog are automatically assigned the
What is synchronized
The sync behavior differs depending on the direction of the synchronization. In general, the following information is synced:
- Tables and their associated columns
- Files and their associated columns
- Supported connections
- Term assignments
Syncing from the Information assets view to the default catalog
Synchronization of connections is limited to the following types:
- Db2 (native and JDBC connections)
- Microsoft SQL Server (JDBC connections)
- Oracle (JDBC connections)
- Teradata (JDBC connections)
In the default catalog, you can add tags, ratings, or reviews, and assign terms to assets synced from the Information assets view. You can also update or remove any term assignments for such assets that were made in the default catalog. However, you cannot change or remove term assignments that were originally made in the Information assets view and then synced to the default catalog.
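The rule for term assignments depends on where an assignment was originally made. The following Python sketch models that rule under stated assumptions: the `Origin` enum, the `TermAssignment` class, and the function `can_modify_in_default_catalog` are hypothetical names introduced for illustration and do not correspond to a product API.

```python
from dataclasses import dataclass
from enum import Enum


class Origin(Enum):
    """Where a term assignment was originally made (hypothetical model)."""
    INFORMATION_ASSETS_VIEW = "information_assets_view"
    DEFAULT_CATALOG = "default_catalog"


@dataclass(frozen=True)
class TermAssignment:
    term: str
    origin: Origin


def can_modify_in_default_catalog(assignment: TermAssignment) -> bool:
    """Return True if the assignment can be updated or removed in the default catalog."""
    # Assignments made in the default catalog can be changed there.
    # Assignments synced from the Information assets view cannot.
    return assignment.origin is Origin.DEFAULT_CATALOG
```

In this model, an assignment that was synced from the Information assets view is read-only in the default catalog, while an assignment added in the default catalog stays editable there.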
You can control access to synced assets by adding or removing members as appropriate.
Data preview and profiling are available only for data assets that are discovered and synchronized from the Information assets view to the default catalog through one of the supported connections. For example, data preview is not available for table data assets that are synchronized from other data source types, because no connection is available to access the table’s data in the remote data source.
Syncing from the default catalog to the Information assets view
Connections and connected assets that are added to the default catalog, either directly or by publishing quick scan results, are synced to the Information assets view for the following supported connection types:
- Cloud Object Storage (infrastructure)
- Data Virtualization (Db2)
- Db2 Big SQL
- Db2 on Cloud
- Db2 Hosted
- Db2 Warehouse
- Generic JDBC (such connections require upload of the required JDBC driver JAR files)
- Microsoft SQL Server
Other types of connections and assets from such connections are not synced to the Information assets view.
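Because the supported connection types differ by sync direction, it can help to treat the two lists as lookup sets. The following Python sketch is an illustrative model only. The connection names are copied from the lists in this section, while the direction keys `to_default_catalog` and `to_information_assets` and the function `is_connection_synced` are hypothetical names, not product identifiers.

```python
# Connection types synced from the Information assets view to the default catalog.
# The native/JDBC qualifiers from the list are omitted in this simplified model.
SYNCED_TO_DEFAULT_CATALOG = frozenset({
    "Db2",
    "Microsoft SQL Server",
    "Oracle",
    "Teradata",
})

# Connection types synced from the default catalog to the Information assets view.
SYNCED_TO_INFORMATION_ASSETS = frozenset({
    "Cloud Object Storage (infrastructure)",
    "Data Virtualization (Db2)",
    "Db2 Big SQL",
    "Db2 on Cloud",
    "Db2 Hosted",
    "Db2 Warehouse",
    "Generic JDBC",
    "Microsoft SQL Server",
})

_SUPPORTED_BY_DIRECTION = {
    "to_default_catalog": SYNCED_TO_DEFAULT_CATALOG,
    "to_information_assets": SYNCED_TO_INFORMATION_ASSETS,
}


def is_connection_synced(connection_type: str, direction: str) -> bool:
    """Return True if the connection type is synced in the given direction."""
    try:
        supported = _SUPPORTED_BY_DIRECTION[direction]
    except KeyError:
        raise ValueError(f"unknown direction: {direction!r}") from None
    return connection_type in supported
```

In this model, Microsoft SQL Server is the only name that appears in both sets. An Oracle connection, for example, is synced to the default catalog but not in the opposite direction.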
How to create and publish information assets
You can create information assets and have them synchronized to the default catalog in these ways:
- Automated discovery
  - Run a discovery job with the discovery option Publish results to catalog.
  - Run a discovery job and publish the results. Published discovery results include term assignments, data-class assignments, quality score, and other quality-related statistics. The results are added to the Information assets view and then synced to the default catalog.
- Metadata import
  - Set up and run an Express Import to import assets into a staging area and have them automatically analyzed and then shared to the Information assets view.
  - Set up and run a Managed Import to import assets into a staging area, where you can analyze and work with the assets before you share them manually to the Information assets view. The information assets are then synced to the default catalog.
How to remove synchronized information assets from the default catalog
To remove an information asset from the default catalog or from data quality projects, you must first remove it from the Information assets view. The synchronization process then propagates the deletion from the Information assets view to the default catalog. Assets that are not synchronized, however, can be removed directly from the default catalog or from projects.
To remove synchronized information assets from the default catalog or data quality projects:
- Go to Catalogs > Information assets.
- On the Assets tab, you can search for information assets by name or filter by asset type in the Explore assets view. For example, expand the Databases section under Asset types and select Database Table.
- In the row of the asset that you want to remove, select Delete from the Actions menu.
- The asset is deleted in the Information assets view and then removed from the default catalog during synchronization.
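The removal behavior described in this section can be modeled as a toy simulation: a synchronized asset cannot be removed directly from the default catalog, and a deletion in the Information assets view reaches the default catalog only when synchronization runs. The following Python sketch rests on assumptions throughout. The class `CatalogModel` and its methods are hypothetical, the assets are plain strings, and the use of `PermissionError` is an arbitrary choice for the sketch, not the behavior of the product.

```python
from dataclasses import dataclass, field
from typing import Set


@dataclass
class CatalogModel:
    """Toy model of the Information assets view and the default catalog."""
    information_assets: Set[str] = field(default_factory=set)
    default_catalog: Set[str] = field(default_factory=set)
    synced: Set[str] = field(default_factory=set)  # assets kept in sync

    def remove_from_default_catalog(self, name: str) -> None:
        # Only assets that are not synchronized can be removed directly.
        if name in self.synced:
            raise PermissionError(
                f"{name!r} is synchronized; delete it in the Information assets view"
            )
        self.default_catalog.discard(name)

    def delete_information_asset(self, name: str) -> None:
        # The deletion happens in the Information assets view first.
        self.information_assets.discard(name)

    def synchronize(self) -> None:
        # Propagate deletions: synced assets that no longer exist in the
        # Information assets view are removed from the default catalog.
        deleted = self.synced - self.information_assets
        self.default_catalog -= deleted
        self.synced -= deleted
```

In this model, deleting an asset in the Information assets view leaves it in the default catalog until `synchronize()` is called, which mirrors the two-stage removal in the procedure above.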
Relationships with other assets
A relationship graph shows how assets are related to one another. When the interconnections are complex, a graphical depiction of the relationships helps you understand the structure of your data. Each relationship has a direction and a name. You can see the relationship graph in the Relationships pane.
Quality analysis
You can view the results of quality analysis for assets that were analyzed. Quality analysis can include data quality, column, primary key, relationship, and overlap analysis.
To view quality analysis, on the asset details page, click the Quality Analysis or Quality Dimensions tab.
Data lineage
Data lineage reports show the movement of data through a job or multiple jobs. These reports can show the order of activities within a run of a job.
To view data lineage, on the asset details page, choose Data Lineage Viewer from the menu.