Managing data quality checks in a metadata enrichment

In the metadata enrichment results, you can edit existing data quality checks for assets and columns, set a review status, enable or disable checks, or manually add data quality checks.

After you run metadata enrichment with the Identify data quality checks objective, all applicable checks are usually listed as not reviewed. Checks that are not reviewed might be overwritten when you rerun the Identify data quality checks objective and the data or the business-term criteria have changed. Reviewed checks remain unchanged. A check is marked as reviewed in these cases:

  • You explicitly mark a check as reviewed.
  • You enable a disabled or suggested check.
  • You edit a system-generated check, which creates a new user-defined check. User-defined data quality checks are generally marked as reviewed. The original system-generated check is disabled.
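
The review rules above can be sketched as a small state model. This is only an illustration of the described behavior, not the product's implementation: the `Check` class, the `Status` values, the `"system"` and `"user"` origin labels, and all function names are hypothetical.

```python
from dataclasses import dataclass, replace
from enum import Enum


class Status(Enum):
    ACTIVE = "active"
    SUGGESTED = "suggested"
    DISABLED = "disabled"


@dataclass(frozen=True)
class Check:
    check_type: str
    origin: str            # hypothetical labels: "system" or "user"
    status: Status
    reviewed: bool = False
    logic: str = ""


def mark_reviewed(check: Check) -> Check:
    """Case 1: the user explicitly marks the check as reviewed."""
    return replace(check, reviewed=True)


def enable(check: Check) -> Check:
    """Case 2: enabling a disabled or suggested check also marks it as reviewed."""
    if check.status not in (Status.DISABLED, Status.SUGGESTED):
        return check  # already active, nothing changes
    return replace(check, status=Status.ACTIVE, reviewed=True)


def edit_logic(check: Check, new_logic: str) -> tuple[Check, Check]:
    """Case 3: editing a system-generated check disables the original and
    creates a new user-defined check that is marked as reviewed."""
    original = replace(check, status=Status.DISABLED)
    user_check = Check(check.check_type, "user", Status.ACTIVE,
                       reviewed=True, logic=new_logic)
    return original, user_check


def may_be_overwritten_on_rerun(check: Check) -> bool:
    """Only checks that are not reviewed might be overwritten by a rerun."""
    return not check.reviewed
```

In this sketch, a check that has passed through any of the three functions is no longer eligible for overwriting, which mirrors the statement that reviewed checks remain unchanged.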

Only one check per type can be active for a column. If two checks of the same type but with different origins are identified, one of them is listed as suggested. Checks that come from business terms take precedence over checks from other origins.
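
The precedence rule can be illustrated with a minimal sketch. The origin labels (`"business_term"`, `"profiling"`) and the function name are hypothetical, and the tie-breaking between two origins that are not business terms (first one seen stays active) is an assumption, because the text does not specify it.

```python
def resolve_column_checks(candidates):
    """Decide which check of each type is active for one column.

    candidates: list of (check_type, origin) tuples.
    Returns a dict mapping (check_type, origin) to "active" or "suggested".
    """
    result = {}
    active_origin_by_type = {}
    for check_type, origin in candidates:
        current = active_origin_by_type.get(check_type)
        if current is None:
            # First check of this type: it becomes the active one.
            active_origin_by_type[check_type] = origin
            result[(check_type, origin)] = "active"
        elif origin == "business_term" and current != "business_term":
            # A business-term check takes precedence over other origins.
            result[(check_type, current)] = "suggested"
            active_origin_by_type[check_type] = origin
            result[(check_type, origin)] = "active"
        else:
            # Assumption: otherwise the earlier check stays active.
            result[(check_type, origin)] = "suggested"
    return result


statuses = resolve_column_checks([
    ("format", "profiling"),
    ("format", "business_term"),
    ("range", "profiling"),
])
```

Here the `format` check from the business term ends up active and the other `format` check is listed as suggested, while the single `range` check is active.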

You can access the data quality checks in several ways:

  • To access the list of all checks for an asset and the columns it contains, open the asset details and click the Edit icon or the View details link on the Data quality tab in the side panel.

    Depending on the type and status of a data quality check, you can choose from a subset of these actions in the overflow menu for the check:

    • View
    • Mark as reviewed or Mark as not reviewed
    • Edit logic
    • Disable or Enable

    You can also select individual checks and then select an action from the toolbar. If an action is not applicable to a check due to its status, that check is skipped.

    To add a data quality check, click Add data quality check and select On a column or On the asset. If you add a check on a column, select the column. In the Add data quality checks window, only data quality checks that can still be added are available for selection. Depending on the type of check, you can customize it. Add and customize as many checks as required. When you're done, click Add all checks.

    The only asset-level check is the historical stability check, which is available in IBM Knowledge Catalog Premium. You can disable an active check, enable or delete a disabled check, or add such a check to the asset if none exists.

  • To get a filtered view of the checks, open the asset details and click the number in a status section on the Data quality tab in the side panel. The overflow menu shows available actions based on the status of the check.

  • To access the list of all checks for an individual column, open the column details and click the Edit icon on the Data quality tab in the side panel.

    Depending on the type and status of a data quality check, you can choose from a subset of these actions in the overflow menu for the check:

    • View
    • Mark as reviewed or Mark as not reviewed
    • Edit logic
    • Disable or Enable

    You can also select individual checks and then select an action from the toolbar. If an action is not applicable to a check due to its status, that check is skipped.

    To add a data quality check, click Add data quality check. Only data quality checks that can still be added on the column are available for selection. Depending on the type of check, you can customize it. Add and customize as many checks as required. When you're done, click Add all checks.

  • To access a specific check, open the column details and click the entry on the Data quality tab in the side panel. The side panel then shows the details of that check. The available actions depend on the status of the check.
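
The toolbar behavior described above, where an action skips selected checks that it does not apply to, can be sketched as follows. The applicability table is an assumption for illustration (enable applies to disabled or suggested checks, disable applies to active checks), and the function and field names are hypothetical.

```python
# Assumed mapping from action to the statuses it can be applied to.
APPLICABLE_STATUSES = {
    "enable": {"disabled", "suggested"},
    "disable": {"active"},
}
RESULTING_STATUS = {"enable": "active", "disable": "disabled"}


def apply_bulk_action(action, checks):
    """Apply an action to the selected checks, skipping those it does not fit.

    checks: list of dicts with "name" and "status" keys.
    Returns (updated_checks, names_of_skipped_checks).
    """
    updated, skipped = [], []
    for check in checks:
        if check["status"] in APPLICABLE_STATUSES[action]:
            updated.append({**check, "status": RESULTING_STATUS[action]})
        else:
            updated.append(check)          # left unchanged
            skipped.append(check["name"])  # reported as skipped
    return updated, skipped


selected = [
    {"name": "format check", "status": "disabled"},
    {"name": "range check", "status": "active"},
]
updated, skipped = apply_bulk_action("enable", selected)
```

In this example only the disabled check is enabled. The check that is already active is returned unchanged and reported as skipped.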
