Exporting metadata to IBM Cloud Watson Knowledge Catalog

Export data along with relevant metadata tags from IBM Spectrum® Discover to Watson™ Knowledge Catalog.

Before you begin

Complete the following procedure to add the IBM Cloud Watson Knowledge Catalog details to IBM Spectrum Discover.

  1. Obtain the API key for accessing the Watson Knowledge Catalog (WKC). For WKC in IBM® Cloud, go to https://cloud.ibm.com/iam/overview and select Create an IBM Cloud API key.
  2. Export or copy the key.
  3. Copy the base URI of your WKC instance. For IBM Cloud, the WKC instance URI depends on the geographical location of the instance. For example:
    https://api.dataplatform.cloud.ibm.com/v2/
  4. To configure the deployment parameters for the WKC connector app, log in to the IBM Spectrum Discover instance and run the following command:
    kubectl -n spectrum-discover edit deploy/spectrum-discover-wkcconnector
  5. In the deployment editor, search for the environment variable WKC_API_KEY and edit it.
  6. Add a value string with the API key that is acquired from IBM Cloud in step 2. For example:
    name: WKC_API_KEY
    value: <API-KEY-VALUE>
    Note: Maintain the correct indentation while you edit in the deployment editor.
  7. Set the base URI of the WKC instance to the one copied in step 3. For example:
    name: WKC_BASE_URI
    value: https://api.dataplatform.cloud.ibm.com/v2/
    Note: If you switch to a different IBM Cloud account, you can edit the WKC_API_KEY environment variable in the deployment editor with the API key that is associated with the new account. The application automatically restarts and identifies the new account that is linked.
    If you add new catalogs to your WKC instance, IBM Spectrum Discover retains the old registration information and continues to point to the old catalog IDs. To resolve this issue, complete the following steps:
    1. Go to Metadata > Applications.
    2. Restart the WKC application instance by using the following commands:
      kubectl scale deployment --replicas=0 spectrum-discover-wkcconnector
      kubectl scale deployment --replicas=1 spectrum-discover-wkcconnector
    If you add a connection to IBM Spectrum Discover, you must restart the WKC application before it can use that connection. Use the following commands to restart the WKC application:
    kubectl scale deployment --replicas=0 spectrum-discover-wkcconnector
    kubectl scale deployment --replicas=1 spectrum-discover-wkcconnector
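
The configuration and restart steps above can be wrapped in small shell helpers. The following is a minimal sketch, not part of the product: the function names set_wkc_env and restart_wkc and the KUBECTL override variable are hypothetical, and using kubectl set env instead of the interactive deployment editor is an assumption that it produces the same change to the deployment. The scale commands are copied as they appear in this procedure.

```shell
# Command used to talk to the cluster; override it (for example with "echo")
# to preview the commands without running them. KUBECTL is a hypothetical
# convenience variable, not something IBM Spectrum Discover defines.
KUBECTL="${KUBECTL:-kubectl}"

# Assumed non-interactive alternative to steps 4 to 7: set both environment
# variables on the WKC connector deployment.
# $1 = API key, $2 = base URI of the WKC instance
set_wkc_env() {
    $KUBECTL -n spectrum-discover set env deploy/spectrum-discover-wkcconnector \
        "WKC_API_KEY=$1" "WKC_BASE_URI=$2"
}

# Restart the WKC application by scaling the deployment down and back up,
# using the same two commands that the procedure lists.
restart_wkc() {
    $KUBECTL scale deployment --replicas=0 spectrum-discover-wkcconnector &&
    $KUBECTL scale deployment --replicas=1 spectrum-discover-wkcconnector
}
```

Setting KUBECTL=echo before calling restart_wkc prints the two scale commands instead of running them, which is a quick way to check the sketch before it touches a cluster. Passing the API key as a command-line argument can expose it in shell history, so the interactive editor may still be preferable.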

About this task

The Watson Knowledge Catalog is a data cataloging system that cannot always scan relevant data sources and capture the relevant metadata from the files that they contain. IBM Spectrum Discover bridges this gap by exporting data to Watson Knowledge Catalog with all relevant metadata tags.

Note: If the source connection that IBM Spectrum Discover is accessing to export data is the one that WKC can also connect to, then you can configure the connection map within the WKC Connector App. For more information, see Mapping similar source connections in Watson Knowledge Catalog.

Procedure

  1. On the IBM Spectrum Discover Dashboard, search for the data to be exported by using specific filter criteria.
  2. Click Export Data. The Export Data to Watson Knowledge Catalog window appears.
  3. Under Destination Catalog, select the catalog in Watson Knowledge Catalog where you want to export the data.
  4. Select the tags that you want to export from the list in Metadata Tags to Export.
  5. Click Submit.
  6. After completion of the process, the exported data is displayed in the Watson Knowledge Catalog with the tags that are imported from IBM Spectrum Discover.
    Note: Tags in IBM Spectrum Discover represent a name (for example, SizeRange) and a value (for example, small, large, or medium). In Watson Knowledge Catalog, the tags represent a value only. The export maps both of these attributes and creates a single label. For example, SizeRange:Small.
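
The name and value mapping described in the note can be illustrated with a one-line shell function. This is only a sketch of the resulting label format: make_label is a hypothetical name, and the function joins the two parts exactly as given, so it assumes nothing about how the connector handles capitalization.

```shell
# Join a tag name and a tag value into a single label, separated by a colon.
# $1 = tag name, $2 = tag value
make_label() {
    printf '%s:%s\n' "$1" "$2"
}

make_label SizeRange Small   # prints SizeRange:Small
```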