IBM IGC: Quick Start Guide / Configuration Checklist

Get exports from IBM Automatic Data Lineage to IBM IGC up and running.

IBM IGC Prerequisites

For more information about the necessary prerequisites, see Manta Flow IBM IGC Requirements.

Manta Preparation and Steps

Certificate Setup for the HTTPS Connection to IBM IGC

This section is only required if the IBM IGC instance is using a self-signed certificate, as in a security certificate that is not signed by a certificate authority (CA). If the HTTPS certificate is signed by a CA, then you can move on to the IBM IGC Connection Properties section.

IBM IGC uses a secure HTTPS connection for the REST interface by default. In order to establish the secure connection from Manta Flow to IBM IGC, IGC's signer certificate needs to be added to Automatic Data Lineage list oftrusted certificates.

It is possible to use an IGC instance that is configured with a WAS using what is known as the Liberty profile. This is a smaller edition of WAS that is implemented by some accounts. Its technology operates the same, as far as IGC is concerned, but it does not have a browser-based console. This means that the certificate file has to be obtained directly. This has not been tested, but according to IBM documentation, it exists here: The SSL key that is used by the application server is stored in the {IS_install_path}/wlp/usr/servers/iis/resources/security/iis-server-keystore.p12 file. For more information, see SSL certificates for WebSphere Application Server Liberty Profile.

Manta Admin UI provides a keytool in the UI for adding SSL certificates.

No alt text provided

The SSL certificates can be uploaded directly into Admin UI if you have them available on your local machine, or the certificates can be retrieved directly from the IGC Server by providing Automatic Data Lineage with a link to the IGC page.

No alt text provided

Once these steps have been successfully completed, you can configure the Automatic Data Lineage to IBM IGC Export connection via Manta Admin UI.

IBM IGC Connection Properties

This configuration is common for all source systems and is stored in the igcExport.properties file. These configurations can be updated and maintained from within Manta Admin UI and are located in the Integrations section under the Configuration tab.

The following is a link to the page. Replace <manta-server-url> with your Automatic Data Lineage server URL.

<manta-server-url>/app/#/platform/configuration/group/Integrations/category/IGC/resource/IGC%20Export%20Common

No alt text provided

Enter the connection details for all the required fields and save the configuration.

IBM IGC Resource Mapping Configuration

In IBM Information Governance Catalog all metadata from data resources like RDBMSs is assigned to a host and database. Manta Flow has to know this structure to be able to connect the newly created metadata with an IGC native resource. This mapping is covered by a simple configuration that consists of the requested names of the host and database in IGC and a definition of the database objects that should be in this resource. The definition of database elements is slightly different for each SQL dialect. The location for configuring these definitions is <manta-server-url>/app/#/platform/configuration/group/Integrations/category/IGC/resource/${<technology>}%20IGC%20Mapping where ${<technology>} represents the exported technology.

No alt text provided

Enter the mapping details for each resource you are planning to export to IGC and save the configuration.

No alt text provided

The IGC Host and IGC Database used in the configuration relate to the highlighted elements in your IGC Implemented Data Resources as follows.

No alt text provided

Important: The IGC Host and IGC Database names must be exactly the same in Automatic Data Lineage as they are in IBM IGC.

Each resource technology has its own resource mapping configuration. Here is an example of a mapping for an MS SQL resource.

Microsoft SQL Server

For additional information regarding the configuration of IBM IGC resource mappings, see Manta Flow IBM IGC Client Configuration.

IBM IGC Preparation and Steps

Importing the Manta Meta Model to IGC

Manta Flow uses its own meta model in IBM Information Governance Catalog. This model must be uploaded prior to any exports being executed.

Note: The meta model only needs to be completely uploaded when initially setting up the Automatic Data Lineage export to IGC and after upgrading Automatic Data Lineage to a newer version (to apply any changes).

This can be done via an IBM IGC REST API call, which can conveniently be done through the IBM IGC Swagger page — the URL for this operation is <Infosphere_url>/ibm/iis/igc-rest-explorer/#!/bundles/registerBundle. The Manta Meta Model itself is stored in the directory <MANTA_DIR_HOME>/scenarios/manta-dataflow-cli/model.

Updating WebSphere API Limits for Lineage Upload

The default limits configured for the WebSphere Application Services API may not be sufficient for loading data lineage into IGC. If you need to increase the maximum memory allocation or number of objects kept in memory. To incerease the limits, see Prepare Information Governance Catalog for use with IBM Industry Models.

It is also necessary to increase the session timeout as described in Changing timeout values to allow import of large files to InfoSphere Business Glossary or to InfoSphere Information Governance Catalog.

Updating the Manta IGC Export Common Payload Size

An alternative/parallel approach to combating the limits configured for the WebSphere Application Services API is to reduce the Manta Export payload size by lowering the asset count per batch value. By default, this value is set to 10000. You can reduce it to a lower number by navigating to the Export Settings section in the IGC Export Common page: <manta-server-url>/manta-admin-gui/app/#/platform/configuration/group/Integrations/category/IGC/resource/IGC%20Export%20Common and updating the field value.

No alt text provided

Manta Flow IBM IGC Export Execution

To export metadata from the Manta Flow metadata repository and upload it to IBM IGC:

  1. Once all the necessary metadata is in Manta Flow Server, run the Run Export IGC workflow; otherwise, execute the whole extraction, analysis, and export process by running the Run IGC scenario.

  2. After the export is completed, the temporary output files used for upload are stored in the ${manta.dir.output}/<technology>/${<technology>.dictionary.id}/igc folder, where <technology> represents the exported technology.

Important: When executing the Run Export IGC workflow without targeted scans enabled, only the dictionary mapping information for the most recent extraction for the technology is available for export into IGC. This might cause errors when exporting data that was gathered from resources scanned prior to the most recent extraction. An example of an error message you might run into in this situation is There is no mapping for the.... To allow exporting data that was gathered over time, rather than the most recently extracted data set, you can either:
- Enable targeted scanning.
- Each time you want to export data that was captured earlier, run <technology>DictionaryMappingScenario for the selected connections before you run the Run Export IGC scenario. As a result, a new dictionary mapping is generated to use with the IGC communication.