GitHubContribute in GitHub: Edit online

Manta Flow Scanner (Client) Configuration

Manta Flow Client Configuration

(Optional) Create the folders input, output, temp, and log; set up read rights for the input folder and read/write rights for the rest of the folders. You can skip this step if you want to use the default locations (listed below). The exact location of each of the folders is passed to the application by setting the bash/Windows command line environment variables. The names of these properties are shown in the following table.

Folder name Property name Default value
<MANTA_DIR_USER> MANTA_DIR_USER <MANTA_DIR_HOME>
input MANTA_DIR_INPUT MANTA_DIR_USER/input
output MANTA_DIR_OUTPUT MANTA_DIR_USER/output
log MANTA_DIR_LOG MANTA_DIR_USER/log
temp MANTA_DIR_TMP MANTA_DIR_USER/temp

The default configuration MANTA_DIR_USER indicates that the Manta user directory is located in the root of the IBM Manta Data Lineage application directory by default.

Installing Third-Party Drivers

Depending on what source systems you want to analyze, you may need to install additional libraries to connect to some of them. The following table lists third-party drivers that can be used to connect Manta Data Lineage products to various source systems.

Name

Version

Link

Teradata JDBC Driver

17.10.00.20

https://downloads.teradata.com/download/connectivity/jdbc-driver

MySQL Connector/J

5.1.38

https://dev.mysql.com/downloads/connector/j/

MariaDB Connector/J

3.2.0-GA

https://mariadb.com/downloads/connectors/connectors-data-access/java8-connector/

AWS JDBC Driver for PostgreSQL

0.1.0

https://awslabs.github.io/aws-postgresql-jdbc/

Sybase / SAP ASE JDBC Driver

3, 4

https://wiki.scn.sap.com/wiki/display/SYBCON/jConnect+Driver+Overview

Hive JDBC

2.1.0

https://cwiki.apache.org/confluence/display/Hive/HiveServer2+Clients

SAP HANA JDBC Driver

2.9.16

https://developers.sap.com/tutorials/hana-clients-jdbc.html

Download the library (a *.jar file) that you want to use and add it to the <MANTA_DIR_HOME>/scenarios/manta-dataflow-cli/lib-ext/ folder.

Installing Custom Extensions

In rare cases where a customer-provided extension library needs to be added, this too should be placed in the <MANTA_DIR_HOME>/scenarios/manta-dataflow-cli/lib-ext/ folder. If the extension library needs to be initialized sooner (typically logging extensions), it is possible to place it in the <MANTA_DIR_HOME>/platform/lib-ext/ folder. (You might have to create the folder because it does not exist by default). Do not place any extensions in the <MANTA_DIR_HOME>/scenarios/manta-dataflow-cli/lib/ or <MANTA_DIR_HOME>/platform/lib/ folders—these might be cleaned during an application update and your extensions would be lost.

Resource Configuration

It is necessary to configure the Manta Flow client before its first launch and whenever the environment is changed.Based on the architecture described in Manta Flow Client Architecture, there are three kinds of configurations.

To create a configuration for a new source system (Oracle or Teradata database, PowerCenter repository, etc.), create a new connection in Admin UI / CLI / Add connection. The following sections of this knowledge base include further descriptions of configurations by source type.

Encodings

When specifying encoding in Manta Data Lineage property files (the property name ends with .encoding), refer to the column "Canonical Name for java.nio API" in the Java supported encodings (https://docs.oracle.com/en/java/javase/17/intl/supported-encodings.html#GUID-187BA718-195F-4C39-B0D5-F3FDF02C7205).

Placeholder Replacement in Input Scripts

All scanners that accept text-based input files—that is, all SQL-based scanners (input files provided according to Manta Flow Usage)—can utilize placeholder replacement rules to replace any placeholders and/or content that does not adhere to the dialect syntax with pieces of information that make the input valid script for parsing.

Transformation Logic

This module adds attribute-level transformation descriptions specific to a target (column, routine return value, etc.) on the dataflow nodes leading to it. This description contains only the logic needed to create a stored value for this specific target, without conditions.

Supported source systems: