Importing ETL jobs (Watson Knowledge Catalog)

Import ETL jobs to govern those jobs as data integration assets in catalogs.

You can import ETL jobs starting in Cloud Pak for Data 4.7.2.

This import option is not available in projects that are marked as sensitive.

Before you import metadata, design your metadata import so that you understand all your options and make the appropriate choices for your goals. See Designing metadata imports.

You can also use APIs instead of the user interface to retrieve the list of supported connections or to create a metadata import asset. The links to these APIs are listed in the Learn more section.

Asset types

Data integration assets that represent components of ETL jobs. See Asset types created through metadata import.

Supported connections

See the Other data sources section in Supported connectors.

Required permissions

To create, manage, and run a metadata import, you must have these roles and permissions:

The Manage asset discovery user permission.
The Admin or the Editor role in the project.
The Admin or the Editor role in the catalog to which you want to import the assets.

Prerequisites

Before you can import an ETL job, complete the following prerequisite tasks:

For InfoSphere DataStage, Talend, or Informatica PowerCenter ETL jobs, create an ETL job file and upload it to your project. See Preparing ETL job files.
For DataStage flows (DataStage on Cloud Pak for Data), select flows from your project.

Creating the metadata import asset and importing data models

To create a metadata import asset and a job for importing ETL jobs to a catalog:

Depending on the outcome of the metadata import job run, a completion message or an error notification is displayed.

A completion message is displayed when the job run completed successfully, completed with warnings, or completed with errors. An error notification is displayed if the entire job run failed. Either type of notification contains a link to the job run log that provides details about the specific job run.

When the import is complete, you can see the list of assets with the following information:

The asset name, which provides a link to the asset in the catalog.
The asset type, such as Data integration job.
The asset context, such as the file path of the element.
The date and time that the asset was last imported.
The import status, which can be Imported for successfully imported data, In progress, or Removed if the asset couldn't be reimported.

In the catalog, the imported assets have a tag automatically assigned that reflects the originating data integration tool.

Learn more

Next steps

Managing existing metadata imports

Parent topic: Importing metadata