Exporting data lineage to Collibra

Export the data lineage to Collibra to view it in the Collibra diagrams.

Overview

When you export data lineage to Collibra, you view it in diagrams in Collibra Data Governance Center. The data is imported to communities and domains. Assets are organized in the following domains:

  • Systems and databases: technology assets
  • Physical data assets: physical data dictionary
  • Data transformation: mapping domain
  • Reports: report catalog

Supported data sources for lineage export

Data lineage from these sources can be exported and uploaded to Collibra:

  • Amazon RDS for PostgreSQL
  • Amazon Redshift
  • Google BigQuery
  • Greenplum
  • IBM Cloud Databases for PostgreSQL
  • IBM DataStage for Cloud Pak for Data
  • IBM Db2
  • IBM Db2 for z/OS
  • IBM Db2 on Cloud
  • InfoSphere DataStage
  • Microsoft Power BI (Azure)
  • Microsoft SQL Server
  • PostgreSQL
  • Snowflake
  • Tableau
  • Teradata

The Collibra domain model

The following diagram shows the domain model of Collibra assets, relations, and complex relations.

Shows the Collibra domain model with assets and relations

Collibra API endpoints

The APIs that are listed in the following table are used in the lineage export process. The data is uploaded to Collibra in several batches. Each batch must be completely uploaded before the next one is sent. The main API that is used for the core part of the process, which is uploading data, is the synchronization API.

API name API usage
POST /rest/2.0/auth/sessions Used to create the export sessions.
GET /rest/2.0/auth/sessions/current Used to get the state of an existing export session.
POST /rest/2.0/import/synchronize/{synchronizationId}/batch/json-job Used to upload all metadata (domains, assets, relations, complex relations, attributes) when synchronization of assets on the target system is enabled.
POST /rest/2.0/import/json-job Used to upload all metadata (domains, assets, relations, complex relations, attributes) when synchronization of assets on the target system is disabled.
POST /rest/2.0/import/synchronize/{synchronizationId}/finalize/job Used to trigger the final step of the upload job.
GET /rest/2.0/jobs/{jobId} Used to get the status if a particular job.

Learn more