Data lineage

Data lineage is the process of tracking data as it is moved and used by different software tools. Use Manta Data Lineage to increases data pipeline transparency so you can determine data accuracy throughout the models and systems.

Required services
IBM Knowledge Catalog with IBM Manta Data Lineage service enabled. For more information how to enable data lineage, see Enable data lineage.
Required permissions
  • Manage data lineage or Access data lineage permission.

You can use data lineage in these ways:

  • You can understand your data through visual representation on the lineage graph.
  • You can track your data and learn where it came from, how it was transformed, and where the data was moved.
  • You can check your data in one view for quality scores or transformations.

Before you start using Manta Data Lineage, you need to perform additional tasks to prepare your data. See Preparing data for data lineage.

Data lineage roles and permissions

How you can use the data lineage depends on your assigned roles and permissions. To determine your roles and permissions, see Determining your roles and permissions.

Permission Tasks
Manage data lineage - Run metadata import jobs
- Publish assets from metadata jobs to projects or catalogs
- View monitor and manage page
- Delete lineage from monitor and manage page
- View lineage repository page
- View lineage graphs for all assets in the repository
- Add or delete external agents
- Update alias mappings and file system mappings
- Select Cloud Object Storage to enable lineage
Access data lineage - View lineage repository
- View lineage graphs for all assets in the repository
Role Permission
Lineage Administrator - Manage data lineage
- Access data lineage
- Create data source definitions

Learn more

Parent topic: Data governance