Viewing data lineage
You can view lineage for the assets that are in the lineage repository. Choose predefined configurations or select custom.
Required permission
You must have the following user permission:
- Manage data lineage or Access data lineage
Viewing lineage
To view your lineage:
- Go to Data > Data lineage > View lineage tab to view lineage.
- Add assets from the repository as the starting assets for your lineage. The chosen assets are listed in the Selected assets panel.
- Customize the initial view of your lineage. See Lineage filters.
- Click View lineage.
Starting assets and starting parents
When you select a starting asset, its related assets up and down in the hierarchy are marked as starting parents and starting assets. For example, when a table is selected as a starting asset, this table and all its columns are marked as starting assets. The schema and database are marked as starting parents.
Repository
All available lineage assets are listed in the Repository. In the search bar, you can look for a specific asset.
To narrow down your search, you can filter assets by: technologies, types, business terms, or tags.
Previous versions of the data lineage
Access earlier versions of lineage. When a system is added and rescanned, you can choose to view lineage from previous scans, allowing you to track changes and analyze how lineage has evolved over time. Click the Lineage versions icon to open a window, where you can select any earlier version of your lineage.
Lineage filters
When you select starting assets for which you want to display lineage, you can further customize the size and contents of the lineage with the filters.
Lineage scope
Define the scope of lineage based on the number of assets in relation with the starting assets. By default, three assets in the upstream and downstream directions from the starting asset are displayed.
You can use the following filters to change the size of the lineage:
- Range:
- Asset range: a custom range of assets is displayed, in relation to the starting asset. You define how many assets are displayed in the Hops from the starting assets option. For example, when you set
the number of hops to
5, then the lineage shows the starting assets and five nearest assets in the defined direction. The maximum number of hops is 50. - Only source and target assets: a starting asset, its original source asset and final target asset are displayed. All assets that are in between are hidden.
- Complete lineage: a complete lineage with all assets is displayed. The maximum number of assets that can be displayed is 50 in each direction from the starting asset.
- Asset range: a custom range of assets is displayed, in relation to the starting asset. You define how many assets are displayed in the Hops from the starting assets option. For example, when you set
the number of hops to
- Data flow direction:
- Upstream: the assets that flow towards the starting asset from the direction of the source asset are displayed.
- Downstream: the assets that flow from the starting asset in the direction of the target asset are displayed. By default, assets that flow in both directions are displayed.
Asset attributes
Apart from modifying the size of your lineage, you can decide what assets to display. You can filter on a type of asset, a technology or various asset attributes like tags or assigned business terms.
Lineage asset groups
Lineage asset groups are marked with tags to make them easy to see. By default all groups are displayed. To hide them, set each group to off. You can filter the following asset groups:
- Deduced: A deduced asset is an inferred object that is created by the system when it encounters references to unknown or missing components during data lineage extraction. Deduced assets are created to fill the gaps in an incomplete lineage.
- Operational: Operational group refers to a job, process, or step that interacts with data while the data is in motion.
- Transforming: Transforming asset is a type of operational asset that changes data by altering its values, structure, or lifecycle.
Monitoring and managing lineage
Go to the Monitor and manage tab on the Data lineage page to view lineage usage with the current number of assets and tables. You can also see lineage usage by technology and data source definition.
Deleting data lineage
You can delete your lineage on the Monitor and manage page. In the table Usage by data source definition, hover over a data source definition and click Delete lineage. When you delete a lineage, all assets that are included in this lineage are removed from the lineage repository. Related data source definitions and metadata imports are not affected. When a related metadata import job is in the running state, it is stopped and the lineage assets are removed.
You can also delete lineage for multiple data source definitions at the same time.