OpenLineage lineage configuration
To import lineage metadata from OpenLineage, create a connection, data source definition and metadata import job.
To import lineage metadata for OpenLineage, complete these steps:
- Create a data source definition.
- Create a connection to the data source in a project.
- Create a metadata import.
Creating a data source definition
Create a data source definition. Select OpenLineage as the data source type.
Creating a connection to OpenLineage
Create a connection to the data source in a project. For connection details, see OpenLineage connection.
Creating a metadata import
Create a metadata import. Learn more about options that are specific to OpenLineage data source:
Include and exclude lists
You can include or exclude assets by using job namespaces in OpenLineage events. The whole input is evaluated as a regular expression. Example values:
myPrestoApp1Namespace
: all events with job namespacemyPrestoApp1Namespace
.mySparkApp[1-5]Namespace
: all events with job namespace that starts withmySparkApp1Namespace
and ends with a digit between 1 and 5.
External inputs
You can add OpenLineage events as external inputs. The file can have the following structure:
<event_file_name>.json
Learn more
Parent topic: Supported connectors for lineage import