Amazon S3

Check out the guides below for more details on setting up this scanner.

Extraction and Analysis Phase Scenarios

Extraction Phase

For the extraction phase of Amazon S3 storage, there is only one scenario.

  1. Amazon S3 extractor scenario — connects to each configured Amazon S3 instance, scans all the buckets, filters them, and then analyzes the contents of all filtered buckets.

  2. IBM Automatic Data Lineage supports Git Ingest connections from version 42.4, for the download of files from a Git repository to the Amazon S3 workflow. For more information, see Manta Flow Agent Configuration for Extraction:Git Source

Analysis Phase

For the analysis phase for Amazon S3 storage, there is only one scenario.

  1. Amazon S3 dictionary dataflow scenario — analyzes metadata provided by the Amazon S3 extractor scenario and stores it in the internal metadata repository.