Connecting to Amazon S3 in Watson Query

To create a connection to Amazon S3 in Watson Query, you must provide specific connection details.

For more information, see Data sources in object storage in Watson Query.

Before you begin

To access data that is stored in object storage such as Amazon S3, you must create a connection to the data source where the files are located. Watson Query integrates with object storage connections and supports the following data formats: PARQUET (or PARQUETFILE), Optimized Row Columnar (ORC), comma-separated values (CSV), tab-separated values (TSV), and JSON. No other file formats are supported.
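
As a quick pre-check before you create the connection, the following minimal Python sketch flags objects whose file extension does not match one of the supported formats. The extension-to-format mapping and the function name supported_format are assumptions made for illustration; they are not part of Watson Query, which might identify formats differently.

```python
from pathlib import PurePosixPath
from typing import Optional

# Formats named in this topic. The file extensions are an assumption
# for illustration; Watson Query might not rely on extensions at all.
SUPPORTED_EXTENSIONS = {
    ".parquet": "PARQUET",
    ".orc": "ORC",
    ".csv": "CSV",
    ".tsv": "TSV",
    ".json": "JSON",
}


def supported_format(object_key: str) -> Optional[str]:
    """Return the format name for an object key, or None if unsupported."""
    suffix = PurePosixPath(object_key).suffix.lower()
    return SUPPORTED_EXTENSIONS.get(suffix)
```

For example, supported_format("sales/2023/orders.parquet") returns "PARQUET", and supported_format("reports/summary.xlsx") returns None.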

Procedure

To connect to Amazon S3 in Watson Query, follow these steps.

  1. On the navigation menu, click Data > Data virtualization. The service menu opens to the Data sources page by default.

  2. Click Add connection > New connection to see a list of data sources that can be added to Watson Query.

  3. Select the Amazon S3 data source connection.

  4. Enter the connection name and description.

  5. To configure an Amazon S3 connection, you must provide values for the following parameters.
    • Bucket
    • Endpoint URL
    • Access key
    • Secret key

    To find the values for the Access key and Secret key, see Understanding and getting your AWS credentials.

  6. Click Create to add the connection to the data source environment.
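
The four parameters in step 5 can be sanity-checked before you enter them in the connection form. The following Python sketch is one way to do that, assuming only that each value must be non-empty and that the endpoint is an http or https URL. The class S3ConnectionDetails, the function find_problems, and the sample values are hypothetical; the sketch does not contact Amazon S3 or Watson Query.

```python
from dataclasses import dataclass
from typing import List
from urllib.parse import urlparse


@dataclass
class S3ConnectionDetails:
    """Hypothetical container for the parameters listed in step 5."""
    bucket: str
    endpoint_url: str
    access_key: str
    secret_key: str


def find_problems(details: S3ConnectionDetails) -> List[str]:
    """Return a list of problems; an empty list means the basic checks pass."""
    problems = []
    for name in ("bucket", "endpoint_url", "access_key", "secret_key"):
        if not getattr(details, name).strip():
            problems.append(f"{name} is empty")

    endpoint = details.endpoint_url.strip()
    if endpoint:
        parsed = urlparse(endpoint)
        # Assumption: the endpoint must include a scheme and a host name.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            problems.append("endpoint_url is not an http(s) URL")
    return problems


# Sample values for illustration only; replace them with your own.
details = S3ConnectionDetails(
    bucket="example-bucket",
    endpoint_url="https://s3.us-east-1.amazonaws.com",
    access_key="EXAMPLEACCESSKEY",
    secret_key="example-secret-key",
)
problems = find_problems(details)
```

A check like this catches only formatting mistakes, such as an endpoint that is missing its scheme. It does not confirm that the credentials are valid or that the bucket exists.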

Results

You can now use Amazon S3 as a data source in Watson Query.