Adding data from a connection to a project (Watson Studio and Watson Knowledge Catalog)

A connected data asset is a pointer to data that is accessed through a connection to an external data source. You create a connected data asset by specifying a connection, any intermediate structures or paths, and a relational table or view, a set of partitioned data files, or a file. When you access a connected data asset, the data is dynamically retrieved from the data source.

You can also add a folder asset that is accessed through a connection in the same way. See Add a connected folder asset to a project.

Partitioned data assets have previews and profiles and can be masked like relational tables. However, you cannot yet shape and cleanse partitioned data assets with the Data Refinery tool.Partitioned data is recognized and treated like a relational table if the files meet these requirements:

You can add data and COBOL copybooks assets from mainframes to projects with a connection to Data Virtualization Manager for z/OS. The process is similar to adding these types of assets to a catalog. See Adding COBOL copybook assets.

To add multiple tables or files from a connection in a single repeatable job, use the Import metadata tool. See Importing metadata.

To add a data asset from a connection to a project:

  1. Open the list of assets to import. The menu options vary depending on the version of Cloud Pak for Data you are running:
    1. From the project page, click Assets > New asset > Data access tools > Connected data.
    2. From the project page, click Assets > Import asset > Connected data.
  2. Select an existing connection asset as the source of the data. If you don't have any connection assets, cancel and go to New asset, and select Data access tools > Connection and create a connection asset.
  3. If necessary, enter your personal credentials for locked data connections that are marked with a key icon (the key symbol for connections with personal credentials). This is a one-time step that permanently unlocks the connection for you. After you have unlocked the connection, the key icon is no longer displayed. See Adding connections to projects.
  4. Select the data you want and click Select. For partitioned data, select the folder that contains the files. If the files are recognized as partitioned data, you see the message This folder contains a partitioned data set.
  5. Type a name and description.
  6. Click Create. The asset appears on the project Assets page.

When you click on the asset name, you can see this information about connected assets:

Watch this video to see how to create a connection and add connected data to a project.

This video provides a visual method as an alternative to following the written steps in this documentation.

Next steps

Learn more

Parent topic:

Adding data to an analytics project