Creating a data product from a project

You can create a data product by adding assets from a project. Projects are used for data products containing SQL queries or when you do not have access to IBM Knowledge Catalog. With an SQL query, you can customize a consumer's view of a data and generate data joins. By customizing the consumer's view on data, you can hide personal and sensitive information or create filters based on the consumer's request details. This means that data products can be reused more frequently as producers can edit the SQL query on the same data products to generate different views.

Required roles and permissions to complete this task

Collaborator role: Editor or Admin

Cloud Pak for Data permission: Data Product Hub Provider

Before you begin, you must have the following:

  • Valid credentials that verify a data source connection. Ensure that this connection is supported by Data Product Hub. For details, see Delivery methods for connectors.

Adding data assets to a project

From within a project, add new assets to populate your project:

  • Adding data assets from a connection: Select the Connect to a data source tile and complete the fields with your credentials. Then, click Import assets and select the data assets from your connection that you want to add.

  • Creating an SQL query: Select the Create a dynamic view of data tile and complete the indicated fields to define the query. Do not complete the Parameter set field.

  • Adding data assets from data lineage: Select the Import for metadata for data assets tile and complete the fields with your credentials. Then, click Import lineage metadata and select a data source definition, connection to the data source and scope for your data that you want to add.

From a subscribed data product that is accessed through watsonx.data, you can add data assets to a project:

  1. Find your data product in My Subscriptions and select the Actions menu icon Actions menu > Add to Project. Details of the connections and assets that can be created in the project will appear.
  2. To provide connection credentials: Select the connection and go to Connection Details.
  3. If you did not select the Presto engine with the access in watsonx.data delivery method, you must provide the following information in the Engine Connection Details section:
    • Engine hostname or IP address
    • Engine ID
    • Engine port
  4. In Credentials, provide your username and password.
  5. Click Test Connection to validate your input.
  6. After a successful test, return to the project to access the data assets.

Adding assets from a project to a data product

  1. From the Data Product Hub homepage, select New data product and provide a name for your data product. Then, choose the Add from project tile.

  2. Select the items from the project assets list that you want to add to your data product. You can expand the drop-down to view more details about each data asset. For an item to be included in a data product, it must be either a notebook asset or a connected data asset. If it is a connected data asset, it must come from a supported data source connector.

  3. Verify the connections of all assets. To verify a connection, click the connection status and complete the fields by providing credentials. Make sure that you use an appropriate set of credentials as these credentials are used to deliver the data product to consumers following a subscription.

  4. Click Create draft to confirm your data asset selection. When your draft is successfully created, a static visualization of your data product is generated and viewable by the consumer. The visualization is only generated if the Data Visualization toggle is on.

Adding notebooks to a data product

  1. From the Data Product Hub homepage, select New data product and provide a name for your data product. Then, choose the Add from project tile.
  2. Select the notebook you want to add to your data product. If you want to hide cells from consumers you can use this parameter in the cell: #hidden_cell.
  3. Click Create draft to confirm your data asset selection.
  4. Once you are done editing the data product details, select Publish so consumers can access your data product.

Consumers can only download notebooks and add them to a project.

Next steps

After you create your data product draft, see Completing a data product to finish preparing your data product for publication.