Joining virtual objects by using the Watson Query UI

Important: IBM Cloud Pak® for Data Version 4.8 will reach end of support (EOS) on 31 July, 2025. For more information, see the Discontinuance of service announcement for IBM Cloud Pak for Data Version 4.X.

Upgrade to IBM Software Hub Version 5.1 before IBM Cloud Pak for Data Version 4.8 reaches end of support. For more information, see Upgrading from IBM Cloud Pak for Data Version 4.8 to IBM Software Hub Version 5.1.

You can join multiple tables from multiple data sources into a single virtual table, which is also known as a join view.

Remember:
The data requests (Data > Data requests) feature was removed in Cloud Pak for Data Version 4.8.0. Consider workflows instead.

Procedure

To create a virtual view from existing virtualized tables, complete the following steps.

  1. On the navigation menu, click Data > Data virtualization to reveal the service menu.
    The service menu opens to the Data sources page by default.
  2. On the service menu, click Virtualization > Virtualized data.
    Your existing virtualized tables are listed.
  3. Select two tables that you want to join and click Join to display the Join virtual objects window.
    Tip: If you prefer, you can click Open in SQL editor to skip the following steps and use the IBM common SQL engine instead. See the SQL Reference for more details on SQL syntax and function compatibility.
  4. Use the graphical join wizard to select at least one join key, which consists of a pair of columns, of the same data type, from both of the virtualized tables. Then, from each table, select the columns to include in the join results. This step does not copy or move any data. It creates a table definition that is a combination of the two tables.
    Joining two tables
    To create a join key by using only the keyboard, follow these steps:
    1. Press Enter to select a column name in a row in Table 1.
    2. Press the Tab key to navigate to a column name in a row in Table 2.
    3. Press Enter on the row in Table 2. The rows are joined.
    Restrictions:
    • If you are joining tables with many rows, the preview of the join might time out after approximately 10 minutes if the data sources are unable to complete the processing of the join.
    • If the columns of the tables that you are joining do not share any common data, the preview of your join view might be empty. You can continue to join your virtual objects; however, the join view might not contain valuable data. Any data that you add to the virtual objects is automatically reflected in the join view.
    • You can join only two tables at one time. To join more than two tables, join two tables and get a view. Then, join the view and the third table to get another view.
    • Privileges and authorities that are granted to user groups are not considered when you create views. This limitation is a result of a Db2® limitation on groups.

      For more information, see Privileges and authorities that are granted to user groups are not considered when you create views.

  5. Click Next. You can use the new table to query the data from both of the base tables.
  6. On the Edit column names page, enter a View name, select a Schema, and edit column names as needed.
  7. Select the appropriate sharing options for the view.
  8. Click Create view to complete the process.

Results

If Watson Query and IBM® Knowledge Catalog are installed in the same OpenShift® project (namespace), your virtual object is published to the primary catalog.

What to do next

You can use the virtualized data in a number of different ways. For example, you can use them in a Jupyter Notebook, create new models within the Model Builder, or build charts or graphics on the analytics dashboard.
Note: You cannot apply data masking policies with views. For more information, see Limitations for data masking.