Running SQL queries

SQL is a standardized language for defining and manipulating data in a relational database. You can use the Query workspace interface in IBM® watsonx.data to run SQL queries and scripts against your data.

watsonx.data Developer edition

watsonx.data on IBM Software Hub

About this task

To run the SQL queries, do the following steps:

Procedure

  1. Log in to the watsonx.data console.
  2. From the navigation menu, select SQL. The Query workspace page opens.
  3. Browse to explore the following options in the Query workspace page.
    • Engine: To select an engine and view the associated catalogs.
    • Filter for tables: To search the tables and columns.
    • Worksheets: To write SQL queries.
    • watsonx.data provides pre-defined Sample and benchmarking data, such as tpch and tpcds, to test the performance of a database system under controlled conditions. It also provides System monitoring data that uses jmx and system metrics to collect data about the system's health and performance during benchmark testing to understand how the system responds to the workload. Sample and benchmarking data, tpch and tpcds can only be queried using the Presto engines.
    • Saved worksheets: To view the saved queries.
    • Sample worksheets: To run predefined sample queries to analyze the performance between different engines on the same data set.

    The Query workspace page provides basic options to undo, redo, cut, copy, paste, save, clear, and delete.

    Format selection option is enabled only when an SQL statement in a query worksheet is selected. The Format worksheet option formats all the content in the worksheet. Comment selection is used to explain sections of SQL statements.

    The Delete option is enabled only after an SQL query is saved.

  4. Select an engine from the Engine menu.
  5. Select the catalog, schema, table, or column in which you want to run the query.
  6. Click the overflow menu and select the required query.
    • For a catalog and schema, you can run the Generate Path query.
    • For a table, you can run the Generate path, Generate SELECT, Generate ALTER, and Generate DROP query.
    • For a column, you can run the Generate path, Generate SELECT, and Generate DROP query.
  7. Select the Catalog and corresponding Schema from the drop-down on top of the worksheet to run queries for all tables within the schema without having to specify the path (<catalog>.<schema>) for every queries.
  8. Click the Save icon to save the query. A Save worksheet confirmation dialog appears.
  9. Enter worksheet name, click Save.
  10. Click the Run button to run the query. Using Run to cursor or Run from cursor, you can run queries from or until your cursor position.
    Note: You can cancel one or multiple running queries. Additionally, you can remove queries from the worksheet after they are canceled or successfully completed, making it easier to keep your workspace organized. These options are available for each respective query in the Worksheet results view.
  11. Select Result set or Details tab to view the results. Click the Download icon to export the result set and details to a CSV file.
  12. Click Saved queries to view the saved queries.
  13. Click Explain to view the logical or distributed plan of execution for a specified SQL query.
    Related API: For information on related API, see