Presto connection

To access your data in Presto, create a connection asset for it.

Presto is a fast and reliable SQL engine for Data Analytics and the Open Lakehouse.

Supported versions

  • Version 0.279 and earlier

Create a connection to Presto

To create the connection asset, you need these connection details:

  • Hostname or IP address

  • Port

  • Username

  • Password (required if you connect to Presto with SSL enabled)

  • SSL certificate (if required by the Presto server)
    If you are connecting to a Presto server in the watsonx.data service in one the following environments, you must enter an SSL certificate:

    • Cloud Pak for Data version 4.7.1
    • Stand-alone watsonx.data version 1.0.1

    Follow these steps:

    1. Log in to the OpenShift cluster.
    2. Extract the self-signed TLS certificate:
      oc extract secret/ibm-lh-tls-secret --keys=tls.crt -n ${PROJECT_CPD_INSTANCE}
      The self-signed certificate is extracted to a tls.crt file.
    3. Enter the contents of the tls.crt file in the SSL certificate field.

For Credentials and Certificates, you can use secrets if a vault is configured for the platform and the service supports vaults. For information, see Using secrets from vaults in connections.

Connecting to Presto within IBM watsonx.data

You can use the Presto connection to connect to multiple types of Presto, including the Presto engine within an instance of watsonx.data. However, you must use the watsonx.data connection if you want to use Watson Knowledge Catalog to mask sensitive data with data protection rules on the source data in watsonx.data.

To connect to a Presto server within watsonx.data on IBM Cloud, use these connection details:

  • Username: ibmlhapikey
  • Password (for SSL-enabled, which is the default): An IBM Cloud API key. For more information, see Connecting to Presto server.

To connect to a Presto server within watsonx.data on Cloud Pak for Data or stand-alone watsonx.data, use the username and password that you use for the watsonx.data console.

Choose the method for creating a connection based on where you are in the platform

In a project
Click Assets > New asset > Data access tools > Connection. See Adding a connection to a project.

In a catalog
Click Add to catalog > Connection. See Adding a connection asset to a catalog.

In a deployment space
Click Add to space > Connection. See Adding connections to a deployment space.

In the Platform assets catalog
Click New connection. See Adding platform connections.

Next step: Add data assets from the connection

Where you can use this connection

You can use the Presto connection in the following workspaces and tools:

Projects

  • Data quality rules (Watson Knowledge Catalog)
  • DataStage (DataStage service). See Connecting to a data source in DataStage.
  • Metadata enrichment (Watson Knowledge Catalog)
  • Metadata import (Watson Knowledge Catalog)

Catalogs

  • Platform assets catalog

  • Other catalogs (Watson Knowledge Catalog)

    Note:

    Preview and masking do not work for this connection in Watson Knowledge Catalog.

Federal Information Processing Standards (FIPS) compliance

The Presto connection is compliant with FIPS. However, SSL certificates that you paste into the SSL certificate field are not supported. As a workaround, you can add the certificate to the OpenShift secret named connection-ca-certs. See Using a CA certificate to connect to internal servers from the platform for the procedure.

Presto setup

To set up Presto, see Presto installation.

Restriction

You can use this connection only for source data. You cannot write to data or export data with this connection.

Limitation

The Presto connection does not support the Apache Cassandra Time data type.

Running SQL statements

To ensure that your SQL statements run correctly, refer to the SQL Statement Syntax for the correct syntax.

Learn more

Presto documentation

Related connection: watsonx.data connection

Parent topic: Supported connections