Impala via Execution Engine for Hadoop connection

You can create a connection asset for Impala via Execution Engine for Hadoop.

Use the Impala via Execution Engine for Hadoop connection to connect to data stored in tables in Impala on the Hadoop cluster.

Prerequisites

Supported encryption

Credentials

Username and password

For Credentials and Certificates, you can use secrets if a vault is configured for the platform and the service supports vaults. For information, see Using secrets from vaults in connections.

Create an Impala via Execution Engine for Hadoop connection to the Hadoop cluster

  1. Click Add to project > Connection.
  2. Select Impala via Execution Engine for Hadoop.
  3. Enter a name and description.
  4. Enter the connection details:
  5. Hostname or IP Address: Hostname or IP Address where the Impala daemon is available.
  6. Port: Impala daemon's port from Hadoop cluster.
  7. Port is SSL-enabled: Select if Impala daemon is SSL-enabled.
  8. SSL certificate: Provide the SSL certificate of the Impala daemon if it is SSL-enabled.
  9. In the Jar uris drop-down list, upload the ImpalaJDBC41.jar file if it is not already there, and then select it.
  10. For Credentials, select Personal and enter the user's LDAP userid and password.
    Note: The connection to Impala via Execution Engine for Hadoop must be a personal connection. The connection cannot be shared. Other users must enter their own credentials when accessing this connection.
  11. Click Create.

Next step: Add data assets from the connection

Where you can use this connection

You can use an Impala via Execution Engine for Hadoop connection in the following workspaces and tools:

Analytics projects

Catalogs

Data Virtualization service You can connect to this data source from Data Virtualization.

Restrictions

Known issues

Troubleshooting Hadoop environments

Parent topic: Supported connections