Setting up a connection to a remote data source

The next step in provisioning a Db2® Big SQL instance involves setting up a connection to one or more remote data sources.

Before you begin

To connect Db2 Big SQL to a remote Hadoop cluster or object store, you must meet some requirements. For more information, see Remote Hadoop cluster or public or private object store.

About this task

You can connect to a Hadoop cluster on Cloudera Data Platform (CDP) Private Cloud Base 7.1.6 (Db2 Big SQL 7.2.0-7.2.3) and 7.1.7 (Db2 Big SQL 7.2.5 and later), an object store, or to both a Hadoop cluster and an object store. In the third scenario, you must configure the remote cluster to connect to the object store because Db2 Big SQL uses the Hive metastore as its catalog. For more information about each scenario, see Db2 Big SQL architecture.

Note: If updates are made to the remote Hadoop cluster's core-site.xml configuration file to set up the cluster access to an object store service, its Hive service must be restarted, even if the cluster manager doesn't indicate that a restart is needed.

Procedure

  1. To connect to a Hadoop cluster, select the Configure Hadoop cluster checkbox, and specify the following information:
    • The Hadoop cluster manager (Cloudera cluster manager server) host URL
      Note: If you are connecting to a Cloudera cluster manager server over SSL, you must configure SSL. For more information, see the post-provisioning task Configuring SSL access.
    • The Hadoop cluster manager username
    • The Hadoop cluster manager user password
  2. Under Configure Kerberos, select one of the following options.
    • If Kerberos is not enabled on the cluster, select No Kerberos.
    • To allow Db2 Big SQL to automate the creation of principals and keytabs when MIT Kerberos security is enabled on the Hadoop cluster, select the Using MIT KDC Kerberos checkbox, and specify the following information for the user that can create principals for Db2 Big SQL in the Kerberos Key Distribution Center (KDC):
      • The Kerberos admin principal
      • The Kerberos admin password
    • If Active Directory is used for Kerberos, select Using a custom keytab, and then click Upload to upload a keytab file.

      The contents of the keytab file must be base64 encoded. For more information, see Enabling Active Directory as the Kerberos server.

  3. To connect to an object store, select the Configure object store checkbox, and specify the following information:
    • The object store service endpoint
    • The object store service access ID key
    • The object store service secret key
    Remember: If the Db2 Big SQL instance is also set up to connect to a Hadoop cluster, the cluster must be configured to access the same object store service.
  4. If you are accessing a public object store, select the With SSL checkbox.
    Note: Most public cloud object stores use a well-known certificate authority, so it is generally not necessary to provide a Secure Sockets Layer (SSL) certificate in the SSL Certificate box. But select the With SSL checkbox so that the connection to the object store is secure.
  5. If you are accessing an on-premises, self-hosted object store that is accessed by using SSL, do the following steps.
    1. Select the With SSL checkbox.
    2. In the SSL Certificate box, copy the SSL certificate in base64 encoded Privacy Enhanced Mail (PEM) format, including the PEM header and footer.
      For example,
      -----BEGIN CERTIFICATE-----
      MIIFVzCCAz+gAwIBAgIJAM+JlcdkA2RaMA0GCSqGSIb3DQEBCwUAMEIxCzAJBgNV
      BAYTAlhYMRUwEwYDVQQHDAxEZWZhdWx0IENpdHkxHDAaBgNVBAoME0RlZmF1bHQg
      ...
      ZzV0pevZpJWFCt2QYEprZppj0KyiGHKQcEXAn/953YPTOmmdzGVOu5eLoTncICte
      oBzKDOdxT6CTenizfaiP5LlWH1LfPvw/+0Nz
      -----END CERTIFICATE-----
  6. If you are connecting to an object store or to both a Hadoop cluster and an object store, and you want to limit access to a single object store bucket, select the Specify the object store bucket name checkbox, and enter the bucket name.
  7. Click Next.
  8. In the Summary page, review the information that you specified.
    1. If you want to make changes, click Previous and go to the page where you want to edit information.
    2. When all configuration information is correct, click Create.

Results

A Db2 Big SQL instance is created.

What to do next

Depending on your environment, you might have to do some post-provisioning tasks. For more information, see Post-provisioning tasks.

After you provision an instance, you must add one or more users to the instance. You (instance owner) are not automatically added as a user. For more information, see the section on specifying which users can access Db2 Big SQL instances in Configuring, monitoring, and managing access to Db2 Big SQL instances.