After you specify the resources that you want to use for persistent storage, set up a
connection to one or more remote data sources.
About this task
You can connect to a Hadoop cluster on Cloudera Data Platform (CDP) Private Cloud Base 7.1.7, an
object store, or to both a Hadoop cluster and an object store. In the third scenario, you must
configure the remote cluster to connect to the object store because Db2 Big SQL uses the Hive metastore as its catalog. For more information
about each scenario, see Db2 Big SQL architecture.
Note: If updates are made to the remote Hadoop cluster's core-site.xml
configuration file to set up the cluster access to an object store service, its Hive service must be restarted, even if the cluster
manager doesn't indicate that a restart is needed.
- Using vault stored credentials and certificates
-
To connect to object store services, you can use credentials and certificates that are stored as
secrets in a vault. To connect to an object store service by using secrets from a
vault, obtain the following information:
To obtain this information, in Cloud Pak for Data, go to
. If you cannot access this
information, ask your Cloud Pak for Data administrator to
provide it.
Procedure
- To connect to a Hadoop cluster, select the Configure Hadoop
cluster checkbox, and specify the following information:
- The Hadoop cluster manager (Cloudera cluster manager server) host URL
- The Hadoop cluster manager username
- The Hadoop cluster manager user password
- Under Configure Kerberos, select one of the following
options.
- To connect to an object store, select the Configure object store
checkbox, and specify the following information:
- The object store service endpoint
- The object store service access ID key
To use a secret from a
vault, type
vault:<vault-secret-name>:<vault-secret-key>.
- The object store service secret key
To use a secret from a vault, type
vault:<vault-secret-name>:<vault-secret-key>.
Remember: If the Db2 Big SQL
instance is also set up to connect to a Hadoop cluster, the cluster must be configured to access the
same object store service.
- If you are accessing a public object store, select the Setup SSL for object
store configuration checkbox.
Note: Most public cloud object
stores use a well-known certificate authority, so it is generally not necessary to provide a Secure
Sockets Layer (SSL) certificate in the SSL Certificate box. But select the
Setup SSL for object store configuration checkbox so that the connection to
the object store is secure.
- If you are accessing an on-premises, self-hosted object store that is accessed by using
SSL, provide the SSL certificate.
- Select the Setup SSL for object store configuration
checkbox.
- If you are using a secret from a vault for the certificate, in the SSL
Certificate box, type
vault:<vault-secret-name>:<vault-secret-key>.
- If you are not using a secret from a vault for the
certificate, in the SSL Certificate box, copy the SSL certificate in base64
encoded Privacy Enhanced Mail (PEM) format, including the PEM header and footer.
For
example,
-----BEGIN CERTIFICATE-----
MIIFVzCCAz+gAwIBAgIJAM+JlcdkA2RaMA0GCSqGSIb3DQEBCwUAMEIxCzAJBgNV
BAYTAlhYMRUwEwYDVQQHDAxEZWZhdWx0IENpdHkxHDAaBgNVBAoME0RlZmF1bHQg
...
ZzV0pevZpJWFCt2QYEprZppj0KyiGHKQcEXAn/953YPTOmmdzGVOu5eLoTncICte
oBzKDOdxT6CTenizfaiP5LlWH1LfPvw/+0Nz
-----END CERTIFICATE-----
- If you are connecting to an object store or to both a Hadoop cluster and an object store,
and you want to limit access to a single object store bucket, select the Specify the
object store bucket name checkbox, and enter the bucket name.
- If the object store service is configured for path style access, select the Use
path style access checkbox.
If the object store service uses virtual hosted style, do not select the checkbox.
- Click Next.
- In the Summary page, review the information that you
specified.
- If you want to make changes, click Previous and go to the page
where you want to edit information.
- When all configuration information is correct, click
Create.
Results
A Db2 Big SQL instance is created. To check
that the instance is ready to use, run the following command:
oc get bigsql -l app.kubernetes.io/name=db2-bigsql
When the instance is ready, the command returns the following output:
NAME DB2UCLUSTER STATE AGE
bigsql-<instance_id> bigsql-<instance_id> Ready 19m
What to do next
Depending on your environment, you might have to do some post-provisioning tasks. For more
information, see Db2 Big SQL post-provisioning tasks.
After you provision an instance, you must add one or more users to the instance. You (instance
owner) are not automatically added as a user. For more information, see the section on specifying
which users can access Db2 Big SQL instances in
Configuring, monitoring, and managing access to Db2 Big SQL instances.