Creating a service instance for Data Virtualization from the web client

After you install Data Virtualization, you must create at least one Data Virtualization service instance. Each service instance must be in a different Red Hat® OpenShift® Container Platform project. You can create a service instance in the operands project or in a project that is tethered to the operands project. If you are a IBM® Software Hub user, you can use the web client to create service instances. The web client guides you through the process of creating service instances.

Who needs to complete this task?: To create a service instance from the web client, you must have the Create service instances permission in IBM Software Hub.

When do you need to complete this task?

Complete this task only if you want to create a service instance from the web client.

Alternative methods for creating a service instance

By using the cpd-cli service-instance create command. For more information, see Creating a service instance for Data Virtualization with the cpd-cli service-instance create command.
By using the /v3/service_instances REST API call. For more information, see Creating a service instance for Data Virtualization programmatically.

Before you begin

This task assumes that the following prerequisites are met:

Prerequisite	Where to find more information
Data Virtualization is installed.	If this task is not complete, see Installing Data Virtualization.

About this task

If you installed IBM Software Hub by using Argo CD

There is a known issue where the Db2 Data Management Console (DMC) fails to install in the Cloud Pak for Data control plane where you provision a Cloud Pak for Data cluster with a tethered namespace. Ensure you complete the workaround steps in DMC fails to provision when using a tethered namespace in ArgoCD so that DMC can be installed as a required dependency.

If you installed IBM Software Hub by using the cpd-cli

You can run only one Data Virtualization instance in a project. You can also run multiple instances of Data Virtualization for the same IBM Software Hub instance by setting up tethered projects and then provisioning one Data Virtualization instance in each of the tethered projects. For information about setting up tethered projects, see Tethering projects to the IBM Software Hub control plane.

The Data Virtualization service is provisioned to any compute node in the IBM Software Hub cluster that has the specified resources (cores, memory, and ephemeral storage) available.

Procedure

Log in to the web client.
Switch to the IBM Cloud Pak for Data experience, if needed.
From the navigation menu, select Services > Instances.
Click New Instance.
Click the Data Virtualization tile. Then, click Next.
On the Configure service page, select the namespace where you want to deploy the instance and enter an optional short name. To see more namespaces in this list, create tethered namespaces. For more information, see: Manually creating projects (namespaces) for an instance of Cloud Pak for Data.
Click Next.
Choose one of the following resources and sizing options:
- Select the standard t-shirt size that you want to specify for resources in the Data Virtualization instance, and then choose whether to enable autoscaling. For more information about these settings, see Scaling Data Virtualization to a standard t-shirt size and configure autoscaling.
- Configure the custom size settings to specify the resources that you want to allocate to the Data Virtualization worker nodes:
  1. Specify the number of Data Virtualization worker nodes to allocate to the service.
    Recommended: One worker node is sufficient for many workloads.
    To understand the difference between compute nodes and worker nodes, see Preparing to install Data Virtualization.
  2. Specify the number of cores to allocate per node.
    You are constrained by the total number of available cores on the OpenShift compute nodes.
  3. Specify the amount of memory in GB to allocate per node.
    You are constrained by the total amount of memory on the OpenShift compute nodes. You can scale the Data Virtualization service up and down at any time after you provision it. For more information, see Scaling Data Virtualization.
  For more information about scaling Data Virtualization by using custom size settings, see Scaling Data Virtualization.
Specify the storage classes and persistent volume sizes that you want to use for the service nodes, caching storage, and audit log storage. For more information, see Storage requirements.
1. In the Node storage section, select the block storage class and specify the size to allocate to your nodes. The default size that is shown in the Node storage section is 50Gi.
2. In the Caching storage section, select the storage class and then specify the amount of storage to allocate to your data caches. If you are using OpenShift Data Foundation or IBM Fusion Data Foundation, select a block storage class. If you are using Portworx, select a file storage class.
3. In the Audit log storage, select the file storage class and specify the amount of storage to allocate to your audit logs.
Click Next.
Ensure that the summary is correct and click Configure.
Optional: If you want to use Cloud Pak for Data while you wait for the Data Virtualization provisioning process to complete, click Home.
Repeat this procedure to provision additional instances of Data Virtualization in tethered projects. You must create a new tethered project for each instance of Data Virtualization that you want to provision. For more information about creating tethered projects, see Manually creating projects (namespaces) for an instance of Cloud Pak for Data.
Note: The c-db2u-dv-dvcaching pod remains in the "0/1 Init" state during the entire Data Virtualization instance-provisioning process. The pod switches to the "1/1 Running" state after the process is complete.

What to do next

To connect to the Data Virtualization service, use the JDBC URL that is provided in the Configure connection page for the service. Additionally, if you have a load balancer, you must open the port in your load balancer and your firewall. For more information, see Configuring network requirements for Data Virtualization.
Optional: Configure dedicated OpenShift worker nodes.
Complete post-installation administration tasks to configure service instances.
Assign the Data Virtualization Admin user role for service set up. When you provision the Data Virtualization service, you are automatically assigned the Data Virtualization Admin role. After you provision the service, you must give at least one other user the Data Virtualization Admin role to configure the features of the Data Virtualization service. Alternatively complete those tasks yourself.

Now you can use the Data Virtualization service. For more information, see Getting started with Data Virtualization.