Installing the Data Virtualization service

A project administrator can install Data Virtualization on IBM Cloud Pak for Data.

Permissions you need for this task: You must be an administrator of the OpenShift® project (Kubernetes namespace) where you will deploy Data Virtualization.

Information you need to complete this task

Data Virtualization requires a custom security context constraint (SCC). For details, see Creating required SCCs.
Data Virtualization must be installed in the same project as Cloud Pak for Data.
Data Virtualization requires IBM Db2® Data Management Console and the Cloud Pak for Data common core services. If either is not installed in the project where you plan to install Data Virtualization, it is installed automatically with Data Virtualization, and the installation takes longer to complete.
Data Virtualization uses the following storage classes. If you don't use these storage classes on your cluster, ensure that you have a storage class with an equivalent definition:
- OpenShift Container Storage: ocs-storagecluster-ceph-rbd
- IBM Spectrum®: ibm-spectrum-scale-sc
- NFS: managed-nfs-storage
- Portworx: portworx-db2-rwx-sc
- IBM Cloud File Storage: ibmc-file-gold-gid or ibm-file-custom-gold-gid
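To see whether one of the storage classes listed above is present on a cluster, a check along the following lines may help. This is a minimal sketch rather than part of the official procedure: the function name `find_dv_storage_class` is hypothetical, and it matches only the exact class names from the list, so an equivalent storage class under a different name is not detected.

```shell
# Hypothetical helper: reads storage class names on stdin (one per line)
# and prints the ones that exactly match a name from the list above.
# Returns non-zero if none of the names match.
find_dv_storage_class() {
  grep -Fx \
    -e 'ocs-storagecluster-ceph-rbd' \
    -e 'ibm-spectrum-scale-sc' \
    -e 'managed-nfs-storage' \
    -e 'portworx-db2-rwx-sc' \
    -e 'ibmc-file-gold-gid' \
    -e 'ibm-file-custom-gold-gid'
}

# Example usage (requires a logged-in oc session):
#   oc get storageclass -o jsonpath='{range .items[*]}{.metadata.name}{"\n"}{end}' \
#     | find_dv_storage_class
```

If the function prints nothing, compare the definitions of your own storage classes against the ones listed above.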

Before you begin

Ensure that the cluster meets the minimum requirements for installing Data Virtualization. For details, see System requirements.

Additionally, ensure that a cluster administrator completed the required Pre-installation tasks for your environment. Specifically, verify that a cluster administrator completed the following tasks:

Cloud Pak for Data is installed. For details, see Installing Cloud Pak for Data.
For environments that use a private container registry, such as air-gapped environments, the Data Virtualization software images are mirrored to the private container registry. For details, see Mirroring images to your container registry.
The cluster is configured to pull the Data Virtualization software images. For details, see Configuring your cluster to pull images.
The Data Virtualization catalog source exists. For details, see Creating catalog sources.
The Data Virtualization operator subscription exists. For details, see Creating operator subscriptions.
The security context constraints (SCCs) required to run Data Virtualization exist. For details, see Creating required SCCs.
The node settings are adjusted for Data Virtualization. For details, see Changing required node settings.

If these tasks are not complete, the Data Virtualization installation will fail.

Prerequisite services

Before you install Data Virtualization, review the requirements for the following prerequisite services:

Db2U: If you have not already installed the ibm-db2uoperator-catalog, create the Db2U catalog source. For more information, see Configuring your cluster to pull Cloud Pak for Data images. Then, create the Db2U operator subscription. For more information, see Creating operator subscriptions.
Db2 Data Management Console: If you do not manually install Db2 Data Management Console, Data Virtualization installs it for you. If you have already installed Db2 Data Management Console, make sure that a Db2 Data Management Console instance has been provisioned. For more information, see Installing Db2 Data Management Console.
Common core services: If the common core services are not installed on your Cloud Pak for Data cluster, Data Virtualization installs them for you.
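The Db2U prerequisite can be checked before you start with a sketch like the following. The catalog source name `ibm-db2uoperator-catalog` comes from the text above. The `openshift-marketplace` namespace and the function name `check_db2u_catalog` are assumptions, so adjust them if your cluster administrator created the catalog source elsewhere.

```shell
# Assumption: catalog sources live in openshift-marketplace.
# Override CATALOG_NS if your cluster uses a different namespace.
CATALOG_NS="${CATALOG_NS:-openshift-marketplace}"

# Hypothetical helper: reports whether the Db2U catalog source exists.
check_db2u_catalog() {
  if oc get catalogsource ibm-db2uoperator-catalog -n "$CATALOG_NS" >/dev/null 2>&1; then
    echo "Db2U catalog source found in $CATALOG_NS"
  else
    echo "Db2U catalog source not found in $CATALOG_NS" >&2
    return 1
  fi
}

# Example usage (requires a logged-in oc session):
#   check_db2u_catalog
```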

Procedure

Complete the following tasks to install Data Virtualization:

Install the service.
Verify the installation.
Find out what to do next.

Installing the service

To install Data Virtualization:

Log in to Red Hat® OpenShift Container Platform as a user with sufficient permissions to complete the task:
```
oc login OpenShift_URL:port
```

Create a DvService custom resource to install Data Virtualization.

Important: By creating a DvService custom resource with spec.license.accept: true, you are accepting the license terms for Data Virtualization. You can find links to the relevant licenses in IBM Cloud Pak for Data License Information.

Create a custom resource with the following format.

```
cat <<EOF | oc apply -f -
apiVersion: db2u.databases.ibm.com/v1
kind: DvService
metadata:
  name: dv-service     # This is the recommended name, but you can change it
  namespace: project-name     # Replace with the project where you will install Data Virtualization
spec:
  license:
    accept: true
    license: Enterprise | Standard     # Replace with the license you purchased: Enterprise or Standard
  version: 1.7.8
  size: "small"                     # Default size
EOF
```

When you create the custom resource, the Data Virtualization operator installs Data Virtualization.
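Because the `license` field must hold exactly one of the two values, it can help to generate the custom resource from variables and validate the input first. The following is a minimal sketch under that assumption: the function name `render_dvservice` and the project name `my-project` are hypothetical, and the fields simply mirror the format shown above.

```shell
# Hypothetical helper: prints a DvService custom resource for the given
# project and license. It only prints YAML; nothing is applied to the cluster.
# Note: the output sets spec.license.accept to true, which accepts the license terms.
render_dvservice() {
  dv_project="$1"
  dv_license="$2"
  if [ -z "$dv_project" ]; then
    echo "project name is required" >&2
    return 1
  fi
  case "$dv_license" in
    Enterprise|Standard) ;;
    *) echo "license must be Enterprise or Standard" >&2; return 1 ;;
  esac
  cat <<EOF
apiVersion: db2u.databases.ibm.com/v1
kind: DvService
metadata:
  name: dv-service
  namespace: ${dv_project}
spec:
  license:
    accept: true
    license: ${dv_license}
  version: 1.7.8
  size: "small"
EOF
}

# Example usage (requires a logged-in oc session):
#   render_dvservice my-project Enterprise | oc apply -f -
```

Running the function without the pipe prints the YAML so that you can review it before applying it.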

Verifying the installation

When you create the DvService custom resource, the Data Virtualization operator processes its contents and starts the microservices that make up the Data Virtualization service, including the Data Virtualization addon and the Data Virtualization service provider. (The DvService microservice is defined by the dv-service custom resource.) Data Virtualization is installed when the DvService reconcile status is Completed.

To check the status of the installation:

Change to the project where you installed Data Virtualization:
```
oc project project-name
```
Get the status of Data Virtualization (dv-service):
- Run the following command:
```
oc get dvservice dv-service
```
  The result is similar to the following example, where the READY field indicates whether the DvService is installed.
```
NAME         READY
dv-service   True
```
- To check whether the DvService finished installing Data Virtualization service pods, run the following command:
```
oc get DvService dv-service -o jsonpath="{.status.reconcileStatus}"
```
  Data Virtualization is installed when the command returns Completed. You must now provision a Data Virtualization instance to use Data Virtualization. For more information, see Provisioning the Data Virtualization service.
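Installation can take a while, so instead of rerunning the status command by hand you might poll it in a loop. The following is a minimal sketch: the function name `wait_for_dvservice` and the default of 60 attempts at 30-second intervals are assumptions, not documented values. It reuses the jsonpath query shown above and expects the current project to be the one where Data Virtualization is installed.

```shell
# Hypothetical helper: polls the DvService reconcile status until it is
# Completed or the attempts run out.
# Arguments: [attempts] [seconds between attempts]; the defaults are assumptions.
wait_for_dvservice() {
  attempts="${1:-60}"
  interval="${2:-30}"
  i=1
  while [ "$i" -le "$attempts" ]; do
    # A failed query is treated as an unknown status and retried.
    status=$(oc get DvService dv-service -o jsonpath="{.status.reconcileStatus}" 2>/dev/null) || status=""
    if [ "$status" = "Completed" ]; then
      echo "Data Virtualization is installed"
      return 0
    fi
    echo "Attempt $i/$attempts: reconcileStatus is '${status:-unknown}'"
    sleep "$interval"
    i=$((i + 1))
  done
  echo "Timed out waiting for dv-service" >&2
  return 1
}

# Example usage (requires a logged-in oc session):
#   wait_for_dvservice 120 30
```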

What to do next

Provision the service.
Administer the service.
Optionally, set up automatic pruning of the archive log.