Setting up the cluster for the DataStage Enterprise service

If you plan to install the DataStage® Enterprise service on IBM® Cloud Pak for Data, a cluster administrator must set up the cluster for the service.

Before you begin

Required role: To complete this task, you must be a cluster administrator.

If you are installing the service on an air-gapped cluster, ensure that you completed the steps in Preparing for air-gapped installations to download the required files for the service.

Ensure that the Mac OS or Linux machine where you will run the commands meets the appropriate requirements for your environment:

Requirements for the machine (the environment where each requirement applies is shown in parentheses):

  • Can connect to the cluster. (Internet-connected and air-gapped clusters)
  • Is connected to the internet. (Internet-connected clusters only)
  • Has the oc command-line interface. (Internet-connected and air-gapped clusters)
    You can download the appropriate client tools for your operating system from Red Hat® OpenShift®. Ensure that the version is compatible with the version of Red Hat OpenShift on your cluster.
  • Has the Cloud Pak for Data command-line interface. (Internet-connected and air-gapped clusters)
    See Obtaining the installation files. Use the same version of the command-line interface each time you run the commands.
  • Has the updated repo.yaml file in the same directory as the Cloud Pak for Data command-line interface. (Internet-connected clusters only)
    See Obtaining the installation files.
  • Has the cpd-Operating_System-workspace directory, which contains the required files. (Air-gapped clusters only)
    See Preparing for air-gapped installations.
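
The machine requirements above can be checked with a short script before you start. The following is a minimal sketch for the internet-connected case, not part of the product: the function names cpd_os_name, has_command, and check_connected_prereqs are hypothetical, and the sketch assumes that the command-line interface file is named cpd-linux or cpd-darwin, as implied by the cpd-Operating_System placeholder.

```shell
#!/bin/sh
# Sketch only: the function names below are hypothetical, not part of any IBM tool.

# Map the kernel name printed by `uname -s` to the Operating_System value
# used in the cpd-Operating_System file name (linux or darwin).
cpd_os_name() {
  case "$1" in
    Linux)  echo "linux" ;;
    Darwin) echo "darwin" ;;
    *)      echo "unsupported kernel name: $1" >&2; return 1 ;;
  esac
}

# Succeed if the named command is available on the PATH.
has_command() {
  command -v "$1" >/dev/null 2>&1
}

# Check a machine that serves an internet-connected cluster.
# $1 = directory that holds the Cloud Pak for Data command-line interface.
check_connected_prereqs() {
  os=$(cpd_os_name "$(uname -s)") || return 1
  has_command oc || { echo "oc command-line interface not found" >&2; return 1; }
  [ -f "$1/cpd-$os" ] || { echo "cpd-$os not found in $1" >&2; return 1; }
  [ -f "$1/repo.yaml" ] || { echo "repo.yaml not found in $1" >&2; return 1; }
  echo "prerequisites found for cpd-$os"
}
```

For example, after loading these functions into your shell, you might run check_connected_prereqs . from the directory that holds the command-line interface. The sketch does not test network connectivity or whether the oc version is compatible with your cluster, so those requirements still need to be confirmed separately.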

 

About this task

These instructions assume that the IBM Cloud Pak for Data control plane is already installed on your Red Hat OpenShift cluster.

Procedure

To set up the cluster for the service:

  1. Log in to your Red Hat OpenShift cluster:
    oc login OpenShift_URL:port
  2. Preview the changes that must be made to the cluster by running the appropriate cpd adm command for your environment:
    Tip: For a list of all available options, enter the command: ./cpd-Operating_System adm --help.
    • To run the command on a cluster that can connect to the internet:
      1. Change to the directory where you placed the Cloud Pak for Data command-line interface and the repo.yaml file.
      2. Run the cpd adm command with the following parameters to preview the list of resources that must be created on the cluster:
        ./cpd-Operating_System adm --repo repo.yaml \
        --assembly ds-ent \
        --namespace Project

        Replace the following values:

        Variable            Replace with
        Operating_System    For Linux, specify linux. For Mac OS, specify darwin.
        Project             The project where the IBM Cloud Pak for Data control plane is installed.

        The command returns a list of the changes that you must make to your cluster so that the service can run on it.

    • To run the command on an air-gapped cluster:
      1. Change to the directory where you placed the Cloud Pak for Data command-line interface.
      2. Run the cpd adm command with the following parameters to preview the list of resources that must be created on the cluster:
        ./cpd-Operating_System adm \
        --assembly ds-ent \
        --version Assembly_version \
        --namespace Project \
        --load-from Image_directory_location

        Replace the following values:

        Variable                    Replace with
        Operating_System            For Linux, specify linux. For Mac OS, specify darwin.
        Assembly_version            The version of the assembly that is specified in the cpd-Operating_System-workspace/assembly/Assembly_name/versions.yaml file, where Assembly_name is the name of the assembly (ds-ent for this service).
        Project                     The project where the IBM Cloud Pak for Data control plane is installed.
        Image_directory_location    The location of the cpd-Operating_System-workspace directory.

        The command returns a list of the changes that you must make to your cluster so that the service can run on it.

  3. Make the necessary changes to your cluster.
    You can choose one of the following methods to make the changes:
    • To automatically apply the changes to your cluster:

      Re-run the cpd adm command with the --apply flag:

      • To run the command on a cluster that can connect to the internet:
        ./cpd-Operating_System adm --repo repo.yaml \
        --assembly ds-ent \
        --namespace Project \
        --apply
      • To run the command on an air-gapped cluster:
        ./cpd-Operating_System adm \
        --assembly ds-ent \
        --version Assembly_version \
        --namespace Project \
        --load-from Image_directory_location \
        --apply

      Replace the variables with the same values that you used the last time you ran the command.

    • To manually apply the changes to your cluster:

      Follow the appropriate procedures from the Red Hat OpenShift documentation to complete the required tasks.
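
The preview and apply commands in the procedure differ only by the --apply flag, so it can help to build both from the same values. The following is a minimal sketch under stated assumptions: the function names cpd_adm_connected and cpd_adm_airgapped are hypothetical, the functions only print the command line so that you can review it before you run it, and they use only the parameters shown in the procedure. Values that contain spaces are not quoted in the printed command.

```shell
#!/bin/sh
# Sketch only: these function names are hypothetical. Each function prints
# the cpd adm command line instead of running it.

# Internet-connected cluster.
# $1 = Operating_System, $2 = Project, $3 = "--apply" (optional)
cpd_adm_connected() {
  printf './cpd-%s adm --repo repo.yaml --assembly ds-ent --namespace %s' "$1" "$2"
  if [ "${3:-}" = "--apply" ]; then printf ' --apply'; fi
  printf '\n'
}

# Air-gapped cluster.
# $1 = Operating_System, $2 = Assembly_version, $3 = Project,
# $4 = Image_directory_location, $5 = "--apply" (optional)
cpd_adm_airgapped() {
  printf './cpd-%s adm --assembly ds-ent --version %s --namespace %s --load-from %s' \
    "$1" "$2" "$3" "$4"
  if [ "${5:-}" = "--apply" ]; then printf ' --apply'; fi
  printf '\n'
}
```

For example, with a hypothetical project named zen, cpd_adm_connected linux zen prints the preview command, and cpd_adm_connected linux zen --apply prints the same command with --apply appended. Printing the command first keeps the preview step explicit and makes it harder to apply changes with different values from the ones that you previewed.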

What to do next

See Installing the DataStage Enterprise service to continue the installation process.