Upgrading Execution Engine for Apache Hadoop from Version 5.1 to Version 5.3

An instance administrator can upgrade Execution Engine for Apache Hadoop from Version 5.1 to Version 5.3.

Who needs to complete this task?

Instance administrator To upgrade Execution Engine for Apache Hadoop, you must be an instance administrator. An instance administrator has permission to manage software in the following projects:

The operators project for the instance

The operators for this instance of Execution Engine for Apache Hadoop are installed in the operators project. In the upgrade commands, the ${PROJECT_CPD_INST_OPERATORS} environment variable refers to the operators project.

The operands project for the instance

The custom resources for the control plane and Execution Engine for Apache Hadoop are installed in the operands project. In the upgrade commands, the ${PROJECT_CPD_INST_OPERANDS} environment variable refers to the operands project.

When do you need to complete this task?

Review the following options to determine whether you need to complete this task:

  • If you want to upgrade the IBM Software Hub control plane and one or more services at the same time, follow the process in Upgrading an instance of IBM Software Hub instead.
  • If you didn't upgrade Execution Engine for Apache Hadoop when you upgraded the IBM Software Hub control plane, complete this task to upgrade Execution Engine for Apache Hadoop.

    Repeat as needed If you are responsible for multiple instances of IBM Software Hub, you can repeat this task to upgrade more instances of Execution Engine for Apache Hadoop on the cluster.

Information you need to complete this task

Review the following information before you upgrade Execution Engine for Apache Hadoop:

Version requirements

All the components that are associated with an instance of IBM Software Hub must be installed at the same release. For example, if the IBM Software Hub control plane is at Version 5.3.1, you must upgrade Execution Engine for Apache Hadoop to Version 5.3.1.

Environment variables
The commands in this task use environment variables so that you can run the commands exactly as written.
  • If you don't have the script that defines the environment variables, see Setting up installation environment variables.
  • To use the environment variables from the script, you must source the environment variables before you run the commands in this task. For example, run:
    source ./cpd_vars.sh
Common core services
Execution Engine for Apache Hadoop requires the IBM Software Hub common core services.

If the common core services are not at the correct version in the operands project for the instance, the common core services are automatically upgraded when you upgrade Execution Engine for Apache Hadoop. The common core services upgrade increases the amount of time the upgrade takes to complete.

Before you begin

This task assumes that the following prerequisites are met:

System requirements
This task assumes that the cluster meets the minimum requirements for Execution Engine for Apache Hadoop.
Where to find more information
If this task is not complete, see System requirements.
Workstation
This task assumes that the workstation from which you will run the upgrade is set up as a client workstation and has the following command-line interfaces:
  • IBM Software Hub CLI: cpd-cli
  • OpenShift® CLI: oc
  • Helm CLI: oc
Where to find more information
If this task is not complete, see Updating client workstations.
Control plane
This task assumes that the IBM Software Hub control plane is upgraded.
Where to find more information
If this task is not complete, see Upgrading an instance of IBM Software Hub.
Private container registry
If your environment uses a private container registry (for example, your cluster is air-gapped), this task assumes that the following tasks are complete:
  1. The Execution Engine for Apache Hadoop software images are mirrored to the private container registry.
    Where to find more information
    If this task is not complete, see Mirroring images to a private container registry.
  2. The cpd-cli is configured to pull the olm-utils-v4 image from the private container registry.
    Where to find more information
    If this task is not complete, see Pulling the olm-utils-v4 image from the private container registry.
Cluster-scoped resources
This task assumes that the cluster-scoped resources, such as custom resource definitions, cluster roles, and cluster role bindings, were updated.
Where to find more information
If this task is not complete, see Updating the cluster-scoped resources for the platform and services.
Image pull secrets
This task assumes that the secrets that contain the image pull credentials for the instance exist.
Where to find more information
If this task is not complete, see Creating image pull secrets for an instance of IBM Software Hub.

Prerequisite services

Before you upgrade Execution Engine for Apache Hadoop, ensure that the following services are upgraded and running:

Procedure

Complete the following tasks to upgrade Execution Engine for Apache Hadoop:

  1. Upgrading the service
  2. Validating the upgrade
  3. What to do next

Upgrading the service

To upgrade Execution Engine for Apache Hadoop:

  1. Log the cpd-cli in to the Red Hat® OpenShift Container Platform cluster:
    ${CPDM_OC_LOGIN}
    Remember: CPDM_OC_LOGIN is an alias for the cpd-cli manage login-to-ocp command.
  2. Update the operator and custom resource for Execution Engine for Apache Hadoop.
    cpd-cli manage install-components \
    --license_acceptance=true \
    --components=hee \
    --release=${VERSION} \
    --operator_ns=${PROJECT_CPD_INST_OPERATORS} \
    --instance_ns=${PROJECT_CPD_INST_OPERANDS} \
    --image_pull_prefix=${IMAGE_PULL_PREFIX} \
    --image_pull_secret=${IMAGE_PULL_SECRET} \
    --upgrade=true

Validating the upgrade

Execution Engine for Apache Hadoop is upgraded when the install-components command returns:
[SUCCESS]... The install-components command ran successfully

If you want to confirm that the custom resource status is Completed, you can run the cpd-cli manage get-cr-status command:

cpd-cli manage get-cr-status \
--cpd_instance_ns=${PROJECT_CPD_INST_OPERANDS} \
--components=hee

What to do next

  1. Upgrade all of the services in this instance to IBM Software Hub Version 5.3.x.
  2. Complete the catalog-api service migration to PostgreSQL.
  3. Complete post-upgrade tasks for Execution Engine for Apache Hadoop.

After you complete the preceding steps, Execution Engine for Apache Hadoop is ready to use. To get started with Execution Engine for Apache Hadoop, see Analyzing Apache Hadoop data.