Software requirements

Before you install IBM Cloud Pak® for Data, review the software requirements for the control plane, the shared cluster components, and the services that you plan to install, and review the supported web browsers.

Cloud Pak for Data platform software requirements

You must have the following software to install Cloud Pak for Data:

Red Hat® OpenShift® Container Platform cluster

For entitlement information see Licenses and entitlements.

Cloud Pak for Data supports the following versions of Red Hat OpenShift Container Platform. (Cloud Pak for Data supports the same operating system requirements as Red Hat OpenShift Container Platform.)

Important: Cloud Pak for Data supports only the specified even releases of Red Hat OpenShift Container Platform.

Different versions of Cloud Pak for Data support different versions of Red Hat OpenShift Container Platform.

Version Supported on Learn more Cluster sizing guidance
Version 4.10.0 or later fixes
  • 4.7.0
  • 4.7.1
  • 4.7.2
  • 4.7.3
For details, see the Red Hat OpenShift Container Platform documentation: Refer to the Cloud Pak for Data platform hardware requirements as you configure your cluster.
Version 4.12.0 or later fixes
  • 4.7.0
  • 4.7.1
  • 4.7.2
  • 4.7.3
  • 4.7.4
For details, see the Red Hat OpenShift Container Platform documentation: Refer to the Cloud Pak for Data platform hardware requirements as you configure your cluster.
Container runtime
Your Red Hat OpenShift Container Platform cluster must include a container runtime.
Software Notes®
CRI-O Version 1.13 or later fixes You might need to adjust the CRI-O container settings. For details, see Changing required node settings.
Kubernetes Metrics Server
If you do not enable the default monitoring stack on your Red Hat OpenShift Container Platform cluster and you want to gather use metrics for your pods and nodes, install Kubernetes Metrics Server.
Important: If you do not enable the default monitoring stack or install Kubernetes Metrics Server, the platform monitoring features in Cloud Pak for Data will not work.

Shared cluster component software requirements

There are no additional software requirements for:
  • Scheduling service
  • Common core services

Service software requirements

Use this table to determine whether the service that you want to install depends on other software being available:
  • Some services require other software to be installed outside of Cloud Pak for Data (marked as external dependencies)
  • Some services require other Cloud Pak for Data services to be installed as prerequisites or to support specific functionality (marked as service dependencies)
  • Some services require other underlying components, which the service installs if needed (marked as component dependencies)
Service External dependencies Service dependencies Component dependencies
AI Factsheets None

To install this service, you must have one of the following services installed:

  • Watson™ Knowledge Catalog
  • Watson Studio
To implement a complete AI governance solution, you must install the following services:
  • OpenPages®
  • Watson Knowledge Catalog
  • Watson OpenScale
  • Watson Machine Learning
  • Watson Studio
However, if you only want to track AI models from the Platform assets catalog, install the following services:
  • Watson Studio
None
Anaconda Repository for IBM Cloud Pak for Data For details, see the Anaconda installation requirements.
None
None
Analytics Engine powered by Apache Spark None
None
None
Cognos® Analytics To use Cognos Analytics, you must have:
None
The service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
Cognos Dashboards None
None
The service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
  • Redis (opencontent_redis)
Data Privacy None
To install this service, you must have the following services already installed:
  • Analytics Engine powered by Apache Spark
  • Watson Knowledge Catalog
None
Data Refinery None
None
None
Data Replication None
None
The service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
DataStage® None
The following services are not required but provide additional functionality:
  • Watson Pipelines enables you to convert DataStage sequence jobs to pipelines.
The service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
Db2® None
The following services are not required but provide additional functionality:
  • Db2 Data Management Console provides:
    • A graphical user interface for SQL execution
    • A runtime monitoring interface
This service automatically installs the following dependencies if they are not already installed:
  • Db2U (db2u)
Db2 Big SQL To use Db2 Big SQL, you must have remote data storage, such as:
  • A Hadoop cluster (Cloudera Data Platform Version 7.1.7)
  • Object storage
    Db2 Big SQL supports:
    • IBM® Cloud Object Storage
    • Amazon Web Services object storage
    • IBM Storage Scale object storage
    • Red Hat OpenShift Data Foundation
The following services are not required but provide additional functionality:
  • Db2 Data Management Console provides:
    • A graphical user interface for SQL execution
    • A runtime monitoring interface
This service automatically installs the following dependencies if they are not already installed:
  • Db2U (db2u)
Db2 Data Gate To use this service, you must have:
  • IBM z/OS® V2.2 (5650-ZOS) or later.
  • Db2 for z/OS, either:
    • V12 (5650-DB2 or 5770-AF3) with APAR fixes installed and running at Function Level 505 or higher. You find the list of required APAR fixes here: Software dependencies.
    • V13 (5698-DB2 or 5698-DBV)
  • Distributed data facility (DDF) with a secure port, configured for network encryption through AT-TLS. For details, see Configuring network access between Db2 Data Gate and IBM Z®.
To use this service, you must have at least one instance of a supported, integrated database:
  • Db2
  • Db2 Warehouse
The following services are not required but provide additional functionality:
  • Watson Knowledge Catalog enables you to automatically publish metadata about Db2 Data Gate tables to catalogs.
None
Db2 Data Management Console None
None
This service automatically installs the following dependencies if they are not already installed:
  • 4.7.0 - 4.7.1 Redis (opencontent_redis)
  • 4.7.2 or later Redis (ibm_redis_cp)
Db2 Warehouse None
The following services are not required but provide additional functionality:
  • Db2 Data Management Console provides:
    • A graphical user interface for SQL execution
    • A runtime monitoring interface
This service automatically installs the following dependencies if they are not already installed:
  • Db2U (db2u)
Decision Optimization None
To install this service, you must have the following services already installed:
  • Watson Studio
  • Watson Machine Learning
None
EDB Postgres None
None
This service automatically installs the following dependencies if they are not already installed:
  • Cloud Native PostgreSQL (postgresql)
Execution Engine for Apache Hadoop To use this service, you must have an Execution Engine for Apache Hadoop RPM installation on a Hadoop cluster. 
To install this service, you must have the following services already installed:
  • Watson Studio
None
IBM Match 360 with Watson None
The following service is not required but provides additional functionality:
  • Watson Knowledge Catalog enables key IBM Match 360 capabilities such as profiling, automapping, data quality workflows, and data governance.

This service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
  • OpenSearch (opencontent_elasticsearch)
  • FoundationDB (opencontent_fdb)
  • RabbitMQ (opencontent_rabbitmq)
  • Redis (opencontent_redis)
Informix® None
None
This service automatically installs the following dependencies if they are not already installed:
  • informix
MANTA Automated Data Lineage None
To install this service, you must have the following services already installed:
  • Watson Knowledge Catalog
None
OpenPages If you prefer to connect to an external database rather than having OpenPages automatically provision a Db2 database for you, you must have IBM Db2 on Linux.
The following services are not required but provide additional functionality:
  • AI Factsheets enables you to review machine learning models and related activities as part of enterprise risk and compliance monitoring.
  • Cognos Analytics
  • Watson Assistant
  • Watson Discovery
  • Watson Knowledge Catalog
  • Watson OpenScale
This service automatically installs the following dependencies if they are not already installed:
  • Db2U (db2u)
  • Db2 as a service (db2aaservice)
  • RabbitMQ (opencontent_rabbitmq)
Planning Analytics If you want to use Microsoft Excel, you must have the Planning Analytics for Microsoft Excel plug-in.
None
None
Product Master If you want to connect to a database outside of Cloud Pak for Data, it must be an IBM Db2 or Oracle database.
If you want to connect to an integrated database, you must have the following service already installed:
  • Db2
None
RStudio® Server Runtimes None
To install this service, you must have the following service already installed:
  • Watson Studio
None
SPSS® Modeler None
To install this service, you must have the following services already installed:
  • Watson Studio
None
Voice Gateway None
To install this service, you must have the following service already installed:
  • Watson Assistant
  • Watson Speech to Text
  • Watson Text to Speech
None
Watson Assistant To install this service, you must have Multicloud Object Gateway. For more information, see Installing and setting up Multicloud Object Gateway
The following services are not required but provide additional functionality:
  • Watson Discovery enables you to add a search skill to your assistant.
This service automatically installs the following dependencies if they are not already installed:
  • Cloud Native PostgreSQL (postgresql)
  • etcd (opencontent_etcd)
  • Elasticsearch (opencontent_elasticsearch)
  • MinIO (opencontent_minio)
  • RabbitMQ (opencontent_rabbitmq)
  • Redis
    • 4.7.0 4.7.1 opencontent_redis
    • 4.7.2 or later ibm_redis_cp
  • Watson data governor (data_governor)
  • Watson Gateway (watson_gateway)
  • Watson model trainer (model_train)
Watson Discovery To install this service, you must have Multicloud Object Gateway. For more information, see Installing and setting up Multicloud Object Gateway
None
This service automatically installs the following dependencies if they are not already installed:
  • Cloud Native PostgreSQL (postgresql)
  • etcd (opencontent_etcd)
  • Elasticsearch (opencontent_elasticsearch)
  • MinIO (opencontent_minio)
  • RabbitMQ (opencontent_rabbitmq)
  • Watson Gateway (watson_gateway)
  • Watson model trainer (model_train)
Watson Knowledge Catalog None
The service automatically installs the following services if they are not already installed:
  • Analytics Engine powered by Apache Spark
  • Data Refinery
If you choose to install the data quality feature, the service automatically installs the following services if they are not already installed:
  • DataStage Enterprise
The following services are not required but provide additional functionality:
  • Data Privacy
  • MANTA Automated Data Lineage
The service automatically installs the following dependencies if they are not already installed:
  • Common core services
  • Db2U (db2u)
  • Db2 as a service (db2aaservice)
If you choose to install the semantic search and data lineage feature, the service automatically installs the following dependencies if they are not already installed:
  • FoundationDB (opencontent_fdb)
Watson Knowledge Studio To install this service, you must have Multicloud Object Gateway. For more information, see Installing and setting up Multicloud Object Gateway
None
This service automatically installs the following dependencies if they are not already installed:
  • etcd (opencontent_etcd)
  • MinIO (opencontent_minio)
  • Cloud Native PostgreSQL (postgresql)
  • Watson Gateway (watson_gateway)
Watson Machine Learning None
The following services are not required but provide additional functionality:
  • Watson Machine Learning Accelerator provides the Experiment Builder user interface.
The service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
Watson Machine Learning Accelerator To install this service, you must install the NVIDIA GPU Operator:
x86-64
  • On OpenShift 4.10, use v23.3.2, v22.9.2, v22.9.1, v22.9.0, 1.11, 1.10
  • On OpenShift 4.12, use v23.3.2 and v22.9.2
To install this service, you must have the following service already installed:
  • Scheduling service
The following services are not required but provide additional functionality:
  • Watson Machine Learning enables you to use Deep Learning Experiments
None
Watson OpenScale If you prefer to connect to an external database, you must have Db2 Enterprise Server Edition 11.5 or later.
If you want to connect to an integrated database, you must have at least one instance of a supported, integrated database:
  • Db2
  • Db2 Warehouse

The following services are not required but provide additional functionality. To use the functionality, you must install the services before you install Watson OpenScale:

  • Watson Studio enables you to:
    • Create AutoAI models and Jupyter Notebooks
    • Set up a demo environment where you can quickly tour the Watson OpenScale capabilities
  • Watson Machine Learning enables you to:
    • Create a deployed model that Watson OpenScale can check for bias and drift
    • Automatically log payloads
None
Watson Pipelines None
The following services are not required but provide additional nodes when they are installed:
  • DataStage
  • Watson Machine Learning
The service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
Watson Query None
The service automatically installs the following services if they are not already installed:
  • Db2 Data Management Console
The service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
  • Db2U (db2u)
Watson Speech services To install this service, you must have Multicloud Object Gateway. For more information, see Installing and setting up Multicloud Object Gateway
None
This service automatically installs the following dependencies if they are not already installed:
  • Cloud Native PostgreSQL (postgresql)
  • MinIO (opencontent_minio)
  • RabbitMQ (opencontent_rabbitmq)
  • Watson Gateway (watson_gateway)
Watson Studio None
The service automatically installs the following services if they are not already installed:
  • Data Refinery
  • Watson Studio Runtimes
This service automatically installs the following dependencies if they are not already installed:
  • Common core services (ccs)
Watson Studio Runtimes To use the following environments, you must have the NVIDIA GPU Operator Version 1.6.2 installed:
  • Runtime 22.1 on Python 3.9 for GPU
  • Runtime 22.2 on Python 3.10 for GPU
  • Runtime 23.1 on Python 3.10 for GPU
To install this service, you must have the following service already installed:
  • Watson Studio
None
watsonx.data None None None

Supported web browsers

You can use the following web browsers to access the Cloud Pak for Data web client.

  • Mozilla Firefox (recommended)
  • Google Chrome
  • Microsoft Edge

It is recommended that you use the latest available version or the latest version approved by your enterprise.