Software requirements

Important: IBM® Cloud Pak for Data Version 4.5 will reach end of support (EOS) on 31 July, 2025. For more information, see the Discontinuance of service announcement for IBM Cloud Pak for Data Version 4.X.

Upgrade to IBM Software Hub Version 5.1 before IBM Cloud Pak for Data Version 4.5 reaches end of support. For more information, see Upgrading IBM Software Hub in the IBM Software Hub Version 5.1 documentation.

Before you install IBM Cloud Pak for Data, review the software requirements for the control plane, the shared cluster components, and the services that you plan to install, and review the supported web browsers.

Cloud Pak for Data platform software requirements
Shared cluster component software requirements
Service software requirements
Supported web browsers

Cloud Pak for Data platform software requirements

You must have the following software to install Cloud Pak for Data:

Red Hat® OpenShift® Container Platform cluster

For entitlement information, see Licenses and entitlements.

The following versions of Red Hat OpenShift Container Platform are supported. (Cloud Pak for Data supports the same operating system requirements as Red Hat OpenShift Container Platform.)

Version	Learn more	Cluster sizing guidance	Notes
Version 4.6.29 or later fixes	For details, see the Red Hat OpenShift Container Platform documentation: Installation overview; Minimum hardware requirements; Control plane node sizing	Refer to the Cloud Pak for Data platform hardware requirements as you configure your cluster.	The following services are not supported on OpenShift Version 4.6: MongoDB; Watson Machine Learning; Watson Machine Learning Accelerator
Version 4.8.0 or later fixes	For details, see the Red Hat OpenShift Container Platform documentation: Installation overview; Minimum hardware requirements; Control plane node sizing	Refer to the Cloud Pak for Data platform hardware requirements as you configure your cluster.	None
Version 4.10.0 or later fixes	For details, see the Red Hat OpenShift Container Platform documentation: Installation overview; Minimum hardware requirements; Control plane node sizing	Refer to the Cloud Pak for Data platform hardware requirements as you configure your cluster.	None
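
The version table above can be turned into a quick pre-installation check. The following Python sketch is illustrative only and is not part of the product. It assumes that "Version X.Y.Z or later fixes" means a later fix (patch) level within the same X.Y release, and the names `MIN_PATCH` and `is_supported_openshift` are hypothetical.

```python
# Minimum fix level for each supported OpenShift release, copied from the
# table above. Keys are (major, minor); values are the minimum patch level.
# Assumption: "or later fixes" means later patches within the same release.
MIN_PATCH = {
    (4, 6): 29,
    (4, 8): 0,
    (4, 10): 0,
}


def is_supported_openshift(version: str) -> bool:
    """Return True if a version string such as '4.10.3' meets the table."""
    parts = version.strip().split(".")
    if len(parts) != 3:
        return False
    try:
        major, minor, patch = (int(part) for part in parts)
    except ValueError:
        # Non-numeric components (for example pre-release tags) are rejected.
        return False
    min_patch = MIN_PATCH.get((major, minor))
    return min_patch is not None and patch >= min_patch
```

Comparing the components as integers avoids the common mistake of comparing version strings lexically, where "4.10" would sort before "4.8".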

Container runtime

Your Red Hat OpenShift Container Platform cluster must include a container runtime.

Software	Notes
CRI-O Version 1.13 or later fixes	You might need to adjust the CRI-O container settings. For details, see Changing required node settings.

Kubernetes Metrics Server

If you do not enable the default monitoring stack on your Red Hat OpenShift Container Platform cluster and you want to gather usage metrics for your pods and nodes, install Kubernetes Metrics Server.

Important: If you do not enable the default monitoring stack or install Kubernetes Metrics Server, the platform monitoring features in Cloud Pak for Data will not work.

IBM Cloud Pak® foundational services

For entitlement information, see Licenses and entitlements.

Cloud Pak for Data release	Minimum required release of IBM Cloud Pak foundational services
4.5.0	Version 3.19.0 or later fixes

Important: The Cloud Pak for Data command-line interface (cpd-cli) can automatically install and upgrade IBM Cloud Pak foundational services. You do not need to install IBM Cloud Pak foundational services separately.

Shared cluster component software requirements

There are no additional software requirements for:

Scheduling service
Common core services

Service software requirements

Use this table to determine whether the service that you want to install depends on other software being available:

Some services require other software to be installed outside of Cloud Pak for Data (marked as external dependencies)
Some services require other Cloud Pak for Data services to be installed as prerequisites or to support specific functionality (marked as service dependencies)
Some services require other underlying components, which the service installs if needed (marked as component dependencies)

Service	External dependencies	Service dependencies	Component dependencies
Anaconda Repository for IBM Cloud Pak for Data	For details, see the Anaconda installation requirements.	None	None
Analytics Engine Powered by Apache Spark	None	None	None
Cognos® Analytics	To use Cognos Analytics, you must have: (1) A content store for configuration data, global settings, data server connections, and product-specific content. The content store can be an integrated Db2 database or an external relational database. For details, see Configuring the content store for Cognos Analytics. (2) A database for audit records. You can optionally store the audit records in the content store. For details, see Provisioning the Cognos Analytics instance. (3) An SMTP server, if you want to use the email notification feature. For details, see Provisioning the Cognos Analytics instance.	None	The service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`)
Cognos Dashboards	None	None	The service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`)
Data Privacy	None	To install this service, you must have the following services already installed: Analytics Engine Powered by Apache Spark; Watson Knowledge Catalog	None
Data Refinery	None	None	None
Data Virtualization	None	The service automatically installs the following service if it is not already installed: Db2® Data Management Console	The service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`); Db2U (`db2u`)
DataStage®	None	The following services are not required but provide additional functionality: Watson Studio Pipelines (tech preview) enables you to convert DataStage sequence jobs to pipelines.	The service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`)
Db2	None	The following services are not required but provide additional functionality: Db2 Data Management Console provides a graphical user interface for SQL execution and a runtime monitoring interface.	This service automatically installs the following dependencies if they are not already installed: Db2U (`db2u`)
Db2 Big SQL	To use Db2 Big SQL, you must have remote data storage, such as a Hadoop cluster (Cloudera Data Platform Version 7.1.7) or object storage. Db2 Big SQL supports: IBM Cloud Object Storage; Amazon Web Services object storage; IBM Spectrum® Scale object storage; Red Hat OpenShift Data Foundation	The following services are not required but provide additional functionality: Db2 Data Management Console provides a graphical user interface for SQL execution and a runtime monitoring interface.	This service automatically installs the following dependencies if they are not already installed: Db2U (`db2u`)
Db2 Data Gate	To use this service, you must have: (1) IBM z/OS® V2.2 (5650-ZOS) or later. (2) Db2 for z/OS, either V12 (5650-DB2 or 5770-AF3) with APAR fixes installed and running at Function Level 505 or higher (for the list of required APAR fixes, see Software dependencies), or V13 (5698-DB2 or 5698-DBV). (3) Distributed data facility (DDF) with a secure port, configured for network encryption through AT-TLS. For details, see Configuring network access between Db2 Data Gate and IBM Z®.	To use this service, you must have at least one instance of a supported, integrated database: Db2; Db2 Warehouse. The following services are not required but provide additional functionality: Watson Knowledge Catalog enables you to automatically publish metadata about Db2 Data Gate tables to catalogs.	None
Db2 Data Management Console	None	None	This service automatically installs the following dependencies if they are not already installed: Redis (`opencontent_redis`)
Db2 Warehouse	None	The following services are not required but provide additional functionality: Db2 Data Management Console provides a graphical user interface for SQL execution and a runtime monitoring interface.	This service automatically installs the following dependencies if they are not already installed: Db2U (`db2u`)
Decision Optimization	None	To install this service, you must have the following services already installed: Watson Studio; Watson Machine Learning	None
EDB Postgres	None	None	This service automatically installs the following dependencies if they are not already installed: Cloud Native PostgreSQL (`postgresql`)
Execution Engine for Apache Hadoop	To use this service, you must have an Execution Engine for Apache Hadoop RPM installation on a Hadoop or IBM Spectrum Conductor cluster.	To install this service, you must have the following service already installed: Watson Studio	None
Guardium® External S-TAP	To use this service, you must have an existing IBM Security Guardium collector. The following versions of IBM Security Guardium are supported: Version 11.2; Version 11.3; Version 11.4	None	None
Informix®	None	None	This service automatically installs the following dependencies if they are not already installed: `informix`
IBM Match 360 with Watson	None	The following services are not required but provide additional functionality: Watson Knowledge Catalog provides the ability to profile and automap data assets.	This service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`); Elasticsearch (`opencontent_elasticsearch`); FoundationDB (`opencontent_fdb`); RabbitMQ (`opencontent_rabbitmq`); Redis (`opencontent_redis`)
MongoDB	None	None	You must install the following dependencies when you install the service: `mongodb`
OpenPages®	If you prefer to connect to an external database rather than having OpenPages automatically provision a Db2 database for you, you must have IBM Db2 on Linux.	The following services are not required but provide additional functionality: Cognos Analytics; Watson Assistant; Watson Discovery; Watson Knowledge Catalog (integration with AI Factsheets supports reviewing machine learning models and related activities as part of enterprise risk and compliance monitoring); Watson OpenScale	This service automatically installs the following dependencies if they are not already installed: Db2U (`db2u`); Db2 as a service (`db2aaservice`); RabbitMQ (`opencontent_rabbitmq`)
Planning Analytics	If you want to use Microsoft Excel, you must have the Planning Analytics for Microsoft Excel plug-in.	None	None
Product Master	If you want to connect to a database outside of Cloud Pak for Data, it must be an IBM Db2 or Oracle database.	If you want to connect to an integrated database, you must have the following service already installed: Db2	None
RStudio® Server with R 3.6	None	To install this service, you must have the following service already installed: Watson Studio	None
SPSS® Modeler	None	To install this service, you must have the following service already installed: Watson Studio	None
Voice Gateway	None	To install this service, you must have the following services already installed: Watson Assistant; Watson Speech to Text; Watson Text to Speech	None
Watson Assistant	None	The following services are not required but provide additional functionality: Watson Discovery enables you to add a search skill to your assistant.	This service automatically installs the following dependencies if they are not already installed: Cloud Native PostgreSQL (`postgresql`); etcd (`opencontent_etcd`); Elasticsearch (`opencontent_elasticsearch`); MinIO (`opencontent_minio`); RabbitMQ (`opencontent_rabbitmq`); Redis (`opencontent_redis`); Watson audit webhook (`opencontent_auditwebhook`); Watson data governor (`data_governor`); Watson Gateway (`watson_gateway`); Watson model trainer (`model_train`)
Watson Discovery	None	IBM Cloud Pak foundational services	This service automatically installs the following dependencies if they are not already installed: Cloud Native PostgreSQL (`postgresql`); etcd (`opencontent_etcd`); Elasticsearch (`opencontent_elasticsearch`); MinIO (`opencontent_minio`); RabbitMQ (`opencontent_rabbitmq`); Watson Gateway (`watson_gateway`)
Watson Knowledge Catalog	None	The service automatically installs the following services if they are not already installed: Analytics Engine Powered by Apache Spark; Data Refinery. If you choose to install the data quality feature, the service automatically installs the following service if it is not already installed: DataStage Enterprise. The following services are not required but provide additional functionality: Data Privacy; MANTA Automated Data Lineage	The service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`); Db2U (`db2u`); Db2 as a service (`db2aaservice`). If you choose to install the semantic search and data lineage feature, the service automatically installs the following dependency if it is not already installed: FoundationDB (`opencontent_fdb`)
Watson Knowledge Studio	None	None	This service automatically installs the following dependencies if they are not already installed: Cloud Native PostgreSQL (`postgresql`)
Watson Machine Learning	None	The following services are not required but provide additional functionality: Watson Machine Learning Accelerator provides the Experiment Builder user interface.	The service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`)
Watson Machine Learning Accelerator	To install this service, you must install a GPU Operator. On x86-64, either: NVIDIA GPU Operator 1.10 on OpenShift 4.10; NVIDIA GPU Operator 1.7.1 on OpenShift 4.8. On POWER: Rocket Software GPU Operator 1.10 on OpenShift 4.10 (POWER9)	To install this service, you must have the following service already installed: Scheduling service. The following services are not required but provide additional functionality: Watson Machine Learning enables you to use Deep Learning Experiments.	None
Watson OpenScale	If you prefer to connect to an external database, you must have Db2 Enterprise Server Edition 11.5 or later.	If you want to connect to an integrated database, you must have at least one instance of a supported, integrated database: Db2; Db2 Warehouse. The following services are not required but provide additional functionality. To use the functionality, you must install the services before you install Watson OpenScale: Watson Studio enables you to create AutoAI models and Jupyter Notebooks, and to set up a demo environment where you can quickly tour the Watson OpenScale capabilities. Watson Machine Learning enables you to create a deployed model that Watson OpenScale can check for bias and drift, and to automatically log payloads.	None
Watson Speech services	None	None	This service automatically installs the following dependencies if they are not already installed: Cloud Native PostgreSQL (`postgresql`); MinIO (`opencontent_minio`); RabbitMQ (`opencontent_rabbitmq`); Watson audit webhook (`opencontent_auditwebhook`); Watson Gateway (`watson_gateway`)
Watson Studio	None	The service automatically installs the following services if they are not already installed: Data Refinery; Watson Studio Runtimes	This service automatically installs the following dependencies if they are not already installed: Common core services (`ccs`)
Watson Studio Runtimes	To use the following environments, you must have NVIDIA GPU Operator 1.6.2 installed: Jupyter Notebooks with Python 3.9 for GPU	To install this service, you must have the following service already installed: Watson Studio	None
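
When you plan an installation, it can help to check the "must have ... already installed" entries in the Service dependencies column programmatically. The following Python sketch is one possible way to do that and is not an IBM tool. The names `REQUIRED_SERVICES` and `missing_prerequisites` are hypothetical, and the mapping covers only the unconditional prerequisites from the table; conditional requirements (for example, the integrated database options for Db2 Data Gate, Product Master, and Watson OpenScale) and automatically installed dependencies are intentionally left out.

```python
# Unconditional prerequisite services, transcribed from the
# "Service dependencies" column of the table above.
REQUIRED_SERVICES = {
    "Data Privacy": {
        "Analytics Engine Powered by Apache Spark",
        "Watson Knowledge Catalog",
    },
    "Decision Optimization": {"Watson Studio", "Watson Machine Learning"},
    "Execution Engine for Apache Hadoop": {"Watson Studio"},
    "RStudio Server with R 3.6": {"Watson Studio"},
    "SPSS Modeler": {"Watson Studio"},
    "Voice Gateway": {
        "Watson Assistant",
        "Watson Speech to Text",
        "Watson Text to Speech",
    },
    "Watson Machine Learning Accelerator": {"Scheduling service"},
    "Watson Studio Runtimes": {"Watson Studio"},
}


def missing_prerequisites(planned, installed):
    """Map each planned service to the prerequisites that are not installed.

    Services with no missing prerequisites are omitted from the result.
    """
    already_installed = set(installed)
    report = {}
    for service in planned:
        missing = REQUIRED_SERVICES.get(service, set()) - already_installed
        if missing:
            report[service] = sorted(missing)
    return report
```

For example, calling `missing_prerequisites(["Decision Optimization"], ["Watson Studio"])` reports that Watson Machine Learning still needs to be installed first.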

Supported web browsers

You can use the following web browsers to access the Cloud Pak for Data web client:

Mozilla Firefox (recommended)
Google Chrome
Microsoft Edge

Use the latest available version of your browser, or the latest version that is approved by your enterprise.