Table of contents

System requirements for services

Ready to install a service on IBM® Cloud Pak for Data? Before you get started, use the information in this document to determine whether:
  • You have sufficient resources to install the service
  • You have the required software dependencies for the service

Service support information

The information is provided in the following sections:

Note: The information in this topic is subject to change without notice. It is recommended that you refer to the latest version of this topic on the web when planning your installation.

The minimum resource recommendations are for guidance only. Work with your IBM Sales representative to generate recommendations based on your needs.

While IBM strives to ensure that this topic is accurate, there might be mistakes in this information. Information in this topic is not legally binding.

Version support

Use this table to determine which version (or versions) of a service are supported on the version of Cloud Pak for Data that you are running.

Service Cloud Pak for Data Version 3.5.0
Anaconda Repository with IBM * 1.0.1
Analytics Engine Powered by Apache Spark 3.5.0
Analytics Zoo for Apache Spark This information is not currently available. Contact the provider for details.
CockroachDB This information is not currently available. Contact the provider for details.
Cognos® Analytics 3.5.1
Cognos Dashboards 3.5.0
Data Refinery 3.5.1
Data Virtualization 1.5.0
Datameer This information is not currently available. Contact the provider for details.
DataStage® 3.5.1
Db2® 3.5.0
Db2 Big SQL 7.1.1
Db2 Data Gate 3.5.0
Db2 Data Management Console 3.1.3
Db2 Event Store 2.0.1
Db2 for z/OS® Connector 3.2.2
Db2 Warehouse 3.5.0
Decision Optimization 3.5.0
EDB Postgres 2.0.0
Edge Analytics * 1.1.0 beta
Execution Engine for Apache Hadoop 3.5.0
Figure Eight This information is not currently available. Contact the provider for details.
Financial Crimes Insight® This service has not been released on 3.5.0
Financial Services Workbench 2.5
Guardium® External S-TAP® * 11.2.0
Intel Deep Learning Reference Stack - PyTorch This information is not currently available. Contact the provider for details.
Intel Deep Learning Reference Stack - TensorFlow This information is not currently available. Contact the provider for details.
Intel Distribution of Python This information is not currently available. Contact the provider for details.
Jupyter Notebooks with Python 3.7 for GPU 3.5.0
Jupyter Notebooks with R 3.6 3.5.0
Lightbend Platform This information is not currently available. Contact the provider for details.
Master Data Connect * 1.0.0
MongoDB 3.5.0
NetApp Trident (Previously NetApp ONTAP) This information is not currently available. Contact the provider for details.
Open Data for Industries * 1.0.0
OpenPages® * 8.2.0
Operational Analytics for ERP This information is not currently available. Contact the provider for details.
Planning Analytics 3.5.0
Portworx This information is not currently available. Contact the provider for details.
Prolifics Customer Prospecting Accelerator This information is not currently available. Contact the provider for details.
RStudio® Server with R 3.6 3.5.0
Senzing This information is not currently available. Contact the provider for details.
SPSS® Modeler 3.5.0
Streams 5.5.0
Streams Flows * 3.5.0
Virtual Data Pipeline 8.1
WAND Foundation Taxonomies This information is not currently available. Contact the provider for details.
Watson™ Assistant 1.5.0
Watson Assistant for Voice Interaction * 1.0.7
Watson Discovery 2.2.0
Watson Knowledge Catalog 3.5.1
Watson Knowledge Studio 1.1.2
Watson Language Translator 1.2.0
Watson Machine Learning 3.5.0
Watson Machine Learning Accelerator 2.2
Watson OpenScale 3.5.0
Watson Speech to Text 1.2.0
Watson Studio 3.5.0
Watson Text to Speech 1.2.0

* This service is not listed in the catalog in the web client when you install the Cloud Pak for Data control plane. For information on installing this service, see Services outside the catalog.

Hardware requirements

Use this table to determine whether you have the minimum required resources to install the service on Cloud Pak for Data.

Important: The information in this table represents the minimum resources you need to successfully install the service. You might need additional resources to support your specific workload. Work with your IBM Sales representative to generate more accurate calculations based on your expected workload.
Keep the following information in mind as you review the hardware requirements:
  • Unless explicitly stated, the service can be scheduled across existing worker nodes
  • If a service requires dedicated nodes, the documentation for the service describes how to prevent other services from running on those nodes.

Review Storage considerations to determine whether the storage that you plan to use is supported on your cluster architecture.

Service x86-64 POWER® Z vCPU Memory Storage requirements Notes
Anaconda Repository with IBM     4 vCPU 8 GB RAM 500 GB This service cannot be installed on your Red Hat® OpenShift® cluster. For details, see the Anaconda installation requirements.
Analytics Engine Powered by Apache Spark   4 vCPU 16 GB RAM 50 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage class:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

These resources are sufficient to install the microservices that enable you to create Spark runtimes. However, the resources required for the actual runtimes vary based on your workload.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Spark runtimes

Spark jobs use emptyDir volumes for temporary storage and for shuffling. If your Spark jobs use a lot of disk space for temporary storage or shuffling, ensure that you have sufficient space on the local disk where emptyDIR volumes are created:

If you don't have sufficient space on the local disk, Spark jobs might run slowly and some of the executors might evict jobs.

In general, it is recommended that you have a minimum of 50 GB of temporary storage for each vCPU request.

Analytics Zoo for Apache Spark     4 vCPU 20 GB RAM

This information is not currently available.

For more information, see the deployment documentation.
CockroachDB     2 vCPU 2 GB RAM 250 GB
Supported storage types:
  • ext4 Linux® file system
Development deployment
1 node
Production deployment
3 or more nodes
Recommended configuration
Refer to the official CockroachDB documentation to determine the appropriate specifications based on your expected workloads.
Cognos Analytics     11 vCPU 29 GB RAM 43 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage class:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

When you provision the Cognos Analytics service, you specify the size of the instance.

The information here is for the smallest instance. For other sizes, see Provisioning the Cognos Analytics service.

Cognos Dashboards   3 vCPU 6.8 GB RAM 32 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage class:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Data Refinery   1 vCPU 0.5 GB RAM No additional storage needed beyond what is required for the common core services This service is installed when you install Watson Knowledge Catalog or Watson Studio

POWER support is available only for Watson Studio. To use Data Refinery on POWER, you must install Watson Studio on POWER.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Data Virtualization     14.2 vCPU
  • 4 vCPU for the engine pod
  • 4 vCPU for one worker pod
  • 6.2 vCPU for auxiliary services
41 GB RAM
  • 16 GB RAM for the engine pod
  • 16 GB RAM for one worker pod
  • 9 GB RAM for auxiliary services
175 GB

Combined storage requirements for caching, the engine pod, and one worker pod.

Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage class:
    • portworx-dv-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
Minimum resources for an installation that can support up to 1000 simple queries per hour, 12 moderately complex queries per hour, or 2 complex queries per hour.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

You can optionally configure node affinity. For details see Setting up node affinity.

Datameer    

Not available

Not available

Not available

Contact Datameer for assistance.
DataStage     6 vCPU 24 GB RAM 300 GB

Supported storage types:

  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs (default storage class for installation)
    • ocs-storagecluster-ceph-rdb
  • Portworx
    Required storage classes:
    • portworx-shared-gp3 (default storage class for installation)
    • portworx-solr-sc
    • portworx-kafka-sc
    • portworx-cassandra-sc
    • portworx-db2-rwo-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Db2 1.5 vCPU 5 GB RAM 200 GB
Supported storage types:
  • hostPath
  • IBM Spectrum® Scale
  • IBM Spectrum Scale CSI
    Required storage class:
    • ibm-spectrum-scale-csi
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs (default storage class for installation)
    • ocs-storagecluster-ceph-rdb
  • Portworx
    Required storage classes:
    • portworx-shared-gp3 (default storage class for installation)
    • portworx-db2-rwx-sc
    • portworx-db2-rwo-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Dedicated nodes are recommended for production deployments of Db2. For details, see Setting up dedicated nodes.

Db2 Big SQL     10 vCPU:
  • 4 vCPU for the head node
  • 4 vCPU for the worker node
  • 1 for auxiliary services
  • 1 for optional object store access
67 GB RAM
  • 32 GB RAM for the head node
  • 32 GB RAM for the worker node
  • 1 GB RAM for auxiliary services
  • 2 GB RAM for optional object store access
100 GB

Used for metadata, such as the Db2 Big SQL catalog.

Supported storage types:
  • NFS (recommended)
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx
    Required storage class:
    • portworx-dv-shared-gp3
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation that can support up to 20 simple queries per hour, 5 moderately complex queries per hour, and 1 complex query per hour.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Db2 Data Gate   5 vCPU 13 GB RAM 50 GB

Supported storage types:

  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx
    Required storage class:
    • portworx-db2-rwx-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Db2 Data Management Console 5 vCPU 10 GB RAM 2 GB
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx
    Required storage class:
    • portworx-shared-gp3
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

For information on sizing the provisioned instance, see Provisioning the service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Db2 Event Store     8 vCPU per node 64 GB RAM per node
  • 250 GB of local storage on a SSD
  • 50 GB of shared storage on one of the supported storage types
System shared storage
Supported storage types:
  • hostPath (a mounted directory on a cluster file system, such as IBM Spectrum Scale)
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
Data shared storage
Supported storage types:
  • hostPath (a mounted directory on a cluster file system, such as IBM Spectrum Scale)
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • IBM Cloud Object Storage

Each Db2 Event Store database must have its own set of dedicated nodes.

Development deployment
3 nodes
Production deployment
3 nodes
Recommended configuration
Refer to the Db2 Event Store machine sizing guide to determine the appropriate specifications based on your expected workloads.
Db2 for z/OS Connector     2.3 vCPU 5 GB RAM 5 GB for temporary storage.
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx

    Required storage class:

    • portworx-shared-gp

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Db2 Warehouse
SMP
7 vCPU
MPP
40.85 vCPU
SMP
98 GB RAM
MPP
614 GB RAM
200 GB
SMP
Supported storage types:
  • hostPath
  • IBM Spectrum Scale
  • IBM Spectrum Scale CSI
    Required storage class:
    • ibm-spectrum-scale-csi
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs (default storage class for installation)
    • ocs-storagecluster-ceph-rdb
  • Portworx
    Required storage classes:
    • portworx-shared-gp3 (default storage class for installation)
    • portworx-db2-rwx-sc
    • portworx-db2-rwo-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
MPP
Supported storage types:
  • hostPath

    The hostPath must be exposed through a cluster file system.

  • IBM Spectrum Scale
  • IBM Spectrum Scale CSI
    Required storage class:
    • ibm-spectrum-scale-csi
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs (default storage class for installation)
    • ocs-storagecluster-ceph-rdb
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
Use dedicated nodes for:
  • Production SMP deployments
  • MPP deployments
Development deployment
  • 1 node for SMP
  • 2 nodes for MPP
Production deployment
  • 1 node for SMP
  • 2-999 nodes for MPP
Recommended configuration

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Decision Optimization   1 vCPU 1.5 GB RAM 12 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage

    Required storage class:

    • ocs-storagecluster-cephfs
  • Portworx
    Required storage class:
    • portworx-shared-gp3
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Work with IBM Sales to get a more accurate sizing based on your expected workload.

EDB Postgres     1 vCPU 2 GB 100 GB
Supported storage types:
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-ceph-rdb
  • Portworx
    Required storage class:
    • portworx-db-gp2-sc

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Edge Analytics *     0.1 vCPU 256 MB RAM

Not applicable

The minimum recommendations are for demos and proof-of-concept with Cloud Pak for Data.

Execution Engine for Apache Hadoop   0.1 vCPU 64 MB RAM 100 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx
    Required storage classes:
    • portworx-shared-gp3
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

The requirements are for an edge node on the Apache Hadoop cluster. These resources do not need to be available in Cloud Pak for Data cluster.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Figure Eight    

Not available

Not available

Not available

Contact Figure Eight for assistance.
Financial Services Workbench     16 vCPU 64 GB RAM 500 GB
Supported storage types:
  • NFS

3 or more nodes.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Guardium External S-TAP *     4 vCPU 1 GB RAM 1.5 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage

    Required storage class:

    • ocs-storagecluster-ceph-rdb

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Intel Deep Learning Reference Stack - PyTorch    

Not available

Not available

Not available

Requires Intel AVX-512.

For more information, see the deployment instructions.

Intel Deep Learning Reference Stack - TensorFlow    

Not available

Not available

 

Requires Intel AVX-512.

For more information, see the deployment instructions.

Intel Distribution of Python    

Not available

Not available

2 GB

Requires Intel Streaming SIMD Extensions.

For more information, see the deployment instructions. Contact Intel for assistance.

Jupyter Notebooks with Python 3.7 for GPU    

Not applicable

Not applicable

No additional storage is required.

Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
At least 1 GPU core is required to use this service.

The sizing for this service is included in the Watson Studio minimum installation footprint.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Jupyter Notebooks with R 3.6  

Not applicable

Not applicable

No additional storage is required.

Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

The sizing for this service is included in the Watson Studio minimum installation footprint.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Lightbend Platform     5 vCPU

Not available

Not available

To get a more accurate sizing based on your workload, see Calculating Pipelines Application resource usage.

Contact Lightbend for assistance.

Master Data Connect *     24 vCPU 48 GB RAM 100 GB

The amount of storage that you need depends on the amount of master data that you load.

Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx

    Required storage class:

    • portworx-shared-gp3

Minimum resources for an installation that can support up to 300 TPS read requests and 1 Million records.

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

MongoDB     4 vCPU 8 GB RAM 100 GB

Size the storage based on the amount of data that you plan to store.

Supported storage types:
  • Local storage
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-ceph-rdb
  • Portworx
    Required storage class:
    • portworx-db-gp2-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
Requires dedicated nodes. For details, see Setting up dedicated nodes.
Development deployment
3 nodes
Production deployment
3 nodes
Recommended configuration
Refer to the Ops Manager System Requirements to determine the appropriate specifications based on your expected workloads.
NetApp Trident     < 1 vCPU 150 MB RAM

Not available

This service has a very small installation footprint and does not consume an appreciable amount of cluster resources.

Contact NetApp for assistance.

Open Data for Industries *     48 vCPU 96 GB RAM 4 TB
Supported storage types:
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-ceph-rdb
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Minimum recommended configuration
  • 3 master nodes with 8 vCPU and 16 GB RAM each
  • 3 worker nodes with 16 vCPU, 36 GB RAM, and 1 TB memory each
OpenPages *     5.43 vCPU 14.2 GB RAM 250 GB shared storage

Supported storage types:

  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs (default storage class for installation)
    • ocs-storagecluster-ceph-rdb

Use the same storage class for your installation and your instance.

These values represent the minimum resources for OpenPages with Db2.

When you provision the OpenPages service, you specify the size of the instance and the storage class to use.

Db2

OpenPages uses Db2 as a service, which is different from the Db2 service in the services catalog.

You can optionally provision the Db2 database on dedicated nodes. For details, see Provisioning an instance of OpenPages.

Setting quotas
If you set a quota for the service, use the following guidelines:
vCPU quota
1.43 vCPU + the vCPU required for the instance.
Memory quota
2.2 GB + the memory required for the instance

For example, if you choose Extra Small (4 vCPU), set the vCPU quota to 5.43 and the memory quota to 14.2.

Operational Analytics for ERP    

Not available

Not available

Not available

Contact LIS.TEC GmbH for assistance.
Planning Analytics     12 vCPU 49 GB RAM 20 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx

    Required storage class:

    • portworx-shared-gp3

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Select the size of your instance when you provision Planning Analytics. For details, see Provisioning the Planning Analytics service.

Portworx     4 vCPU 4 GB RAM 2 GB For more information, see the Portworx prerequisites.
Prolifics Customer Prospecting Accelerator    

Not applicable

Not applicable

1 GB

This service includes an SPSS Modeler flow. Resources are consumed by the SPSS Modeler service.

A minimal amount of storage is needed for the flow and data sets.

RStudio Server with R 3.6  

Not applicable

Not applicable

No additional storage is required.

Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

The sizing for this service is included in the Watson Studio minimum installation footprint.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Senzing     4 vCPU 16 GB RAM 200 Gi For more information, see the Senzing prerequisites.
SPSS Modeler   0.65 vCPU 11 GB

No additional storage is required.

Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Streams     1.1 vCPU 13 GB RAM 30.1 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx

    Required storage class:

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
Resource requests are based on:
  • One instance of the service
  • One application resource
  • One builder pool resource

Resource requests increase as additional service instances are provisioned, additional Streams applications are submitted, and additional builder pools are added.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Streams Flows *     0.3 vCPU 384 MB RAM

No additional storage is required.

Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Virtual Data Pipeline    

Not available

Not available

Not available

Not available

WAND Foundation Taxonomies    

Not applicable

Not applicable

Not available

This service is a data set that is imported to Cloud Pak for Data. The service does not consume an appreciable amount of cluster resources.

Contact WAND Inc. for assistance.

Watson Assistant     15 vCPU 100 GB RAM 305 GB
Supported storage types:
  • Portworx
    • portworx-watson-assistant-sc

Minimum resources for an installation with a single replica per service.

The components that are required to process some natural languages require additional resources. For details, see Setting up the cluster for Watson Assistant.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

When you are training machine learning models, Watson Assistant requires at least one node to have 4 CPUs that can be dedicated to training. This capacity is needed only when training models. (Training occurs after changes are made to the training data for an assistant.)

Your system must meet the following additional requirements:
  • Nodes must be Intel architecture
  • CPUs must have a clock speed of 2.4 GHz or higher
  • CPUs must support Linux SSE 4.2
  • CPUs must support the AVX2 instruction set

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Assistant for Voice Interaction *     2 vCPU 8 GB RAM 50 GB
Supported storage types:
  • Portworx

    Required storage class:

    • portworx-shared-gp
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for a system that can provide voice-only support for up to 11 concurrent calls.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Discovery    
Development
16 vCPU
Production
23 vCPU
Development
96 GB RAM
Production
150 GB RAM

Sizing information is not currently available.

Supported storage types:
  • Portworx
    Required storage class:
    • portworx-db-gp3-sc

Development installations have a single replica per service.

Productions installations have multiple replicas per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

CPUs must support the AVX2 instruction set.

Watson Knowledge Catalog     27 vCPU 108 GB RAM
  • 100 GB local storage on each node
  • 600 GB shared storage
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs (default storage class for installation)
    • ocs-storagecluster-ceph-rdb

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3 (default storage class for installation)
    • portworx-cassandra-sc
    • portworx-couchdb-sc
    • portworx-db2-rwo-sc
    • portworx-elastic-sc
    • portworx-metastoredb-sc
    • portworx-gp3-sc
    • portworx-kafka-sc
    • portworx-solr-sc

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
Local storage
Adjust the amount of local storage per node based on the volume of data you are analyzing with the automated discovery. Local storage should be approximately 2 times larger than the amount of data you expect the system to process concurrently.
Shared storage
The raw size of shared storage depends on the storage class you use. For example, if you use portworx-shared-gp3, which has 3 replicas, multiply the storage by the number of replicas.

If Data Refinery is not installed, add the vCPU and memory required for Data Refinery to the information listed for Watson Knowledge Catalog.

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Knowledge Studio     12 vCPU 120 GB RAM 420 GB
Supported storage types:
  • NFS
  • Portworx
    Required storage classes:
    • portworx-shared-gp3 (default storage class for installation)
    • portworx-db-gp3-sc

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Language Translator    
Development
8 vCPU
Production
16 vCPU
Development
32 GB RAM
Production
92 GB RAM
60 GB
Supported storage types:
  • Portworx

    Required storage classes:

    • OpenShift 3.11: portworx-db-gp3-sc
    • OpenShift 4.5: portworx-db-gp2-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Development installations have a single replica per service.

Productions installations have multiple replicas per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

CPUs must support the Streaming SIMD Extensions (SSE) 4.2 instruction set.

Watson Machine Learning   6 vCPU 12 GB RAM 150 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage classes:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
AVX is recommended but not required for AutoAI experiments.

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Machine Learning Accelerator *    
  • 7 vCPU
  • 1-4 GPU
19 GB RAM 100 GB
Supported storage types:
  • NFS
  • Portworx

    Required storage class:

    • portworx-shared-gp

This service is only supported on Red Hat OpenShift Version 4.5.8 or later.

This service is not supported on Red Hat OpenShift 3.11.

Watson Machine Learning Accelerator does not support FIPS.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson OpenScale   14 vCPU 72 GB RAM 100 GB
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs
  • Portworx

    Required storage class:

    • portworx-shared-gp3
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Speech to Text     11 vCPU 40 GB RAM 900 GB per worker node
Supported storage types:
  • Portworx
    Required storage class:
    • portworx-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
  • vSphere Volumes
  • Amazon Elastic Block Store (EBS)
    Required storage class:
    • gp2

The installation resources listed are for the minimum development footprint. The development footprint provides a single replica per service except for the datastore components.

Development
3 nodes
Production
3 nodes

CPUs must support the AVX2 instruction set

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Studio   1 vCPU 8.8 GB RAM No additional storage needed beyond what is required for the common core services
Supported storage types:
  • NFS
  • OpenShift Container Storage
    Required storage class:
    • ocs-storagecluster-cephfs

    See the storage requirements for the common core services

  • Portworx
    Required storage classes:
    • portworx-shared-gp3

    See the storage requirements for the common core services

  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid

Minimum resources for an installation with a single replica per service.

If Data Refinery is not installed, add the vCPU and memory required for Data Refinery to the information listed for Watson Studio.

Work with IBM Sales to get a more accurate sizing based on your expected workload.

Watson Text to Speech     5 vCPU 8 GB RAM 700 GB
Supported storage types:
  • Portworx
    Required storage class:
    • portworx-sc
  • IBM Cloud File Storage
    Supported storage classes:
    • ibmc-file-gold-gid
    • ibm-file-custom-gold-gid
  • vSphere Volumes
  • Amazon Elastic Block Store (EBS)
    Required storage class:
    • gp2

The installation resources listed are for the minimum development footprint. The development footprint provides a single replica per service except for the datastore components.

Development
3 nodes
Production
3 nodes

CPUs must support the AVX2 instruction set

Work with IBM Sales to get a more accurate sizing based on your expected workload.

* This service is not listed in the catalog in the web client when you install the Cloud Pak for Data control plane. For information on installing this service, see Services outside the catalog.

Software dependencies

Use this table to determine whether the service that you want to install depends on other software being available. For example, some services require other software to be installed outside of Cloud Pak for Data (marked as external dependencies). And some services require other services to be installed on Cloud Pak for Data (marked as service dependencies).

Remember: If a service requires the Cloud Pak for Data common core services, and the common core services are not installed, they will be automatically installed when you install the service. However, if the services are already installed in the Red Hat OpenShift project (namespace) they will not be installed again.
Service External dependencies Service dependencies
Anaconda Repository with IBM For details, see the Anaconda installation requirements.

None

Analytics Engine Powered by Apache Spark

None

Common core services
Analytics Zoo for Apache Spark

This information is not currently available.

This information is not currently available.

CockroachDB

This information is not currently available.

This information is not currently available.

Cognos Analytics Cognos Analytics uses a relational database to store configuration data, global settings, data server connections, and product-specific content. For details, see Configuring the content store for Cognos Analytics. Common core services
Cognos Dashboards

None

Common core services
Data Refinery

None

None

Data Virtualization

None

  • Common core services
  • If you want to govern your virtual data or publish it to catalogs, you must install Watson Knowledge Catalog.
Datameer

This information is not currently available.

This information is not currently available.

DataStage

None

None

Db2

None

None

Db2 Big SQL To use this service you must have remote data storage, such as:
  • A Hadoop cluster
  • Object storage

None

Db2 Data Gate To use this service, you must have:
  • IBM z/OS V2.2 (5650-ZOS) or later.
  • IBM Db2 for z/OS V12 (5650-DB2® or 5770-AF3) with APAR fixes for PH20587, PH27992, PH28849, and PH29443 installed and running at Function Level 500 or higher.
  • Distributed data facility (DDF) with a secure port, configured for network encryption through AT-TLS. For details, see Configuring network access between Db2 Data Gate and Db2 for z/OS.
To use this service, you must have at least one instance of a supported, integrated database:
  • Db2
  • Db2 Warehouse
Db2 Data Management Console

None

None

Db2 Event Store

None

None

Db2 for z/OS Connector To use this service, you must have an external Db2 for z/OS system.

None

Db2 Warehouse

None

None

Decision Optimization

None

To install this service, you must have the following services already installed:
  • Watson Studio
  • Watson Machine Learning
EDB Postgres

None

None

Edge Analytics *

None

None

Execution Engine for Apache Hadoop To use this service, you must have an external Apache Hadoop cluster. To install this service, you must have the following service already installed:
  • Watson Studio
Figure Eight

This information is not currently available.

This information is not currently available.

Financial Services Workbench

This information is not currently available.

This information is not currently available.

Guardium External S-TAP * To use this service, you must have Guardium V11.2. This service does not require any other Cloud Pak for Data.

You can use the service to monitor any database that is supported by Guardium External S-TAP. For information about supported databases, see External S-TAP supported databases.

However, you can use the service to monitor integrated databases, such as:

  • Db2
  • Db2 Warehouse
Intel Deep Learning Reference Stack - PyTorch

This information is not currently available.

This information is not currently available.

Intel Deep Learning Reference Stack - TensorFlow

This information is not currently available.

This information is not currently available.

Intel Distribution of Python

This information is not currently available.

This information is not currently available.

Jupyter Notebooks with Python 3.7 for GPU

None

To install this service, you must have the following service already installed:
  • Watson Studio
Jupyter Notebooks with R 3.6

None

To install this service, you must have the following service already installed:
  • Watson Studio
Lightbend Platform

This information is not currently available.

This information is not currently available.

Master Data Connect * To use this service, you must have:
  • An external IBM InfoSphere® Master Data Management V11.6.0.11 system
  • MDM Publisher
  • Elasticsearch
    Follow the system configuration steps from the Elasticsearch documentation to configure your operating system to allow the user running Elasticsearch to access more resources than allowed by default. Consider the following settings before going to production:
    • Increase file descriptors*
    • Ensure sufficient virtual memory*
    • Disable swapping
    • Ensure sufficient threads
    • JVM DNS cache settings
    • Temporary directory not mounted with noexec

    Settings marked with an asterisk (*) are mandatory on all nodes where Master Data Connect will be installed.

None

MongoDB

None

None

NetApp Trident (Previously NetApp ONTAP) To use this service, you must have an external NetApp ONTAP storage system.

None

Open Data for Industries * To use this service you must have:
  • Argocd 0.0.8
  • Couch DB 3.1.0
  • MinIO 1.0.9
  • Elasticsearch 6.8.3
  • Keycloak 11.0.0
  • AMQ Broker 7.7.0

None

OpenPages *

None

None

However, OpenPages automatically installs Db2 as a service.

Operational Analytics for ERP To use this service, you must have:
  • An external SAP ERP system
  • An external Operational Analytics for ERP database
To install this service, you must have the following service already installed:
  • Cognos Analytics
Planning Analytics Planning Analytics for Microsoft Excel requires Microsoft Excel.

None

Portworx

This information is not currently available.

This information is not currently available.

Prolifics Customer Prospecting Accelerator

This information is not currently available.

To install this service, you must have the following services already installed:
  • Db2 Warehouse
  • SPSS Modeler
RStudio Server with R 3.6

None

To install this service, you must have the following service already installed:
  • Watson Studio
Senzing

This information is not currently available.

This information is not currently available.

SPSS Modeler

None

To install this service, you must have the following service already installed:
  • Watson Studio
Streams

None

  • Common core services
  • If you want to use the sample Python notebooks that are included with Streams, you must install the Watson Studio service.
Streams Flows *

None

To install this service, you must have the following service already installed:
  • Streams
  • Watson Studio
  • If you want to use machine learning models, you must install the Watson Machine Learning service before you install Streams Flows.
Virtual Data Pipeline

This information is not currently available.

Not applicable

WAND Foundation Taxonomies

None

To install this service, you must have the following service already installed:
  • Watson Knowledge Catalog
Watson Assistant

None

  • IBM Cloud Platform Common Services Events Service
  • Watson Discovery is required only if you want to add a search skill to your assistant.
Watson Assistant for Voice Interaction *

None

None

Watson Discovery

None

None

Watson Knowledge Catalog

None

Common core services
Watson Knowledge Studio

None

None

Watson Language Translator

None

None

Watson Machine Learning

None

  • Common core services
  • If you want to use AutoAI, you must install the Watson Studio service before you install Watson Machine Learning.
Watson Machine Learning Accelerator *
  • NVIDIA GPU driver 440.33.01
  • cert-manager 1.0.3 or later
To install this service, you must have the following service already installed:
  • IBM Cloud Pak for Data Scheduling service
  • If you want to use Deep Learning Experiments, you must install the Watson Machine Learning service before you install Watson Machine Learning Accelerator.
Watson OpenScale

None

To use some features, you must have the following services already installed:
  • If you want to use AutoAI, you must install the Watson Studio service before you install Watson OpenScale.
  • If you want to use integrated machine learning, you must install the Watson Machine Learning service before you install Watson OpenScale.
Watson Speech to Text

None

None

Watson Studio

None

Common core services
Watson Text to Speech

None

None

* This service is not listed in the catalog in the web client when you install the Cloud Pak for Data control plane. For information on installing this service, see Services outside the catalog.

Multitenancy support

According to Gartner, multitenancy is:

Multitenancy is a reference to the mode of operation of software where multiple independent instances of one or multiple applications operate in a shared environment. The instances (tenants) are logically isolated, but physically integrated. The degree of logical isolation must be complete, but the degree of physical integration will vary.

Cloud Pak for Data support different installation and deployment mechanisms for achieving multitenancy. However, not all services support the same mechanisms. Cloud Pak for Data supports the following mechanisms:
  • Installing multiple instances of a service in the same Red Hat OpenShift project (namespace).
  • Installing a single instance of the service in the Red Hat OpenShift project (namespace) and then provisioning multiple instances of that service.
  • Installing multiple instances of a service in different Red Hat OpenShift projects that are tethered to a single deployment Cloud Pak for Data.
  • Installing multiple instances of a service in different Cloud Pak for Data projects that are associated with separate Cloud Pak for Data deployments.

For more information on these mechanisms, see Architecture for Cloud Pak for Data.

Use this table to determine which mechanisms the service that you want to install supports.

Service Multiple installations in a single namespace Multiple service instances from a single installation Multiple installations in tethered namespaces Multiple installations in separate namespaces (separate Cloud Pak for Data deployments)
Anaconda Repository with IBM

Not applicable

Not applicable

Not applicable

Not applicable

Analytics Engine Powered by Apache Spark

No. One instance only.

Yes

No

Yes

Analytics Zoo for Apache Spark

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

CockroachDB

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Cognos Analytics

No. One instance only.

No. One instance only.

No

Yes

Cognos Dashboards

No. One instance only.

No. One instance only.

No

Yes

Data Refinery

No. One instance only.

No. One instance only.

No

See Watson Knowledge Catalog or Watson Studio.
Data Virtualization

No. One instance only.

No. One instance only.

No

Yes

Datameer

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

DataStage

No. One instance only.

No. One instance only.

No

Yes

Db2

No. One instance only.

Yes, but you are constrained by the number of nodes in your cluster.

You cannot have multiple Db2 deployments on the same node.

No

Yes

Db2 Big SQL

No. One instance only.

Yes

No

Yes

Db2 Data Gate

No. One instance only.

Yes

No

Yes

Db2 Data Management Console

No. One instance only.

No. One instance only.

No

Yes

Db2 Event Store

No. One instance only.

No. One instance only.

No

Yes, but you are constrained by the number of nodes in your cluster.

Db2 for z/OS Connector

No. One instance only.

No. One instance only.

No

Yes

Db2 Warehouse

No. One instance only.

Yes, but you are constrained by the number of nodes in your cluster.

You cannot have multiple Db2 deployments on the same node.

No

Yes, but you are constrained by the number of nodes in your cluster.

You cannot have multiple Db2 deployments on the same node.

Decision Optimization

No. One instance only.

No. One instance only.

No

Yes

EDB Postgres

No. One instance only.

Yes

No

Yes

Edge Analytics *

No. One instance only.

No. One instance only.

No

Yes

Execution Engine for Apache Hadoop

No. One instance only.

No. One instance only.

No

Yes

Figure Eight

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Financial Services Workbench

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Guardium External S-TAP *

No. One instance only.

Yes

No

Yes

Intel Deep Learning Reference Stack - PyTorch

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Intel Deep Learning Reference Stack - TensorFlow

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Intel Distribution of Python

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Jupyter Notebooks with Python 3.7 for GPU

No. One instance only.

No. One instance only.

No

Yes

Jupyter Notebooks with R 3.6

No. One instance only.

No. One instance only.

No

Yes

Lightbend Platform

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Master Data Connect *

No. One instance only.

No. One instance only.

No

Yes

MongoDB

No. One instance only.

Yes, but you are constrained by the number of nodes in your cluster.

No

Yes

NetApp Trident

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Open Data for Industries *

No. One instance only.

No. One instance only.

No

No. One instance only.

OpenPages *

No. One instance only.

No. One instance only.

No

Yes

Operational Analytics for ERP

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Planning Analytics

No. One instance only.

No. One instance only.

No

Yes

Portworx

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Prolifics Customer Prospecting Accelerator

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

RStudio Server with R 3.6

No. One instance only.

No. One instance only.

No

Yes

Senzing

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

SPSS Modeler

No. One instance only.

No. One instance only.

No

Yes

Streams

No. One instance only.

Yes

No

Yes

Streams Flows *

No. One instance only.

No. One instance only.

No

Yes

Virtual Data Pipeline

Not applicable

Not applicable

Not applicable

Not applicable

WAND Foundation Taxonomies

This information is not currently available.

This information is not currently available.

This information is not currently available.

This information is not currently available.

Watson Assistant

No. One instance only.

Yes, up to 30 instances.

Not applicable

Yes

Watson Assistant for Voice Interaction *

No. One instance only.

No. One instance only.

No

Yes

Watson Discovery

No. One instance only.

No. One instance only.

No

Yes

Watson Knowledge Catalog

No. One instance only.

No. One instance only.

No

No. One instance only.

Watson Knowledge Studio

No. One instance only.

Yes, up to 30 instances.

No

Yes

Watson Language Translator

No. One instance only.

Yes

No

Yes

Watson Machine Learning

No. One instance only.

No. One instance only.

No

Yes

Watson Machine Learning Accelerator *

No. One instance only.

No. One instance only.

No

Only one installation in a tethered namespace.

Yes

Watson OpenScale

No. One instance only.

Yes

No

Yes

Watson Speech to Text

No. One instance only.

Yes

No

Yes

Watson Studio

No. One instance only.

No. One instance only.

No

Yes

Watson Text to Speech

No. One instance only.

Yes

No

Yes

* This service is not listed in the catalog in the web client when you install the Cloud Pak for Data control plane. For information on installing this service, see Services outside the catalog.