Cloud deployment environments
You can choose to deploy IBM® Cloud Pak for Data in the environment that suits your business needs.
Cloud Pak for Data can be deployed in various private cloud and public cloud environments.
Deployment environment | Managed OpenShift® | Self-managed OpenShift |
---|---|---|
On-premises | ✓ | ✓ |
IBM Cloud | ✓ | ✓ |
Amazon Web Services (AWS) | ✓ | ✓ |
Microsoft Azure | ✓ | ✓ |
Google Cloud | | ✓ |
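The support matrix can also be expressed as a small lookup, which is convenient when you script environment checks. The following is a minimal sketch, not part of any IBM tooling: the names `SUPPORT_MATRIX` and `supported_openshift_types` are hypothetical, and the values are transcribed from the table and from the environment sections on this page, which state that managed OpenShift on Google Cloud is not supported.

```python
from typing import Dict, List

# Transcribed from the table above. The dictionary and function names are
# illustrative assumptions, not an IBM API.
SUPPORT_MATRIX: Dict[str, Dict[str, bool]] = {
    "On-premises": {"managed": True, "self-managed": True},
    "IBM Cloud": {"managed": True, "self-managed": True},
    "Amazon Web Services (AWS)": {"managed": True, "self-managed": True},
    "Microsoft Azure": {"managed": True, "self-managed": True},
    # Managed OpenShift on Google Cloud is not supported.
    "Google Cloud": {"managed": False, "self-managed": True},
}


def supported_openshift_types(environment: str) -> List[str]:
    """Return the OpenShift types listed as supported for an environment."""
    try:
        row = SUPPORT_MATRIX[environment]
    except KeyError:
        raise ValueError(f"Unknown deployment environment: {environment!r}") from None
    return [kind for kind, supported in row.items() if supported]


if __name__ == "__main__":
    for env in SUPPORT_MATRIX:
        print(f"{env}: {', '.join(supported_openshift_types(env))}")
```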
For each environment, the sections that follow describe:
- How to install OpenShift on the environment
- The recommended cluster architecture
- How to install Cloud Pak for Data on the environment
- How to install services on the environment
Automated deployment options
Depending on your deployment environment, you might be able to install Cloud Pak for Data and some services from your cloud provider's marketplace.
The marketplace installation options are offered in addition to the default cpd-cli installation method. Marketplace installations have the following limitations:
- Marketplace installations support only express installations, where the Cloud Pak for Data operators are co-located with the IBM Cloud Pak® foundational services operators. If you want to run a specialized installation, you must use the cpd-cli.
- Marketplace installations do not support installation from a private container registry. If you want to use a private container registry, you must use the cpd-cli.
- Marketplace installations support only installation. If you want to upgrade an existing installation, you must use the cpd-cli.
- Marketplace installations can be used to automatically install a subset of Cloud Pak for Data services. However, the services that are available depend on the cloud deployment environment. If you want to install additional services, you must use the cpd-cli.
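The limitations above amount to a simple decision rule: if any one of them applies, the cpd-cli is required. The following sketch illustrates that rule under stated assumptions. The `InstallPlan` class, its field names, and both functions are hypothetical and exist only for illustration; they do not correspond to any cpd-cli or marketplace interface. The sketch also does not check whether a marketplace option exists for your environment.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class InstallPlan:
    """Hypothetical summary of what a planned installation requires."""
    specialized_installation: bool = False
    private_container_registry: bool = False
    upgrade_existing_installation: bool = False
    services_beyond_marketplace_subset: bool = False


def reasons_to_use_cpd_cli(plan: InstallPlan) -> List[str]:
    """List each marketplace limitation that the plan runs into."""
    reasons = []
    if plan.specialized_installation:
        reasons.append("Marketplace installations support only express installations.")
    if plan.private_container_registry:
        reasons.append("Marketplace installations cannot use a private container registry.")
    if plan.upgrade_existing_installation:
        reasons.append("Marketplace installations cannot upgrade an existing installation.")
    if plan.services_beyond_marketplace_subset:
        reasons.append("Marketplace installations install only a subset of services.")
    return reasons


def can_use_marketplace(plan: InstallPlan) -> bool:
    """True when none of the documented limitations applies to the plan."""
    return not reasons_to_use_cpd_cli(plan)


if __name__ == "__main__":
    plan = InstallPlan(private_container_registry=True)
    if can_use_marketplace(plan):
        print("A marketplace installation might be an option.")
    else:
        print("Use the cpd-cli:")
        for reason in reasons_to_use_cpd_cli(plan):
            print(f"  - {reason}")
```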
On-premises infrastructure
If you want to ensure that your environment is running securely behind your firewall, or you have an existing on-premises Red Hat OpenShift Container Platform cluster, you can deploy Cloud Pak for Data on your on-premises infrastructure.
- Installing Red Hat OpenShift Container Platform
- The method for installing Red Hat OpenShift Container Platform depends on whether you want a managed or self-managed OpenShift cluster:
OpenShift type | Deployment options |
---|---|
Managed | You can deploy a managed OpenShift cluster on your on-premises infrastructure with IBM Cloud Satellite. For details, see IBM Cloud Satellite in the IBM Cloud marketplace. |
Self-managed | You can deploy a self-managed OpenShift cluster on your on-premises infrastructure by following the Red Hat OpenShift Container Platform documentation. Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation. |
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster.
- Recommended cluster architecture
- Cloud Pak for Data is deployed on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of concept environments, it is strongly recommended that you deploy your production environment on a larger, highly available cluster with multiple dedicated master and worker nodes. This configuration provides better performance, better cluster stability, and increased ease of scaling the cluster to support workload growth. The specific requirements for a production-level cluster are identified in System requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
Deployment environment | Managed OpenShift | Self-managed OpenShift |
---|---|---|
On-premises | IBM Cloud Satellite supports the following storage options: OpenShift Data Foundation, Portworx | The following storage options are supported on bare metal and VMware infrastructure: OpenShift Data Foundation, IBM Spectrum® Fusion, IBM Spectrum Scale Container Native, Portworx, NFS |
- Prerequisites
- Ensure that you review the following information:
Topic | What to look for |
---|---|
Hardware requirements | Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. |
Software requirements | Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. |
Storage requirements | The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment. |
- Installing Cloud Pak for Data and services
- See Installing IBM Cloud Pak for Data.
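The cluster architecture guidance in this section (a 3-node cluster is acceptable for development or proof of concept, while production calls for a larger cluster with multiple dedicated master and worker nodes) can be sketched as a simple pre-flight check. This is only an illustration: the function name, the `purpose` labels, and the numeric reading of "larger" and "multiple" are assumptions, and the authoritative node counts and sizes are the ones in System requirements.

```python
from typing import List


def cluster_warnings(purpose: str, master_nodes: int, worker_nodes: int) -> List[str]:
    """Return warnings for a planned cluster layout.

    Assumptions (not from the product documentation):
    - purpose is "production" or any other label for non-production use.
    - "larger" is read as more than 3 nodes in total.
    - "multiple" is read as at least 2 dedicated nodes of each kind.
    """
    warnings = []
    if purpose != "production":
        # The guidance only constrains production environments.
        return warnings
    if master_nodes + worker_nodes <= 3:
        warnings.append("Production clusters should be larger than 3 nodes.")
    if master_nodes < 2:
        warnings.append("Production clusters should have multiple dedicated master nodes.")
    if worker_nodes < 2:
        warnings.append("Production clusters should have multiple dedicated worker nodes.")
    return warnings


if __name__ == "__main__":
    for message in cluster_warnings("production", master_nodes=3, worker_nodes=0):
        print(f"WARNING: {message}")
```

A check like this only flags layouts that contradict the general guidance. It does not replace sizing the cluster against the hardware requirements of the services that you plan to install.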
IBM Cloud infrastructure
If you already use IBM Cloud to run business-critical applications, or if you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on IBM Cloud.
- Installing Red Hat OpenShift Container Platform
- The method for installing Red Hat OpenShift Container Platform depends on whether you want a managed or self-managed OpenShift cluster:
OpenShift type | Deployment options |
---|---|
Managed | You can deploy a managed OpenShift cluster on IBM Cloud infrastructure from the IBM Cloud catalog. For details, see Red Hat OpenShift on IBM Cloud® in the IBM Cloud catalog. Ensure that you select a supported version of Red Hat OpenShift. You can deploy Cloud Pak for Data on Virtual Private Cloud (VPC) infrastructure. You can use either a single-zone or multi-zone deployment. |
Self-managed | You can deploy a self-managed OpenShift cluster on Virtual Private Cloud (VPC) IBM Cloud infrastructure by following the Red Hat OpenShift Container Platform documentation. Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation. |
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster.
- Recommended cluster architecture
- Cloud Pak for Data is deployed on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of concept environments, it is strongly recommended that you deploy your production environment on a larger, highly available cluster with multiple dedicated master and worker nodes. This configuration provides better performance, better cluster stability, and increased ease of scaling the cluster to support workload growth. The specific requirements for a production-level cluster are identified in System requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
Deployment environment | Managed OpenShift | Self-managed OpenShift |
---|---|---|
IBM Cloud | Red Hat OpenShift on IBM Cloud supports the following storage options on VPC infrastructure: Portworx | The following storage options are supported on VPC IBM Cloud infrastructure: Portworx, NFS |
- Prerequisites
- Ensure that you review the following information:
Topic | What to look for |
---|---|
Hardware requirements | Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. |
Software requirements | Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. |
Storage requirements | The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment. |
- Installing Cloud Pak for Data and services
- Manual installation options
- See Installing IBM Cloud Pak for Data.
- Automated installation options
- Remember: The automated installation options have several limitations. For details, see Automated deployment options.
You can install Cloud Pak for Data from the IBM Cloud catalog.
Amazon Web Services (AWS) infrastructure
If you already use AWS and you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on AWS.
- Installing Red Hat OpenShift Container Platform
- The method for installing Red Hat OpenShift Container Platform depends on whether you want a managed or self-managed OpenShift cluster:
OpenShift type | Deployment options |
---|---|
Managed | ROSA CLI: You can use the ROSA CLI to manually install ROSA. For details, see Red Hat OpenShift Service on AWS (ROSA) in the Red Hat OpenShift Container Platform documentation. IBM Cloud Satellite: You can use IBM Cloud Satellite in the IBM Cloud marketplace to install Red Hat OpenShift Container Platform on AWS. |
Self-managed | You can deploy a self-managed OpenShift cluster on AWS infrastructure by following the Red Hat OpenShift Container Platform documentation. Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation. |
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster.
- Recommended cluster architecture
- Cloud Pak for Data is deployed on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of concept environments, it is strongly recommended that you deploy your production environment on a larger, highly available cluster with multiple dedicated master and worker nodes. This configuration provides better performance, better cluster stability, and increased ease of scaling the cluster to support workload growth. The specific requirements for a production-level cluster are identified in System requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
Deployment environment | Managed OpenShift | Self-managed OpenShift |
---|---|---|
Amazon Web Services (AWS) | Red Hat OpenShift Service on AWS (ROSA) supports the following storage options: Amazon Elastic Block Store (EBS), Amazon Elastic File System (EFS) | The following storage options are supported on AWS infrastructure: OpenShift Data Foundation, Amazon Elastic Block Store (EBS), Amazon Elastic File System (EFS), Portworx, NFS |
- Prerequisites
- Ensure that you review the following information:
Topic | What to look for |
---|---|
Hardware requirements | Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. |
Software requirements | Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. |
Storage requirements | The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment. |
- Installing Cloud Pak for Data and services
- Manual installation options
- See Installing IBM Cloud Pak for Data.
- Automated installation options
- Remember: The automated installation options have several limitations. For details, see Automated deployment options.
You can install Cloud Pak for Data from the AWS marketplace.
Microsoft Azure infrastructure
If you already use Microsoft Azure and you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on Azure.
- Installing Red Hat OpenShift Container Platform
OpenShift type | Deployment options |
---|---|
Managed | You can use the az aro CLI to manually install Azure Red Hat OpenShift 4 (ARO). For details, see Azure Red Hat OpenShift 4 in the Red Hat OpenShift Container Platform documentation. |
Self-managed | You can deploy a self-managed OpenShift cluster on Azure infrastructure by following the Red Hat OpenShift Container Platform documentation. Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation. |
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster.
- Recommended cluster architecture
- Cloud Pak for Data is deployed on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of concept environments, it is strongly recommended that you deploy your production environment on a larger, highly available cluster with multiple dedicated master and worker nodes. This configuration provides better performance, better cluster stability, and increased ease of scaling the cluster to support workload growth. The specific requirements for a production-level cluster are identified in System requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
Deployment environment | Managed OpenShift | Self-managed OpenShift |
---|---|---|
Microsoft Azure | Azure Red Hat OpenShift (ARO) supports the following storage options: OpenShift Data Foundation, NFS | The following storage options are supported on Microsoft Azure infrastructure: OpenShift Data Foundation, Portworx, NFS (specifically Microsoft Azure locally redundant Premium SSD storage) |
- Prerequisites
- Ensure that you review the following information:
Topic | What to look for |
---|---|
Hardware requirements | Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. |
Software requirements | Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. |
Storage requirements | The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment. |
- Installing Cloud Pak for Data and services
- Manual installation options
- See Installing IBM Cloud Pak for Data.
- Automated installation options
- Remember: The automated installation options have several limitations. For details, see Automated deployment options.
You can install Cloud Pak for Data from the Azure Marketplace.
Google Cloud infrastructure
If you already use Google Cloud and you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on Google Cloud.
- Installing Red Hat OpenShift Container Platform
OpenShift type | Deployment options |
---|---|
Managed | Managed OpenShift on Google Cloud infrastructure is not supported. |
Self-managed | You can deploy a self-managed OpenShift cluster on Google Cloud infrastructure by following the Red Hat OpenShift Container Platform documentation. Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation. |
Remember: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster.
- Recommended cluster architecture
- Cloud Pak for Data is deployed on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of concept environments, it is strongly recommended that you deploy your production environment on a larger, highly available cluster with multiple dedicated master and worker nodes. This configuration provides better performance, better cluster stability, and increased ease of scaling the cluster to support workload growth. The specific requirements for a production-level cluster are identified in System requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
Deployment environment | Managed OpenShift | Self-managed OpenShift |
---|---|---|
Google Cloud | Managed OpenShift on Google Cloud is not supported. | The following storage options are supported on Google Cloud infrastructure: OpenShift Data Foundation, Portworx, NFS |
- Prerequisites
- Ensure that you review the following information:
Topic | What to look for |
---|---|
Hardware requirements | Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. |
Software requirements | Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. |
Storage requirements | The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment. |
- Installing Cloud Pak for Data and services
- Manual installation options
- See Installing IBM Cloud Pak for Data.
- Automated installation options
- No automated installation options are available on Google Cloud.
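The supported storage tables in the environment sections above can be combined into one lookup, which makes it easy to check a planned storage choice against an environment and OpenShift type. The following is a minimal sketch: the names `SUPPORTED_STORAGE` and `is_storage_supported` and the short environment keys are hypothetical, and the values are transcribed from this page only. The "managed" entries apply to the specific managed offering that each table names.

```python
from typing import Dict, FrozenSet, Tuple

EBS = "Amazon Elastic Block Store (EBS)"
EFS = "Amazon Elastic File System (EFS)"
ODF = "OpenShift Data Foundation"

# (environment, OpenShift type) -> storage options listed on this page.
# Keys and names are illustrative assumptions, not an IBM API.
SUPPORTED_STORAGE: Dict[Tuple[str, str], FrozenSet[str]] = {
    # Managed on-premises means IBM Cloud Satellite.
    ("On-premises", "managed"): frozenset({ODF, "Portworx"}),
    # Self-managed on-premises means bare metal and VMware infrastructure.
    ("On-premises", "self-managed"): frozenset({
        ODF, "IBM Spectrum Fusion", "IBM Spectrum Scale Container Native",
        "Portworx", "NFS",
    }),
    # Both IBM Cloud entries apply to VPC infrastructure.
    ("IBM Cloud", "managed"): frozenset({"Portworx"}),
    ("IBM Cloud", "self-managed"): frozenset({"Portworx", "NFS"}),
    # Managed AWS means Red Hat OpenShift Service on AWS (ROSA).
    ("AWS", "managed"): frozenset({EBS, EFS}),
    ("AWS", "self-managed"): frozenset({ODF, EBS, EFS, "Portworx", "NFS"}),
    # Managed Azure means Azure Red Hat OpenShift (ARO).
    ("Microsoft Azure", "managed"): frozenset({ODF, "NFS"}),
    # NFS here is specifically locally redundant Premium SSD storage.
    ("Microsoft Azure", "self-managed"): frozenset({ODF, "Portworx", "NFS"}),
    # Managed OpenShift on Google Cloud is not supported.
    ("Google Cloud", "managed"): frozenset(),
    ("Google Cloud", "self-managed"): frozenset({ODF, "Portworx", "NFS"}),
}


def is_storage_supported(environment: str, openshift_type: str, storage: str) -> bool:
    """Check a storage option against the tables transcribed above."""
    return storage in SUPPORTED_STORAGE.get((environment, openshift_type), frozenset())


if __name__ == "__main__":
    print(is_storage_supported("AWS", "managed", EFS))
    print(is_storage_supported("IBM Cloud", "managed", "NFS"))
```

A supported storage option is necessary but not sufficient: as the prerequisites note, the storage that you use also determines which services you can run.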
In addition to Cloud Pak for Data software, IBM offers IBM Cloud Pak for Data as a Service. IBM Cloud Pak for Data as a Service might be right for you if you already use IBM Cloud to run business-critical applications and you don't want to set up and manage your own deployment of Cloud Pak for Data. IBM Cloud Pak for Data as a Service differs from the Cloud Pak for Data software in several ways. For details, see the IBM Cloud Pak for Data as a Service documentation.