Planning for IBM Cloud
Learn about prerequisites, configurations, and other important planning information that you must consider before the deployment of an IBM Storage Scale cluster on IBM® Cloud.
- Checking the limitations.
- Preparing the IBM Cloud environment to deploy the IBM Storage Scale cluster.
- Planning the virtual network cloud (VPC) architecture.
- Planning for DNS.
- Planning for bastion.
- Creating an IBM Storage Scale custom image in advance.
- Planning for the IBM Storage Scale deployment architecture on IBM Cloud.
- Determining the deployment purpose.
- Selecting the deployment profiles of the IBM Storage Scale cluster.
- Determining your performance, scalability, data availability, and data protection.
Checking the limitations
- If the key that is used to encrypt IBM Storage Scale is deleted, data cannot be retrieved.
- Only the general purpose type of disk is supported with IBM Cloud.
Preparing the IBM Cloud environment to deploy the IBM Storage Scale cluster
Complete the following steps to prepare the IBM Cloud environment.
- If you do not already have one, create an IBM Cloud account.
- Create the necessary IAM API keys that are required for cloudkit to make programmatic requests to IBM Cloud. For more information, see Managing user API keys in the IBM Cloud documentation.
- Configure the required permission to the IAM API key by following the procedure from Configuring permissions to use cloudkit on public clouds.
- In the preferred region, create a key pair.
- Prepare the installer node where cloudkit is meant to run. For more information, see Preparing the installer node
Planning the virtual network cloud (VPC) architecture
When resources are being deployed in the cloud, cloudkit can either create a new virtual private cloud (VPC) and provision the resources into it, or use a previously created VPC. When cloudkit creates a new VPC, cloudkit designs your network infrastructure from scratch; it chooses the subnets, network address ranges, and security groups that best suit the IBM Storage Scale deployment.
Cloudkit create networkcommand, which creates only the VPC.Cloudkit create clustercommand, which creates the cluster and its VPC.
? VPC Mode: [Use arrows to move, type to filter, ? for more help]
> New
ExistingIf the IBM Storage Scale cluster is to be deployed in an existing VPC, the following requirements must be met:
- Dynamic Host Configuration Protocol (DHCP) options set or DNS configured.
- In the availability zone, subnets with allocatable IP address; the minimum is one private subnet per availability zone.
- A public subnet (a subnet with internet gateway attached) is only required
if either the following cases are met:
- Where the cluster is planned to be accessed through a jump host. For more information, see Other considerations.
- If it is necessary to precreate an IBM Storage Scale image. For more information, see Precreating an IBM Storage Scale AMI.
- A network address translation (NAT) gateway that is attached to the private subnet is only required when the IBM Storage Scale instances need access to the internet (for operating system updates and security patches, for example).
Planning for DNS
Use cloudkit create dns command to create a DNS domain. During the DNS creation
process, you must create a service instance.
? Do you wish to use an existing DNS zone: (y/N)cloudkit create cluster command.cloudkit create cluster
command:E:
No existing DNS zones found. In order to configure, use 'cloudkit create
dns'If no existing zones are found, this command fails and displays an error message.
Planning for bastion or jump host
Use cloudkit create jumphost command to create a jump host or bastion in the
public subnet.
cloudkit create cluster
command:? Bastion OS: ibm-redhat-9-6-minimal-amd64-6 | r006-2bc93792-b76e-45ca-aa8a-d7f44029aae4 | x86_64 | vpcuser
? Bastion instance type: bx2-2x8 | 2 vCPU | 8 GB RAM | Up to 10 Gbps bandwidth
? Bastion/Jumphost SSH private key file path (will be used only for configuration): <private_key_path>
? JumpHost CIDR allow list: xxx.xxx.xxx.xxx/0? Bastion instance type: [Use arrows to move, type to filter]
> bx2-2x8 | 2 vCPU | 8 GB RAM | Up to 10 Gbps bandwidth
bx2-4x16 | 4 vCPU | 16 GB RAM | Up to 10 Gbps bandwidth
bx2-8x32 | 8 vCPU | 32 GB RAM | Up to 16 Gbps bandwidthCreating an IBM Storage Scale custom image in advance
With cloudkit, you can either automatically create a stock image or provide a custom image.
- Automatically created image by using
cloudkit create clustercommand -
Red Hat Enterprise Linux® (RHEL) 8.10 or 9.6 is required for cloudkit to automatically create a stock image. When you select this option, cloudkit automatically performs all required actions to create an IBM Storage Scale VM image and create the IBM Storage Scale cluster.
- New or existing custom image by using
cloudkit create imagecommand -
A custom image can be created by using
cloudkit create imagecommand. This command supports two options to create a custom image: it accepts inputs in the form of an existing image and it provides the ability to create an image from a RHEL image.An existing custom image consists of the following components:- A RHEL version that is supported by IBM Storage Scale.
- Optionally, for any customer applications that are already preinstalled,
cloudkit create imagecommand also installs all required IBM Storage Scale packages on top of the existing image.
Planning for the IBM Storage Scale deployment architecture on IBM Cloud
Before the IBM Storage Scale cluster can be created on the cloud, plan the deployment architecture.
Cloudkit offers the following deployment models for IBM Storage Scale.
- Combined-compute-storage
-
A unified IBM Storage Scale cluster with both storage and compute nodes. It is recommended that any customer workload runs only on the compute nodes.
- Compute-only
-
An IBM Storage Scale cluster that only consists of compute nodes. This cluster does not have any local file system and therefore must remotely mount the file system from an IBM Storage Scale storage cluster.
- Storage-only
-
This IBM Storage Scale cluster consists of only storage nodes, which have access to storage where the file system is created. This file system is remotely mounted to any number of compute-only IBM Storage Scale clusters.
cloudkit create cluster command, the following prompt asks you to
select from among the different deployment models that are
offered:? IBM Storage Scale deployment model: [Use arrows to move, type to filter, ? for more help]
> Storage-only
Compute-only
Combined-compute-storageDetermining the deployment purpose
The deployment purpose is applicable only to storage clusters.
- Non-Production
-
This profile lists instance types that are experimental, which are best suited for proof of concepts. A non-production deployment may require tuning to run scale-suited workloads.
- Production
-
This profile lists instance types that are well suited for scale workloads. Optimal tuning parameters are calculated and set during the deployment.
? Deployment purpose: [Use arrows to move, type to filter]
Non-Production
> ProductionSelecting the deployment profiles for an IBM Storage Scale cluster
Cloudkit offers the following deployment profiles.
- Throughput-Performance-Persistent-Storage
-
This profile uses a single availability zone with persistent storage, which means that the file system device retains data after the instance is stopped. In this mode, cloudkit calculates the number of storage nodes based on the provided file system capacity. This mode uses
general-purposeas disk type.Important: When you choose this profile, the file system is configured in such manner that the data is not replicated, only the metadata gets replicated across the instances. - Throughput-Performance-Scratch-Storage
-
This profile uses a single availability zone and a placement group with
clusterpolicy, which means that it packs instances close together inside an availability zone. This strategy enables workloads to achieve the low-latency network performance that is necessary for tightly coupled node-to-node communication that is typical of high-performance computing (HPC) applications, with instance or temporary storage; meaning that the file system device loses data after the instance is stopped.In this mode, the number of storage instances is limited to 10, compute instances are limited to 65.
Important: This profile uses local NVMe SSD or instance storage, which offers high performance and low latency for data-intensive workloads. However, the data that is stored in this mode is volatile and can be lost if the instance is stopped or terminated. Therefore, it is recommended to take frequent backups.This profile must not be used for long-term storage or if the data is not backed up elsewhere.
- Throughput-Advance-Persistent-Storage
-
This profile uses a single availability zone with persistent storage, which means that the file system device retains data after the instance is stopped. This mode is meant for storage capacity rather performance; the number of storage instances is limited to 64, compute instances is limited to 65.
Important: When you choose this profile, the file system is configured in such manner that the data is not replicated, only the metadata gets replicated across the instances. - Balanced
-
This profile uses multiple (3) availability zones to deploy the IBM Storage Scale cluster into. In this mode, the number of storage instances is limited to 64.
Important: The IBM Storage Scale file system is configured in such a way that data and metadata get replicated across availability zones.? Tuning profile: [Use arrows to move, type to filter, ? for more help] > Throughput-Performance-Persistent-Storage Throughput-Advance-Persistent-Storage Balanced Throughput-Performance-Scratch-Storage
Determining performance, scalability, data availability, and data protection
Before you deploy the IBM Storage Scale cluster, it is important to understand your requirements in terms of performance, scalability, data availability, and data protection. These criteria determine what IBM Cloud instance types to use for the storage nodes and compute nodes, also what elastic block store types should be used.
Cloudkit provides the following choices for VM instances types.
- For IBM Storage Scale compute nodes, all the VM instance types are supported, according to their availability per region.
- For IBM Storage Scale storage nodes, the support depends on
the profile, as described in the following list.
-
For the
Throughput-Performance-Persistent-Storage,Throughput-Advance-Persistent-Storage, andBalancedprofiles, the next VM instance types are supported.- cx2-16x32 | 16 vCPU | 32 GiB RAM | Max network bandwidth 32 Gbps - cx2-32x64 | 32 vCPU | 64 GiB RAM | Max network bandwidth 64 Gbps - cx2-48x96 | 48 vCPU | 96 GiB RAM | Max network bandwidth 80 Gbps - cx2-64x128 | 64 vCPU | 128 GiB RAM | Max network bandwidth 80 Gbps - cx2-96x192 | 96 vCPU | 192 GiB RAM | Max network bandwidth 80 Gbps -
For the
Throughput-Performance-Scratch-Storageprofile, the next VM instance types are supported.- ox2-8x64 | 8 vCPU | 64 GiB RAM | Instance Storage 2 x 1200 GB local SSD | Network Bandwidth Up to 16 Gbps - ox2-16x128 | 16 vCPU | 128 GiB RAM | Instance Storage 2 x 2400 GB local SSD | Network Bandwidth Up to 32 Gbps - ox2-32x256 | 32 vCPU | 256 GiB RAM | Instance Storage 3 x 3200 GB local SSD | Network Bandwidth Up to 64 Gbps - ox2-64x512 | 64 vCPU | 512 GiB RAM | Instance Storage 6 x 3200 GB local SSD | Network Bandwidth Up to 80 Gbps - ox2-96x768 | 96 vCPU | 768 GiB RAM | Instance Storage 9 x 3200 GB local SSD | Network Bandwidth Up to 80 Gbps
-