IBM Spectrum Scale cluster configurations

An IBM Spectrum Scale™ cluster can be configured in a variety of ways. The cluster can be a heterogeneous mix of hardware platforms and operating systems.

IBM Spectrum Scale clusters can contain a mix of all supported node types including Linux, AIX®, and Windows Server and these operating systems can run on various hardware platforms such as IBM® POWER®, x86-based servers, and IBM Z. These nodes can all be attached to a common set of SAN storage or through a mix of SAN and network attached nodes. Nodes can all be in a single cluster, or data can be shared across multiple clusters. A cluster can be contained in a single data center or spread across geographical locations. To determine which cluster configuration is best for your application start by determining the following:
  • Application I/O performance and reliability requirements
  • Properties of the underlying storage hardware
  • Administration, security, and ownership considerations

Understanding these requirements helps you determine which nodes require direct access to the disks and which nodes should access the disks over a network connection through an NSD server.

There are four basic IBM Spectrum Scale Configurations:
  • All nodes attached to a common set of LUNS
  • Some nodes are NSD clients
  • A cluster is spread across multiple sites
  • Data is shared between clusters

All nodes attached to a common set of LUNS

In this type of configuration, all of the nodes in the cluster are connected to a common set of LUNS (for example, over a SAN). The following are areas to consider with this configuration:
  • The maximum number of nodes accessing a LUN you want to support
  • The fact that you cannot mix different operating systems with IBM Spectrum Scale to directly access the same set of LUNs on SAN.
For an example, see Figure 1.
Figure 1. A cluster with disks that are SAN-attached to all nodes
This figure shows a cluster with four nodes that have direct SAN attachment to a common set of disks.

Some nodes are NSD clients

In this type of configuration, only some nodes are connected to disks. Other nodes access the disks using the NSD path.

For an example, see Figure 2.
Figure 2. A cluster with some nodes connected to disks
This figure shows a cluster with some nodes connected to disks.

IBM Spectrum Scale Servers and clients

You can configure an IBM Spectrum Scale cluster in which some nodes have a direct attachment to the disks and others access the disks through other IBM Spectrum Scale nodes. This configuration is often used in large clusters or to provide a cost-effective, high-performance solution.

When an IBM Spectrum Scale node is providing access to a disk for another IBM Spectrum Scale node, the node that provides access is called an NSD Server. The node that accesses the data through an NSD server is called an IBM Spectrum Scale client.

Sharing data across multiple IBM Spectrum Scale clusters

IBM Spectrum Scale allows you to share data across multiple IBM Spectrum Scale clusters. After a file system is mounted in another IBM Spectrum Scale cluster, all access to the data is the same as if you were in the host cluster. You can connect multiple clusters within the same data center or across long distances over a WAN. In a multicluster configuration, each cluster can be placed in a separate administrative group simplifying administration or provide a common view of data across multiple organizations.
Figure 3. A multicluster configuration
This figure shows a multicluster configuration.
Note: For more information, see Accessing a remote GPFS file system.