Model 2: Remote mount with limited Hadoop nodes as IBM Storage Scale nodes

Use this model if you are using IBM® Elastic Storage Server and you have a very large Hadoop cluster, typically more than 1000 Hadoop nodes.

This is illustrated by the following figure:
Figure 1. Remote mount with limited Hadoop nodes as IBM Storage Scale nodes

This deployment model is intended for Hadoop clusters with a large number of nodes (for example, more than 1000 nodes). Creating a large IBM Storage Scale cluster requires careful planning and places increased demands on the network. The deployment model in Figure 1 limits the IBM Storage Scale deployment to just the nodes that run the HDFS Transparency service, rather than the entire Hadoop cluster. Data traffic flows from the Hadoop nodes over network RPC to the HDFS Transparency nodes (which are also IBM Storage Scale clients), then over network RPC to the IBM Storage Scale NSD servers, and finally to SAN storage. In this model, short-circuit read/write configuration does not improve data read performance.
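The data path described above can be written out as an ordered list of stages. The following Python sketch is illustrative only: the stage names are taken from the text, but the `DATA_PATH` structure and the `network_hops` and `describe` helpers are hypothetical and are not part of IBM Storage Scale or HDFS Transparency.

```python
# Illustrative model of the data path in this deployment model.
# Stage names come from the description above; the data structure
# and helper functions are hypothetical, for explanation only.
DATA_PATH = [
    ("Hadoop node", "compute"),
    ("network RPC", "network"),
    ("HDFS Transparency node / IBM Storage Scale client", "compute"),
    ("network RPC", "network"),
    ("IBM Storage Scale NSD server", "storage server"),
    ("SAN storage", "storage"),
]


def network_hops(path):
    """Count the stages in the path that cross the network."""
    return sum(1 for _, kind in path if kind == "network")


def describe(path):
    """Render the path as a single arrow-separated string."""
    return " -> ".join(name for name, _ in path)


if __name__ == "__main__":
    print(describe(DATA_PATH))
    print("network hops:", network_hops(DATA_PATH))
```

Listing the stages this way shows that every request crosses two network RPC hops before it reaches an NSD server, so the network is likely to dominate the planning for this model.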