Installing
This section will describe the steps to install the HDFS Transparency nodes (NameNode and DataNodes) as GPFS client nodes to be added to the centralized storage system to create a single GPFS cluster. All other Hadoop nodes (master and clients) are to be set up outside of the GPFS cluster.
Before you proceed, see the following sections:
Note: For Cloudera® HDP distribution, CES HDFS is not supported.
Note:
- The centralized storage file system needs to be available before setting up the CES HDFS protocol nodes.
- Required to create the CES shared root (cesSharedRoot) file system.
- Do not follow steps that deploy NSDs on the HDFS Transparency nodes because centralized storage mode is the only one supported currently.
- FPO is not supported.
- HDFS Transparency does not require to have the Hadoop distribution installed onto the IBM Storage Scale HDFS Transparency nodes. However, if the HDFS client is not installed on the CES HDFS NameNodes and DataNodes, then functions like distcp will not work because HDFS Transparency does not include the bin/hadoop command.
- When adding HDFS protocol into CES, the other protocols (NFS, SMB, Object) and GUI and performance monitor can be configured and deployed at the same time.
- SMB requires
NFSv4 ACL
permission while HDFS requiresALL ACL
permission. Therefore, a warning will be seen if HDFS protocol is added to the protocol node and the ACL is not correct after the install toolkit deployment. The ACL should always be set toALL
if the HDFS protocol is used after deployment of the protocols.