subscribe iconSubscribe to this information
POWER6 information

Checking for fabric configuration and functional problems

Use this information to learn how to check for fabric configuration and functional problems.

To check for fabric configuration and functional problems, perform the following procedure.
On the fabric management server run the all_analysis fast fabric health check command. For details, see Health checking. To diagnose symptoms reported by health check see Table 3.
Note: The health check is most effective for checking for configuration problems if a baseline health check has been performed and is stored in the /var/opt/iba/analysis/baseline directory on the fabric management server. Otherwise changes in configuration cannot be sensed.
If there is no baseline health check for comparison, you need to perform the same type of configuration checks that were done during installation. For details, see Installing and configuring the InfiniBand switch. For the host-based subnet managers, also use the Installing the fabric management server topic. You need to check that the following configuration parameters match the installation plan. A reference or setting for IBM® System p® and IBM Power Systems™ HPC Clusters is provided for each parameter that you check.
Table 1. Health check parameters
Parameter Reference or setting
GID prefix The GID prefix must be different for each subnet. For details, see Planning for global identifier prefixes.
LMC Must be 2 for IBM HPC Clusters.
Maximum transfer unit (MTU) For details, see Planning for maximum transfer units (MTUs). This parameter is the fabric MTU and not the MTU in the stack, which can be a much greater number.
Cabling plan See the vendor Switch Users Guide and Planning and Installation Guide
Balanced Topology It is typically best to ensure that you have distributed the HCA ports from the servers in a consistent manner across subnets. For example, all corresponding ports on HCAs within servers must connect to the same subnet; similar to, all port 1 on HCA 1 must connect to subnet 1, and all port 2 on HCA 1 must connect to port 2.
Full bandwidth topology? Did you choose to implement a full-bandwidth topology by using the vendor recommendations found in the vendor Switch Users Guide and Planning and Installation Guide?

Send feedback | Rate this page

Last updated: Tue, February 08, 2011