We have a CNFS setup where the FS are exported over two different networks (IPoIB and Ethernet) by two server. The setup works OK but when one (and only one) of the networks is brought down (by pulling a cable), the virtual IP addresses fails over (which is good) but it tries to bring them up again two minutess later, then it fails again and repeat.
If both network cables are pulled it seems to stay down.
Is this expected? Is there a way to (automatically) disable the node if only one network fails?
This is GPFS 18.104.22.168 on ScientificLinux 6.4 (CNFS servers).