NSD and underlying disk subsystem failures
There are indications that will lead you to the conclusion that your file system has disk failures.
Some of those indications include:
- Your file system has been forced to unmount. For more information about forced file system unmount, see File system forced unmount.
- The mmlsmount command indicates that the file system is not mounted on certain nodes.
- Your application is getting EIO errors.
- Operating system error logs indicate you have stopped using a disk in a replicated system, but your replication continues to operate.
- The mmlsdisk command shows that disks are down.
Note: If you are reinstalling the operating system on one node
and erasing all partitions from the system, GPFS descriptors will be removed from any NSD
this node can access locally. The results of this action might require
recreating the file system and restoring from backup. If you experience
this problem, do not unmount the file system on any node that is currently
mounting the file system. Contact the IBM® Support
Center immediately to see if the problem can be corrected.