When a storage controller or disk fails in a replicated
file system, the system routes access to the other replica. No administrator
intervention is required and there is no impact to accessing the file
systems. After the problem is resolved and the disks become available
again, they need to be started manually.
Before you begin
Important: To run the db2cm utility as the root user, ensure the DB2INSTANCE environment
variable is set to the instance owner.
- To perform this task, you must be a Db2® cluster services
administrator.
Procedure
- To determine if a file system has any disks in down state,
use the following command:
db2cluster -cfs -list -filesystem <fs name>
-
If there are any disks in Down state under the STATE column,
restart them by using the following command:
db2cm -cfs -start -filesystem <fs name> -disk
On
AIX:
db2cluster -cfs -start -filesystem <fs name> -disk
This
does not trigger the replication of data that were added while the other storage controller was
down. When this command returns, new data starts replicating on both storage.
The disk
startup operation involves auto repair of file system metadata and user files if applicable. The
latter, if required, will result in access restriction to the impacted files and block on-going
transactions that require access to them. The duration depends on the extent to which the repair is
required. Therefore, it is recommended to run this command at off-peak usage period.
-
Verify all disks in the file system have been successfully started and the states are
Up by running the following command:
db2cm -cfs -list -filesystem <fs name>
On AIX:
db2cluster -cfs -list -filesystem <fs name>
The above command
will re-validate the configuration and status of the target file system. As part of the
re-validation, existing alerts will be removed if condition has been resolved. New alerts will be
raised if applicable.
- Run the following command to replicate data added to the system during the storage
downtime. This is an I/O intensive operation and is recommended to run at off-peak usage
period.
db2cm -cfs -replicate -filesystem <fs name>
On
AIX:
db2cluster -cfs -replicate -filesystem <fs name>
- Finally, a rebalance of the file system might be needed, run db2instance
-list to check for any alerts. Use db2cm -list -alerts
(db2cluster -list -alerts on AIX) to get the details of the alert. If an
unbalanced alert is raised, run the following command to rebalance the file system.
This is an I/O intensive operation and is recommended to run at off-peak usage period.
db2cm -cfs -rebalance -filesystem <fs name>
On AIX:
db2cluster -cfs -rebalance -filesystem <fs name>