Pdisk-group fault tolerance: an example
Every data stripe (including user data and system configuration data) within the IBM Storage Scale RAID system is protected through a distinct form of redundancy. Each of these data stripes has a set of disks within which they constrain their strip placement. Each stripe of the data (for which there are many stripes in each whole) has individual strips that serve in the redundancy code protection of the object's data. The placement of these strips is distributed across a set of pdisks residing within a set of drawers. These drawers reside within a set of enclosures.
By segregating each individual strip across as wide a set of disk groups as possible, IBM Storage Scale RAID ensures that the loss of any set of disk groups up to fault tolerance of the RAID redundancy code is survivable. So in the give example the pdisk-group fault-tolerance is one enclosure because RAID code can survive two strips.
After the failure, the GNR software tries to rebuild the DA subject to availability of space such that maximum fault tolerance can be achieved.
If you want enclosure failure on four enclosure system, you would need to set the RAID code to 8+3p.
Limiting factor of pdisk-group fault-tolerance
mmvdisk recoverygroup list --recovery-group <RgName> –all
or
mmvdisk recoverygroup list --recovery-group <RgName> -–fault-tolerance
mmlsrecoverygroup <RgName> -L
configuration data disk group fault tolerance remarks
------------------ --------------------------------- -------
rg descriptor 2 enclosure limiting fault tolerance
system index 2 enclosure limited by rg descriptor
vdisk RAID code disk group fault tolerance remarks
------------------ --------------- --------------------------------- -------
RG001LOGHOME 4WayReplication 2 enclosure limited by rg descriptor
RG001LOGTIP 2WayReplication 1 pdisk
RG001LOGTIPBACKUP Unreplicated 0 pdisk
RG001VS001 8+2p 1 enclosure
RG001VS002 8+2p 1 enclosure
configuration data disk group fault tolerance remarks
------------------ --------------------------------- -------
rg descriptor 4 pdisk limiting fault tolerance
system index 4 pdisk limited by rg descriptor
vdisk RAID code disk group fault tolerance remarks
------------------ --------------- --------------------------------- -------
RG001LOGHOME 4WayReplication 3 pdisk limited by rg descriptor
RG001LOGTIP 2WayReplication 1 pdisk
RG001LOGTIPBACKUP Unreplicated 0 pdisk
RG001VS001 8+2p 2 pdisk
RG001VS004 3WayReplication 2 pdisk limited by rg descriptor