Storwize V7000 node canisters may shut down or reboot during normal operation, leading to a loss of host I/O access.
Stowize V7000 node canisters running V188.8.131.52 - V184.108.40.206 code levels may shut down without warning during normal I/O operations.
These shut down events will typically occur on both node canisters in the Storwize V7000 system, with the second node canister shutting down a number of hours after the first. Once the second node canister has shut down, this will cause a loss of host access to disks presented by the Storwize V7000, until at least one of the node canisters has been manually brought back online.
If this issue is encountered on V220.127.116.11 - V18.104.22.168, the recovery action is to reseat each offline node canister in order to bring it back online.
Partial Fix Introduced in V22.214.171.124
A partial fix was introduced in V126.96.36.199, which caused node canisters that experienced this condition to reboot and automatically resume I/O operations, rather than shut down and remain offline. Customers running V188.8.131.52 code are however still exposed to the risk of both node canisters rebooting at the same time, which could lead to a short, temporary outage to host I/O.
This issue has been fully resolved by APAR IC74088 in the V184.108.40.206 release. Please visit the following URL to download the latest V6.1.0.x code:
17 June 2018