IBM Support

Nodes in Metro or Global Mirror Inter-cluster Partnerships May Reboot if the Inter-cluster Link Becomes Overloaded

Flashes (Alerts)


Abstract


SVC, Storwize V7000 or Storwize V7000 Unified nodes in Metro or Global Mirror inter-cluster partnerships may experience lease expiry reboot events if an inter-cluster link to a partner system becomes overloaded. These reboot events may occur on all nodes simultaneously, leading to a temporary loss of host access to Volumes.

Content

If an inter-cluster link becomes severely and abruptly overloaded, the local fibre channel fabric can become so congested that no fibre channel ports on the local SVC, Storwize V7000 or Storwize V7000 Unified nodes are able to perform local intra-cluster heartbeat communication. This may result in the nodes experiencing lease expiry events, in which a node reboots in order to attempt to re-establish communication with the other nodes in the system. If all nodes experience lease expiry simultaneously, this may lead to a loss of host access to Volumes for the duration of the reboot events.

Workaround

The default zoning recommendation for inter-cluster Metro and Global Mirror partnerships has been revised to ensure that, if link-induced congestion occurs, only two of the four fibre channel ports on each node can be affected by it. The remaining two ports on each node remain unaffected and can therefore continue performing intra-cluster heartbeat communication without interruption.



The revised zoning recommendation is as follows:

  • For each node in a clustered system, exactly two fibre channel ports should be zoned to exactly two fibre channel ports from each node in the partner system.
  • If dual-redundant ISLs are available, then the two ports from each node should be split evenly between the two ISLs, i.e. exactly one port from each node should be zoned across each ISL.
  • Local system zoning should continue to follow the standard requirement for all ports on all nodes in a clustered system to be zoned to one another.

This implies that for each system, there will be two ports on each SVC node that have no remote zones, only local zones.
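
The revised zoning recommendation can be sketched as a small planning check. This is an illustrative sketch only, not an IBM tool or a switch configuration: the node names, the ISL labels, and the choice of ports 3 and 4 as the remote-zoned ports are all assumptions made for the example, and the check covers the local system's side of the partnership only.

```python
from itertools import product

PORTS_PER_NODE = 4  # four fibre channel ports per node, as stated in the text

# Assumption for illustration: port 3 is zoned across one ISL, port 4 across the other.
REMOTE_PORT_BY_ISL = {"ISL-A": 3, "ISL-B": 4}


def build_remote_zones(local_nodes, partner_nodes):
    """Return remote zones as (isl, (local_node, port), (partner_node, port)) tuples."""
    zones = set()
    for isl, port in REMOTE_PORT_BY_ISL.items():
        for local, partner in product(local_nodes, partner_nodes):
            zones.add((isl, (local, port), (partner, port)))
    return zones


def local_only_ports(node, zones):
    """List the ports on `node` that appear in no remote zone."""
    remote = {port for _, (name, port), _ in zones if name == node}
    return sorted(set(range(1, PORTS_PER_NODE + 1)) - remote)


def follows_recommendation(local_nodes, partner_nodes, zones):
    """Check the local side of a remote zoning plan against the three stated counts."""
    for node in local_nodes:
        mine = [z for z in zones if z[1][0] == node]
        # Exactly two ports on this node carry remote zones.
        if len({z[1][1] for z in mine}) != 2:
            return False
        # Those ports reach exactly two ports on each partner node.
        for partner in partner_nodes:
            if len({z[2][1] for z in mine if z[2][0] == partner}) != 2:
                return False
        # Exactly one port from this node is zoned across each ISL.
        for isl in REMOTE_PORT_BY_ISL:
            if len({z[1][1] for z in mine if z[0] == isl}) != 1:
                return False
    return True
```

With two hypothetical local nodes and two hypothetical partner nodes, build_remote_zones produces a plan in which local_only_ports reports ports 1 and 2 for every node. Those are the two ports per node that carry local zones only and so remain available for intra-cluster heartbeat traffic if an inter-cluster link becomes congested.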


Please refer to the What's new in the IBM System Storage SAN Volume Controller Information Center for more information on SVC Terminology changes.

Internal Use Only

27/04/10 - Wording providers - Ronda Hruby, Steve White; Reviewed by - Greg Dalton
Future publishing date set to 12/11/2010.
Added: Please refer to the What's new in the IBM System Storage SAN Volume Controller Information Center for more information on SVC Terminology changes.

08/11/2010: SW: I removed the ref to V7000 as the symptom described by this flash is unique to SVC only.

07/06/2011: Updated 'cluster' to 'system' or 'clustered system'. Updated the Alternate References for SVC and V7000 6.2. Set Future Publishing date to 10/06/2011.

14/11/2011: Updated for 6.3. Set Future Publishing date to 18/11/2011.
Note: Unless this is modified between now and 6.4.1 GA, it will not need to go any further.
18/11/2011: Updated for V7000 and Unified.

Document Information

Modified date:
17 June 2018

UID

ssg1S1003634