APAR status
Closed as program error.
Error description
The Global Mirror (GM) session gets suspended, but the Global Copy (GC) pairs stay at Copy Pending on all device pairs. csmMessage.log shows:
[Server] IWNR6000I [May 31, 2012 9:42:00 AM] Starting all pairs in role pair H1-J2 ...
[Server] IWNR1041I [May 31, 2012 9:42:01 AM] The command start was successfully issued to all pairs under role pair H1-J2 for session GM_Discount_site3.
[Server] IWNR1950I [May 31, 2012 9:42:02 AM] Session GM_Discount_site3 changed from the Preparing state to the Prepared state.
[Server] IWNH1037E [May 31, 2012 9:43:59 AM] Device DS8000:BOX:2107.XXXXX managed by hardware connection ESSNI:ip-address::2107.XXXXX is no longer accessible.
[Server] IWNH1037E [May 31, 2012 9:43:59 AM] Device DS8000:BOX:2107.XXXXX managed by hardware connection ESSNI:ip-address::2107.XXXXX is no longer accessible.
[Server] IWNR1955E [May 31, 2012 9:43:59 AM] Tivoli Storage Productivity Center for Replication Server (server name) has encountered communication errors with storage system DS8000:BOX:2107.XXXXX.
[UNKNOWN] IWNH0007I [May 31, 2012 9:44:02 AM] The hardware device has been stopped for location ESSNI:ip-address::2107.XXXXX.
[Server] IWNH1037E [May 31, 2012 9:44:02 AM] Device DS8000:BOX:2107.XXXXX managed by hardware connection ESSNI:ip-address::2107.XXXXX is no longer accessible.
[Server] IWNH1037E [May 31, 2012 9:44:02 AM] Device DS8000:BOX:2107.XXXXX managed by hardware connection ESSNI:ip-address::2107.XXXXX is no longer accessible.
[UNKNOWN] IWNH0007I [May 31, 2012 9:44:05 AM] The hardware device has been stopped for location ESSNI:ip-address::2107.XXXXX.
[UNKNOWN] IWNR3081I [May 31, 2012 9:44:05 AM] High availability for the server (server name) has been shutdown.
[UNKNOWN] IWNR1901I [May 31, 2012 9:44:08 AM] Copy Services Manager has stopped successfully.
************ Start Display Current Environment ************
Replication Manager - build: k20111212-1654 version: 4.2.2.1
Hostname: (server name) -- (server name)/ip-address
************* End Display Current Environment *************
[Server] IWNR3080I [May 31, 2012 9:44:31 AM] High availability for the server (server name) is now running.
[Server] IWNH0001I [May 31, 2012 9:44:32 AM] The hardware device started.
[Server] IWNH1600I [May 31, 2012 9:44:32 AM] The SNMP event listener has been successfully started and is now ready for SNMP traps.
[Server] IWNH0001I [May 31, 2012 9:44:32 AM] The hardware device started.
[Server] IWNH0002I [May 31, 2012 9:44:38 AM] The hardware device started for location ESSNI:ip-address::2107.XXXXX.
[Server] IWNH0002I [May 31, 2012 9:44:43 AM] The hardware device started for location ESSNI:ip-address::2107.XXXXX.
[Server] IWNR1900I [May 31, 2012 9:44:44 AM] Copy Services Manager version 4.2.2.1 has started successfully.
[Server] IWNH1038I [May 31, 2012 9:44:46 AM] Device DS8000:BOX:2107.XXXXX managed by hardware connection ESSNI:ip-address::2107.XXXXX is now accessible.
Local fix
There is a known defect which is fixed in 4.2.2.2.
Problem summary
After a suspend command, the hardware does not show the GC pairs as suspended, but TPC-R shows each role pair as suspended.
Problem conclusion
The problem is caused by a race condition that occurs after Pause is issued to the master on the hardware. TPC-R starts a pause poller that queries the master for pause completion. Because of the flag that tells the code not to suspend the GC pairs, the pause poller ended up suspending the GC pairs after the session had already been restarted.
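The ordering problem described here, a poller finishing its work after the session has already been restarted, can be illustrated with a small sketch. This is not TPC-R code and not the actual fix. The names Session, PausePoller, epoch and the guarded switch are all hypothetical. It only shows one common way to stop a stale poller from acting: tag it with a command counter and compare that counter before it touches the pairs.

```python
from dataclasses import dataclass


@dataclass
class Session:
    """Hypothetical stand-in for a replication session (not TPC-R code)."""
    state: str = "Prepared"
    epoch: int = 0                 # incremented by every suspend/start command
    pairs_suspended: bool = False

    def suspend(self) -> "PausePoller":
        # Issue Pause and hand back a poller tagged with the current epoch.
        self.epoch += 1
        self.state = "Suspending"
        return PausePoller(self, self.epoch)

    def start(self) -> None:
        # Restart the session; pollers created before this point are stale.
        self.epoch += 1
        self.state = "Prepared"
        self.pairs_suspended = False


@dataclass
class PausePoller:
    session: Session
    epoch: int                     # epoch of the suspend that created this poller

    def on_pause_complete(self, guarded: bool) -> bool:
        """Suspend the pairs; return True if the poller actually acted."""
        if guarded and self.epoch != self.session.epoch:
            return False           # a newer command superseded this poller
        self.session.pairs_suspended = True
        return True


def run(guarded: bool) -> tuple:
    session = Session()
    poller = session.suspend()
    session.start()                # restart arrives before the poller finishes
    poller.on_pause_complete(guarded)
    return session.state, session.pairs_suspended
```

Without the guard, run(False) leaves the session in the Prepared state with its pairs marked suspended, which is the kind of inconsistency a late poller produces. With the guard, run(True) lets the stale poller exit without changing anything.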
Temporary fix
Use dscli to manually suspend the pairs.
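As a rough illustration of this workaround, the sketch below assembles the text of a DSCLI pausepprc command for a set of volume pairs. It assumes the pausepprc syntax with -dev, -remotedev and SourceVolumeID:TargetVolumeID arguments; check the DSCLI reference for the installed level before using it. The helper name build_pausepprc, the storage image IDs and the volume IDs are placeholders, not values from this APAR.

```python
def build_pausepprc(dev, remotedev, pairs):
    """Return a pausepprc command line for the given (source, target) volume pairs.

    dev and remotedev are storage image IDs; all values here are placeholders.
    """
    if not pairs:
        raise ValueError("at least one source:target volume pair is required")
    volume_pairs = " ".join(f"{source}:{target}" for source, target in pairs)
    return f"pausepprc -dev {dev} -remotedev {remotedev} {volume_pairs}"


# Placeholder IDs for illustration only.
command = build_pausepprc(
    "IBM.2107-AAAAAAA",
    "IBM.2107-BBBBBBB",
    [("0100", "0100"), ("0101", "0101")],
)
print(command)
```

The resulting line is meant to be entered at a dscli prompt that is already connected to the storage system managing the source volumes.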
Comments
APAR Information
APAR number
IC83998
Reported component name
TPC FOR REPL SE
Reported component ID
5608TRMSV
Reported release
340
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt
Submitted date
2012-06-08
Closed date
2012-06-11
Last modified date
2012-06-11
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
TPC FOR REPL SE
Fixed component ID
5608TRMSV
Applicable component levels
R420 PSY
UP
Document Information
Modified date:
11 June 2012