APAR status
Closed as program error.
Error description
After a network issue, a very long waiter is seen with reason 'RPC wait' for tmMsgBRTellAcquire1. Example waiter: waiting 351.787361072 seconds, LockByteRangeHandlerThread: on ThCond 0x113963FD8 (0x113963FD8) (MsgRecordCondvar), reason 'RPC wait' for tmMsgBRTellAcquire1 on node <IP>
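To check collected waiter output for this symptom, a small parser can help. The following is a minimal sketch that assumes waiter lines follow exactly the format quoted above. The function names and the 300-second threshold are illustrative assumptions, not part of any IBM tool.

```python
import re

# Matches the waiter format quoted in this APAR. Other releases may
# format waiters differently, so treat this pattern as an assumption.
WAITER_RE = re.compile(
    r"waiting (?P<seconds>\d+\.\d+) seconds, "
    r"(?P<thread>\w+): .*?"
    r"reason '(?P<reason>[^']+)' for (?P<message>\w+)"
)


def parse_waiter(line):
    """Return the waiter fields as a dict, or None if the line does not match."""
    match = WAITER_RE.search(line)
    if match is None:
        return None
    return {
        "seconds": float(match.group("seconds")),
        "thread": match.group("thread"),
        "reason": match.group("reason"),
        "message": match.group("message"),
    }


def is_long_rpc_waiter(line, threshold_seconds=300.0):
    """True for an 'RPC wait' waiter at or above the (assumed) threshold."""
    waiter = parse_waiter(line)
    return (
        waiter is not None
        and waiter["reason"] == "RPC wait"
        and waiter["seconds"] >= threshold_seconds
    )


# The example waiter from the error description.
SAMPLE = (
    "waiting 351.787361072 seconds, LockByteRangeHandlerThread: "
    "on ThCond 0x113963FD8 (0x113963FD8) (MsgRecordCondvar), "
    "reason 'RPC wait' for tmMsgBRTellAcquire1 on node <IP>"
)
```

Calling `parse_waiter(SAMPLE)` yields the thread name `LockByteRangeHandlerThread` and the message `tmMsgBRTellAcquire1`, which is the combination this APAR describes.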
Local fix
Problem summary
The token manager should never return E_RESTART for a RESET_STATE request, because reset requests have no retry loop (see ctResetServer/crtResetServer). In this case the token domain was recovering, stAwaitStable returned E_RESTART for a RESET_STATE request, and the token was left in COPSET state, which caused subsequent token requests to hang.
Problem conclusion
Since a RESET_STATE request never grants a new token, there is no need to block or fail the request because of pending recovery (just like CTM_A_RELEASE).
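The logic described above can be illustrated with a simplified model. This is a sketch under stated assumptions, not the product source: `await_stable` and `reset_server` are hypothetical stand-ins for stAwaitStable and ctResetServer/crtResetServer, the return-code values and the `STABLE` state name are invented for illustration, and the point at which the token enters COPSET is assumed.

```python
from enum import Enum, auto


class Request(Enum):
    ACQUIRE = auto()      # may grant a new token
    RELEASE = auto()      # stands in for CTM_A_RELEASE
    RESET_STATE = auto()  # never grants a new token


E_OK = 0
E_RESTART = 1  # placeholder value, not the real error code

# Request types allowed through while the token domain is recovering.
PRE_FIX_EXEMPT = frozenset({Request.RELEASE})
POST_FIX_EXEMPT = frozenset({Request.RELEASE, Request.RESET_STATE})


def await_stable(request, domain_recovering, exempt):
    """Hypothetical stand-in for stAwaitStable."""
    if not domain_recovering or request in exempt:
        return E_OK
    return E_RESTART  # caller is expected to retry


def reset_server(token, domain_recovering, exempt):
    """Hypothetical reset path: note there is no retry loop here."""
    token["state"] = "COPSET"  # assumed intermediate state
    rc = await_stable(Request.RESET_STATE, domain_recovering, exempt)
    if rc == E_OK:
        token["state"] = "STABLE"  # illustrative name for a settled token
    return rc


# Before the fix: the reset fails once and the token stays in COPSET.
stuck = {"state": "STABLE"}
rc_before = reset_server(stuck, domain_recovering=True, exempt=PRE_FIX_EXEMPT)

# After the fix: the reset proceeds despite pending recovery.
healthy = {"state": "STABLE"}
rc_after = reset_server(healthy, domain_recovering=True, exempt=POST_FIX_EXEMPT)
```

In this model, requests that can grant a new token still receive E_RESTART during recovery. Only the non-granting request types are exempted, which matches the reasoning in the conclusion.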
Temporary fix
Comments
APAR Information
APAR number
IV93596
Reported component name
SPECTRUM SCALE
Reported component ID
5725Q01AP
Reported release
422
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt / Xsystem
Submitted date
2017-02-21
Closed date
2017-02-21
Last modified date
2017-03-21
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
SPECTRUM SCALE
Fixed component ID
5725Q01AP
Applicable component levels
R422 PSY U876415
17/03/21 I 1000
Document Information
Modified date:
21 March 2017