A fix is available
APAR status
Closed as program error.
Error description
System hang with the following stack. . --- CPU #0 --- pvthread+000200 STACK: [004B16C8]krlock+000248 (F1000A0480110E00, F1000A00232EB288 [??]) [00534138]slock_krlock_acquire+000258 (??, ??, ??) [00534778]slock+000318 (??, ??) [000090C4].disable_lock+0000C0 () [00469548]soputonfreelist+0009A8 (??) [043BB204]in_pcbdetach+0002E4 (??) [044455C4]tcp_close+000164 (??) [04445CFC]tcp_drop+0000FC (??, ??) .. --- CPU #12 --- pvthread+0BAB00 STACK: [005346C4]slock+000264 (F1000E00010B4300, F000000030853110 [??]) [000090C4].disable_lock+0000C0 () [004E6D34]sodequeue+000374 (??, ??, ??, ??) [004E6388]k_soaccept+000548 (??, ??, ??, ??, ??) [004E56A0]accept1+000200 (??, ??, ??, ??, ??) [00003850]ovlya_addr_sc_flih_main+000130 () [kdb_get_virtual_memory] no real storage @ 368386B8 [D0230AA8]D0230AA8 () [kdb_read_mem] no real storage @ FFFFFFFFFFF8EF0
Local fix
N/A
Problem summary
System may hang because of a deadlock between sodequeue() and soputonfreelist(). This may happen when there is one thread trying to accept a connection and another thread is trying to drop that same connection because of an error. The stack traces of two threads would look like this: --- CPU #0 --- pvthread+000200 STACK: 004B16C8 krlock+000248 (F1000A0480110E00, F1000A00232EB288 ?? ) 00534138 slock_krlock_acquire+000258 (??, ??, ??) 00534778 slock+000318 (??, ??) 000090C4 .disable_lock+0000C0 () 00469548 soputonfreelist+0009A8 (??) 043BB204 in_pcbdetach+0002E4 (??) 044455C4 tcp_close+000164 (??) 04445CFC tcp_drop+0000FC (??, ??) .. --- CPU #12 --- pvthread+0BAB00 STACK: 005346C4 slock+000264 (F1000E00010B4300, F000000030853110 ?? ) 000090C4 .disable_lock+0000C0 () 004E6D34 sodequeue+000374 (??, ??, ??, ??) 004E6388 k_soaccept+000548 (??, ??, ??, ??, ??) 004E56A0 accept1+000200 (??, ??, ??, ??, ??) 00003850 ovlya_addr_sc_flih_main+000130 () ..
Problem conclusion
Fixed a lock order issue in socket code to avoid the deadlock.
Temporary fix
Comments
6100-06 - use AIX APAR IV21636 6100-07 - use AIX APAR IV34802 6100-08 - use AIX APAR IV42094 7100-00 - use AIX APAR IV42247 7100-01 - use AIX APAR IV42221 7100-02 - use AIX APAR IV42147
APAR Information
APAR number
IV42221
Reported component name
AIX V7.1
Reported component ID
5765H4000
Reported release
710
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Submitted date
2013-05-09
Closed date
2013-05-09
Last modified date
2013-11-26
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
AIX V7.1
Fixed component ID
5765H4000
Applicable component levels
R710 PSY U860705
UP13/07/30 I 1000
[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SSMV87","label":"AIX 6.1 Enterprise Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"710","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSMVAX","label":"AIX Express Edition"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"710","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11R","label":"AIX 7.1 HIPERS, APARs and Fixes"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"710","Edition":"","Line of Business":{"code":"","label":""}}]
Document Information
Modified date:
26 November 2013