IBM Support

IT00445: NDMP "BACKUP NODE" HANGS IF THE DATAMOVER SOCKET IS WRONG AT OPERATING SYSTEM LAYER.

APAR status

  • Closed as program error.

Error description

  • During a BACKUP NODE operation, the Tivoli Storage Manager
    server may loop if the NDMP socket is broken at the operating
    system layer. No errors are logged in the Tivoli Storage Manager
    activity log. The server should abort the process with an error
    message instead of looping on retries.
    Customer/L2 Diagnostics:
    A server trace collected with the trace classes BRNODE BFREMOTE
    NA SPI SPID SESSREMOTE shows the agent thread hung at
    "waiting for next request":
    *****************
    [pvr.c][14205][AgentThread]:PVR I/O agent (49) finished WRITENC
    request; rc=0.
    [pvr.c][13718][AgentThread]:PVR I/O agent (49) waiting for next
    request.
    [pvr.c][13765][AgentThread]:PVR I/O agent (49) processing
    WRITENC request.
    [pvrntp.c][3284][NtpWriteNC]:Writing 262096 bytes to volume
    XXXXXXXX.
    [pvrntp.c][6331][DumpBlock]:Dumping block 2740082 to NTP drive
    DRIVE (mtx.x.x.x); 262144 bytes, lbp = 0.
    [pvrntp.c][5800][NtpBuildComBlockHdr]:Building NTP Block Header
    - block ID = 2740082 and data bytes = 262096.
    [pspvr.c][3878][PvrPsDevWrite]:handle = 103936, wrote amt =
    262144, numBytes requested = 262144
    [pvrntp.c][3394][NtpWriteNC]:262096 bytes written to volume
    XXXXXXXX.
    [pvr.c][14205][AgentThread]:PVR I/O agent (49) finished WRITENC
    request; rc=0.
    [pvr.c][13718][AgentThread]:PVR I/O agent (49) waiting for next
    request.
    [pvr.c][13765][AgentThread]:PVR I/O agent (49) processing
    PREPARECLOSE request. <-- canceled by user
    [pvrntp.c][3730][NtpPrepareClose]:Preparing to close volume
    XXXXXXXX, logging 100058368 kbytes.
    [pvr.c][14205][AgentThread]:PVR I/O agent (49) finished
    PREPARECLOSE request; rc=0.
    [pvr.c][13718][AgentThread]:PVR I/O agent (49) waiting for next
    request.
    [pvr.c][13765][AgentThread]:PVR I/O agent (49) processing
    FORCEEOD request.
    [pvr.c][14205][AgentThread]:PVR I/O agent (49) finished
    FORCEEOD request; rc=0.
    [pvr.c][13718][AgentThread]:PVR I/O agent (49) waiting for next
    request.
    *****************
    Meanwhile, the filer's ndmp.log shows that bytes_processed is
    not changing:
    *****************
    [kern_ndmpd:info:1870] [58032]  DEBUG:
    bytes_processed=102428794880 (0x17d93b6000)
    [kern_ndmpd:info:1870] [58032]  DEBUG:
    bytes_processed=102428794880 (0x17d93b6000)
    [kern_ndmpd:info:1870] [58032]  DEBUG:
    bytes_processed=102428794880 (0x17d93b6000)
    *****************
    Tivoli Storage Manager Versions Affected:
    All supported Tivoli Storage Manager server versions on all
    supported platforms.
    Initial Impact:
    Medium
    Additional Keywords:
    tsm ndmp nas hang abort loop zz61 zz62 zz63 zz64 zz71
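
    The symptom in the filer log above (a bytes_processed counter
    that repeats unchanged) can be checked with a small script. The
    following is a minimal sketch, not an IBM or filer-vendor tool:
    the function names and the min_repeats threshold are
    hypothetical, and because the APAR does not state how often the
    filer writes these DEBUG lines, a repeated value is only a hint
    that the data mover has stalled.

```python
import re

# Matches the counter in lines such as
# "bytes_processed=102428794880 (0x17d93b6000)".
BYTES_RE = re.compile(r"bytes_processed=(\d+)")


def bytes_processed_samples(log_text):
    """Return every bytes_processed value in the order it appears."""
    return [int(m.group(1)) for m in BYTES_RE.finditer(log_text)]


def looks_stalled(log_text, min_repeats=3):
    """Return True if the last min_repeats samples are identical.

    min_repeats is an assumed threshold; choose it to suit how often
    the filer actually logs the counter.
    """
    samples = bytes_processed_samples(log_text)
    if len(samples) < min_repeats:
        return False
    tail = samples[-min_repeats:]
    return all(value == tail[0] for value in tail)


if __name__ == "__main__":
    # Excerpt copied from the ndmp.log shown in this APAR.
    excerpt = """
    [kern_ndmpd:info:1870] [58032]  DEBUG:
    bytes_processed=102428794880 (0x17d93b6000)
    [kern_ndmpd:info:1870] [58032]  DEBUG:
    bytes_processed=102428794880 (0x17d93b6000)
    [kern_ndmpd:info:1870] [58032]  DEBUG:
    bytes_processed=102428794880 (0x17d93b6000)
    """
    print(looks_stalled(excerpt))  # prints True
```

    If the check reports a stall while the server trace shows the
    agent thread at "waiting for next request", the backup is likely
    in the state this APAR describes.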
    

Local fix

  • Cancel the BACKUP NODE process to stop the hung backup.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All Tivoli Storage Manager server users of network-attached  *
    * storage.                                                     *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See ERROR DESCRIPTION.                                       *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing levels when available.                          *
    * This problem is currently projected to be fixed in levels    *
    * 6.3.5 and 7.1.1.                                             *
    * Note that this is subject to change at the discretion of     *
    * IBM.                                                         *
    ****************************************************************
    

Problem conclusion

  • This problem was fixed.
    Affected platforms:  AIX, HP-UX, Solaris, Linux, and Windows.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT00445

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    63W

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2014-03-20

  • Closed date

    2014-04-17

  • Last modified date

    2015-07-09

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

  • R63A PSY

       UP

  • R63H PSY

       UP

  • R63L PSY

       UP

  • R63S PSY

       UP

  • R63W PSY

       UP

  • R71A PSY

       UP

  • R71H PSY

       UP

  • R71L PSY

       UP

  • R71S PSY

       UP

  • R71W PSY

       UP

Document Information

Modified date:
09 July 2015