IBM Support

IC71672: FILER-TO-SERVER NDMP RESTORE FAILS DUE TO TIMEOUT

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • An NDMP Restore operation of an image in a native storage pool
    (created by an NDMP Filer to server backup operation) that
    contains a large number of files may fail as a result of a
    time-out condition in the Tivoli Storage Manager server.  When
    this occurs the restore operation will fail with the following
    error:
    
       ANR1104E NAS Restore process 1249 terminated - NDMP session
       errors encountered.
    
    During an NDMP restore operation the NAS device typically
    performs the restore in a two step process. The first step is to
    recreate the destination directory or file system's tree
    structure.  Very little data is transmitted between the Tivoli
    Storage Manager server and NAS device in this step.  If the
    image being restored contains a large number of files this step
    might take several hours.  Once the tree structure is recreated,
    the second step is to copy actual data on to the destination
    directory or file system.
    
    The Tivoli Storage Storage Manager server's RESTORE NODE process
    monitors status from both the NAS device and the Tivoli Storage
    Manager server's NDMP tape server.  It is possible that no
    status will be returned from the NDMP tape server during a long
    running "step 1" described above.  Should this time exceed six
    hours the server will abort the restore process.  A TSM Server
    trace (SPI SPID) will show the following errors indicating that
    the restore operation is being aborted:
    
       14:03:17.560 [166219][ndmpsdk.c][441][spiTrace]:
       ndmp_recv_msg: reply_error=1..
       14:03:17.564 [166219][ndmpsdk.c][575][spiTrace]:
       error handling reply..
       14:03:17.564 [166219][ndmover.c][154][spiTrace]:
       Mover Get State rc= -1 .
       14:03:17.565 [166219][ndmpspi.c][2851][spiTrace]:
       Error obtaining the state of NDMP data or mover interface.
       Total bytes restored 4765044736.
       14:03:17.565 [166219][ndmpspi.c][4977][spiTrace]:
       Aborting NDMP data interface.
    
    Platforms affected:
    All versions and platforms
    
    Additional Keywords:
    IC54481 nas ndmp netapp
    

Local fix

Problem summary

  • ****************************************************************
    * USERS AFFECTED: All Tivoli Storage Manager server users.     *
    ****************************************************************
    * PROBLEM DESCRIPTION: See ERROR DESCRIPTION.                  *
    ****************************************************************
    * RECOMMENDATION: Apply fixing level when available. This      *
    *                 problem is currently projected to be fixed   *
    *                 in levels 5.5.6, 6.1.5 and 6.2.3.  Note that *
    *                 this is subject to change at the discretion  *
    *                 of IBM                                       *
    ****************************************************************
    *
    This fix introduces a new server option NDMPCONNECTIONTIMEOUT
    that allows users to increase, if necessary, the amount of
    time that the Tivoli Storage Manager server will wait to
    receive status updates from the NDMP tape server during a
    restore operation. Increasing the timeout value may be
    necessary to avoid failures during large NDMP restore
    operations.
    The NDMPCONNECTIONTIMEOUT parameter accepts values ranging
    from 1 to 360 hours, with a default value of 6 hours. To
    increase the timeout period to 24 hours, for example, the
    following entry would need to be added to the dsmserv.opt
    file:
        NDMPCONNECTIONTIMEOUT 24
    The Tivoli Storage Manager server will then need to be
    restarted to recognize the new timeout value.
    

Problem conclusion

  • This problem was fixed.
    Affected platforms:  AIX, HP-UX, Sun Solaris, Linux and zOS.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IC71672

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    55A

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2010-10-04

  • Closed date

    2010-11-16

  • Last modified date

    2013-12-03

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

  • R55A PSY

       UP

  • R55H PSY

       UP

  • R55L PSY

       UP

  • R55S PSY

       UP

  • R55W PSN

       UP

  • R55Z PSY

       UP

  • R61A PSY

       UP

  • R61H PSY

       UP

  • R61L PSY

       UP

  • R61S PSY

       UP

  • R61W PSN

       UP

  • R61Z PSY

       UP

  • R62A PSY

       UP

  • R62H PSY

       UP

  • R62L PSY

       UP

  • R62S PSY

       UP

  • R62W PSN

       UP

  • R62Z PSY

       UP

[{"Line of Business":{"code":"LOB26","label":"Storage"},"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSGSG7","label":"Tivoli Storage Manager"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"55A"}]

Document Information

Modified date:
16 September 2021