Viewing NDMP log information
Use the Storwize V7000 Unified CLI lsndmplog command to view information in the NDMP log. You can view the most recent 10 lines of the log, specify a number of lines to limit the display to the most recent entries, or save the entire log file as a local file on the active management node. You can also use the graphical user interface (GUI) to work with this function.
Use the Storwize V7000 Unified CLI lsndmplog command
with the -g or --nodeGroup option
to view all of the NDMP log information related to the specified file module group.
In the following example, the file module group ndmpg1 is
specified:
# lsndmplog -g NDMPG1
The
most recent 10 lines of the NDMP log files for the file modules in
the specified file module group
are displayed by the CLI interface.Note: You must specify either a file module group
as in the above example, or a list of individual file modules using
the -n option as in the example below, but not both
in the same iteration of the lsndmplog command.
Use the Storwize V7000 Unified CLI lsndmplog command
with the -n or --nodes option to
view all of the NDMP log information related to the specified file module.
In the following example, the file module mgmt001st001 is
specified:The
most recent 10 lines
# lsndmplog -n mgmt001st001
of
the NDMP log file for the specified file module are
displayed by the CLI interface.Use the Storwize V7000 Unified CLI lsndmplog command
with the -l or --lines option to
view the most recent l lines of the NDMP log
information related to the specified file module.
In the following example, the file module mgmt001st001 and 25 lines
are specified:
# lsndmplog -n mgmt001st001 -l 25
The
most recent l lines of the NDMP log file for
the specified file module is
displayed by the CLI interface.Use the Storwize V7000 Unified CLI lsndmplog command
with the -o or --outputLogFilePath option
to save the log file as a temporary file, as in the following example:
# lsndmplog -n mgmt001st001 -o /ibm/gpfs0/cifs_export/ndmplog.out
Note: The designated directory must be a current share or
export on the file system. Attempting to save the log file to /root or
any other non-exported directory will result in a message similar
to the following one:
Error: the location chosen to save the NDMP log file must be an existing export.
While the Storwize V7000 Unified alert
log viewed from the GUI is more helpful to most users, NDMP information
is displayed in the NDMP log file in the format of the following sample.
- The first column, XX:YY is for internal use only.
- The second column is month and day in MM/DD format.
- The third column is hour, minute, second, fraction of a second in hh:mm:ss:ffffff format.
- The fourth column in decimal:hexadecimal format is the process
ID of the process that generated the message.
- For many entries, this is the process ID of the NDMP backup or restore operation, and can be viewed with the -i or --session ID option of the lsndmpsession command.
- The fifth column is program:subroutine:line, for internal use only.
- The sixth and final column has multiple formats, depending on
the type of entry:
- Some are text messages, some start with an event ID, and some display internal counters and their values.
IO:CN 10/06 19:34:47.546755 997774:258430f0 comm.c:ndmpRun:904 0001: Starting ndmpd listener on port 10000 of all IP address
The
above message indicates that the NDMP data server is starting, and
will be listening for control connections from a DMA of any IP address
on port 10000IO:CN 10/06 19:34:47.553348 997969:258430f0 comm.c:ndmpRun:980 0005: ndmpd started
The
above message indicates that the NDMP data server has completed startup
and is running.DG:AS 10/06 19:34:55.563824 997979:8ffb4090 session_monitor.c:main:279 Cleaning up session 726147.da ...
The
above message indicates that the NDMP data server is performing background
maintenance to clean up after a previous NDMP session that did not
complete normally. This would occur when the NDMP data server is restarted
after an NDMP session failure.IO:CN 10/06 19:43:55.816298 1035333:258430f0 data.c:ndmpdDataConnect_v4:902 1002: Data Server in CONNECTED state
The
above message indicates that a control request has been received from
a DMA, that the handshake process has completed, and that the NDMP
data server is ready to communicate with the DMA.IO:CN 10/06 19:43:55.890994 1035333:42b4c940 ndmp_backup.c:startBackup:443 7006: Level 0 backup of
/ibm/gpfs0/shared/home/datafiles/test_dir3 started at Wed Oct 6 19:43:55 2010
The
above message indicates that a full backup of the listed directory
has started. Level 0 indicates a full backup.DG:AS 10/06 19:43:55.909662 1035391:258430f0 log.c:logErrorInternal:642 CIL_event failed.
The
above message indicates that an attempt by the NDMP data server to
send an alert to the alert log has failed. CIL is an acronym for 'custom
interface layer', which is an internal mechanism. One of several
reasons that this error might display in the NDMP log might be that
multiple identical Information messages have been sent. It is not
necessary to call support based solely on the appearance of this message.
Any actionable information is still displayed in the alert log.IO:CN 10/06 19:44:42.216909 1035605:258430f0 data.c:ndmpdDataAbort:326 1010: Data Server received abort request
DG:AS 10/06 19:44:42.227389 1035605:420ac940 util.c:waitForData:1189 session aborted
IO:CN 10/06 19:44:42.227444 1035605:420ac940 util.c:waitForData:1190 1011: Data Server aborted
The
above three messages were created by an NDMP stop session request
from a DMA.DG:AS 10/06 19:44:42.367566 1035605:416ab940 buffer.c:freeWriteQueueIndex:1472 numProcesses[4], perProcessBufferCount[41]
The
above message is informational, showing the number of NDMP processes
that are running (numProcesses[4]) and how many shared memory buffers
are allocated (perProcessBufferCount[41]). This message appears when
the number of sessions is reduced, because a backup or restore operation
has either completed or been stopped manually. IO:CN 10/06 19:46:19.538617 1035333:42b4c940 ndmp_backup.c:startBackup:652 7012: Level 0 backup of
/ibm/gpfs0/shared/home/datafiles/test_dir3 finished at Wed Oct 6 19:46:19 2010.
The
above message indicates that a full backup of the listed directory
has completed.IO:CN 10/06 19:46:19.545279 1035333:42b4c940 ndmp_backup.c:startBackup:662 7013: Total Bytes = 2048011272,
Files:20 Directories:1 Throughput = 14250464.464822 bytes/sec.
The
above message lists data transfer information for a completed backup:
the number of bytes, files and directories that were restored, and
throughput.IO:CN 10/06 20:36:56.555597 1249104:43111940 ndmp_backup.c:startBackup:443 7006: Level 1 backup of
/ibm/gpfs0/shared/home/datafiles/test_dir1 started at Wed Oct 6 20:36:56 2010.
The
above message indicates that an incremental backup of the listed directory
has started. Level 1 indicates an incremental backup.DG:AS 10/06 20:35:23.057341 1240190:41f52940 util.c:waitForData:1211 select error: Success, abortFlag = 1
DG:AS 10/06 20:35:23.057417 1240190:41f52940 ndmp_brm_util.c:dataBufferWriteThread:633 Error sendfile
DG:AS 10/06 20:35:23.057440 1240190:41f52940 ndmp_brm_util.c:dataBufferWriteThread:684 rdErr:0 wrErr:1 skipping.
DG:AS 10/06 20:35:23.057780 1240190:41551940 ndmp_brm_util.c:flushDataCache:1080 rdErr:0 wrErr:1 skipping.
DG:AS 10/06 20:35:23.057833 1240190:41551940 ndmp_backup.c:startBackup:581 Error flushing data cache.
DG:AS 10/06 20:35:23.057857 1240190:41551940 buffer.c:freeWriteQueueIndex:1472 numProcesses[0], perProcessBufferCount[125]
DG:AS 10/06 20:35:23.064173 1242468:85f1b010 session_mgm_file.c:adminKillServer:472 Server 1240190 not terminated 6 seconds after SIGUSR1
DG:AS 10/06 20:35:29.065327 1242468:85f1b010 session_mgm_file.c:adminKillServer:472 Server 1240190 not terminated 7 seconds after SIGUSR1
The
group of eight messages above appeared after an stopndmpsession command
had been submitted. The exact number of each type of message varies.
The first six messages in the group were generated by an NDMP backup
session with a session ID of 1240190. The last two entries indicate
that clean up operations for the NDMP session are still running. The
NDMP data server is reporting the duration in seconds of the listed
NDMP clean up session (Session ID 1240190). The following five messages
were generated from a restore operation: IO:CN 10/07 16:41:36.506940 1804292:5a8140f0 data.c:ndmpdDataConnect_v4:902 1002: Data Server in CONNECTED state
The
above message indicates that a control request has been received from
a DMA, that the handshake process has completed, and that the NDMP
data server is ready to communicate with the DMA.IO:CN 10/07 16:41:37.238097 1804292:42618940 ndmp_restore.c:ndmpRestore:327 8000: Restore
[/ibm/gpfs1/ndmp] started at Thu Oct 7 16:41:37 2010.
The
above message indicates that a restore operation has started and lists
the directory that is being restored.IO:CN 10/07 16:41:41.684893 1804292:42618940 ndmp_restore.c:ndmpRestore:413 8003: Restore
processed bytes = 1536010248
The above message lists
the total number of bytes that were processed during the restore operation.IO:CN 10/07 16:41:41.691832 1804292:42618940 ndmp_restore.c:ndmpRestore:426 8005: Restore
[/ibm/gpfs1/ndmp] completed at Thu Oct 7 16:41:41 2010.
The
above message indicates when the restore of the listed directory completed.IO:CN 10/07 16:41:41.697577 1804292:42618940 ndmp_restore.c:ndmpRestore:436 Total Bytes = 1536010248,
Files:15 Directories:4 Throughput = 344438002.021766 bytes/sec.
The
above message lists data transfer information for a completed restore:
the number of bytes, files and directories that were restored, and
throughput.DG:AS 12/14 14:27:20.896993 1223253:e8c0e010 session_mgm_file.c:adminReleaseSlot:568 Invalid file handle:
/var/ndmp/session//1216740.dat:No such file or directory
This is a harmless condition. The session_monitor runs in a loop checking the session info files. In this case,
session_monitor noticed that there is a session_info file and checks to see if the session is still there. But between
this, the session is exiting and has already cleaned up the session info file. So by the time the session_monitor tries
to read the session info file to perform clean up, the session info file is already gone.
The
above message an informational message stating that an NDMP session
has cleaned up its own temporary session file before the session_monitor
daemon was able to clean up the temporary session file for that session.
This message does not indicate an error; no action is required.Note: Display
output wraps differently, depending on the user interface window.
Note: If
NDMP has recently been activated, it is normal to see several warning
messages of type AK016CW with a statement that the NDMP daemon and/or
session_monitor daemon is not running.