AIX operating systemsLinux operating systemsWindows operating systems

QUERY DEDUPSTATS (Query data deduplication statistics)

Use this command to display information about data deduplication statistics for a directory-container storage pool or a cloud storage pool.

You must issue the GENERATE DEDUPSTATS command before you can issue the QUERY DEDUPSTATS command.

Privilege class

Any administrator can issue this command.

Syntax

Read syntax diagramSkip visual syntax diagram
>>-Query DEDUPSTats--+-----------+--+-----------+--------------->
                     '-pool_name-'  '-node_name-'   

   .-*------------------.  .-Format--=--Standard-----.   
>--+--------------------+--+-------------------------+---------->
   | .-,--------------. |  '-Format--=--+-Standard-+-'   
   | V                | |               '-Detailed-'     
   +---filespace_name-+-+                                
   | .-,--------.       |                                
   | V          |       |                                
   '-----FSID---+-------'                                

   .-CODEType--=--BOTH-----------.   
>--+-----------------------------+------------------------------>
   '-CODEType--=--+-UNIcode----+-'   
                  +-NONUNIcode-+     
                  '-BOTH-------'     

   .-NAMEType--=--SERVER------.                           
>--+--------------------------+--+--------------------+--------->
   '-NAMEType--=--+-SERVER--+-'  '-BEGINDate--=--date-'   
                  +-UNIcode-+                             
                  '-FSID----'                             

>--+--------------------+--+------------------+----------------->
   '-BEGINTime--=--time-'  '-ENDDate--=--date-'   

                         .-ALLSTats--=--No------.   
>--+------------------+--+----------------------+--------------><
   '-ENDTime--=--time-'  '-ALLSTats--=--+-Yes-+-'   
                                        '-No--'     

Parameters

pool_name
Specifies the name of the directory-container storage pool whose data is contained in the data deduplication statistics. This parameter is optional. If you do not specify a value for this parameter, all storage pools are displayed. You can specify up to 30 characters for the storage pool name. If you specify more than 30 characters, the command fails.
Restriction: You can specify directory-container storage pools or cloud storage pools only.
node_name
Specifies the name of the client node whose data is contained in the data deduplication statistics. This parameter is optional. If you do not specify a value for this parameter, all nodes are displayed. You can specify up to 64 characters for the node name. If you specify more than 64 characters, the command fails.
filespace_name or FSID
Specifies the names of one or more file spaces that contain the data to be included in the data deduplication statistics. This parameter is optional. You can use wildcard characters to specify this name. If you do not specify a value for this parameter, all file spaces are displayed. You can specify more than one file space by separating the names with commas and no intervening spaces.
For a server that has clients with support for file spaces that are in Unicode format, you can enter either a file space name or a file space identifier (FSID). If you enter a file space name, you might need to have the server convert the file space name that you enter. For example, you might need to have the server convert the name that you enter from the server's code page to Unicode.
Restrictions: The following restrictions apply to file space names and FSIDs:
  • You must specify a node name if you specify a file space name.
  • Do not mix file space names and FSIDs in the same command.
Format
Specifies how the information is displayed. This parameter is optional. Specify one of the following values:
Standard
Specifies that partial information is displayed for the specified data deduplication sets. This is the default.
Detailed
Specifies that complete information is displayed for the specified data deduplication sets.
CODEType
Specify what type of file spaces to include in the operation. The default value is BOTH, which specifies that file spaces are included regardless of code page type. Use this parameter only when you enter a single wildcard character for the file space name. Specify one of the following values:
UNIcode
Include file spaces that are in Unicode format.
NONUNIcode
Include file spaces that are not in Unicode format.
BOTH
Include file spaces regardless of code page type. This is the default.
NAMEType
Specify how you want the server to interpret the file space names that you enter. This parameter is useful when the server has clients with support for file spaces that are in Unicode format. You can use this parameter for IBM Spectrum Protect™ clients that use Windows, NetWare, or Macintosh OS X operating systems.
Use this parameter only when you enter a node name and a file space name or FSID.
Restriction: When you specify this parameter, the file space name cannot contain a wildcard.

Specify one of the following values:

SERVER
The server uses the server's code page to interpret the file space names. This is the default.
UNIcode
The server converts the file space name that is entered from the server code page to the UTF-8 code page. The success of the conversion depends on the actual characters in the name and the server's code page. Conversion can fail if the string includes characters that are not available in the server code page, or if the server cannot access system conversion routines.
FSID
The server interprets the file space names as their FSIDs.
BEGINDate
Specifies the start date to query data deduplication statistics. This parameter is optional. You can use this parameter with the BEGINTIME parameter to specify a range for the date and time. If you specify a begin date without a begin time, the time is at 12:00 a.m. (midnight) on the date you specify.
Restriction: You can specify this parameter only when you specify the ALLSTATS=YES parameter.
Specify one of the following values:
Value Description Example
MM/DD/YYYY A specific date. 09/15/2015
TODAY The current date. TODAY
TODAY-days or-days The current date minus days specified. TODAY -3 or -3.
EOLM (End Of Last Month) The last day of the previous month. EOLM
EOLM-days The last day of the previous month minus days specified. EOLM-1

To include records that were active a day before the last day of the previous month.

BOTM (Beginning Of This Month) The first day of the current month. BOTM
BOTM+days The first day of the current month, plus days specified. BOTM+9

To include records that were active on the 10th day of the current month.

BEGINTime
Specifies the start time to query the data deduplication statistics. This parameter is optional. You can use this parameter with the BEGINDATE parameter to specify a range for the date and time. If you specify a begin time without a begin date, the date is the current date at the time you specify.
Restriction: You can specify this parameter only when you specify the ALLSTATS=YES parameter.
Specify one of the following values:
Value Description Example
HH:MM:SS A specific time. 10:30:08
NOW The current time. NOW
NOW+HH:MM or +HH:MM The current time plus hours and minutes specified. NOW+02:00 or +02:00.
NOW-HH:MM or-HH:MM The current time minus hours and minutes specified. NOW-02:00 or –02:00.
ENDDate
Specifies the end date to query data deduplication statistics. This parameter is optional. You can use this parameter with the ENDTIME parameter to specify a range for the date and time. If you specify an end date without an end time, the time is at 11:59:59 p.m. on the specified end date.
Restriction: You can specify this parameter only when you specify the ALLSTATS=YES parameter.
Specify one of the following values:
Value Description Example
MM/DD/YYYY A specific date 09/15/1999
TODAY The current date TODAY
TODAY-days or -days The current date minus days specified. TODAY -3 or -3.
EOLM (End Of Last Month) The last day of the previous month. EOLM
EOLM-days The last day of the previous month minus days specified. EOLM-1

To include records that were active a day before the last day of the previous month.

BOTM (Beginning Of This Month) The first day of the current month. BOTM
BOTM+days The first day of the current month, plus days specified. BOTM+9

To include records that were active on the 10th day of the current month.

ENDTime
Specifies the end time of the range to query the data deduplication statistics. This parameter is optional. You can use this parameter with the ENDDATE parameter to specify a range for the date and time. If you specify an end time without an end date, the date is the current date at the time you specify.
Restriction: You can specify this parameter only when you specify the ALLSTATS=YES parameter.
Specify one of the following values:
Value Description Example
HH:MM:SS A specific time. 10:30:08
NOW The current time. NOW
NOW+HH:MM or +HH:MM The current time plus hours and minutes on the specified end date NOW+02:00 or +02:00.
NOW-HH:MM or -HH:MM The current time minus hours and minutes on the specified end date NOW-02:00 or -02:00.
ALLSTats
Specifies whether to display all data deduplication statistics or only the most recently generated data deduplication statistics. This parameter is optional. Specify one of the following values:
No
Displays only data deduplication statistics that were most recently generated for each node and file space.
Yes
Displays all data deduplication statistics.

Example: View data deduplication statistics in standard format

Display data deduplication statistics for a storage pool that is named POOL1. The data deduplication statistics are for node NODE1 and the statistics from 8 May 2015 are displayed. See Field descriptions for field descriptions.
query dedupstats pool1 node1 begindate=05/08/2015
                         Date/Time: 05/05/2015 15:15:23
                 Storage Pool Name: POOL1
                         Node Name: NODE1
                    Filespace Name: \\fs1\a1
                              FSID: 41
                              Type: Bkup
           Total Saving Percentage: 86.62   
         Total Data Protected (MB): 311                

Example: View detailed data deduplication statistics

Display detailed information for data deduplication for a storage pool that is named POOL1.
query dedupstats pool1 format=detailed
                         Date/Time: 05/05/2015 15:15:23
                 Storage Pool Name: POOL1
                         Node Name: NODE1
                    Filespace Name: \\fs1\a1
                              FSID: 41
                              Type: Bkup
         Total Data Protected (MB): 47,646
             Total Space Used (MB): 10,139
            Total Space Saved (MB): 37,507
           Total Saving Percentage: 78.72
             Deduplication Savings: 21,278,892,501
          Deduplication Percentage: 42.59   
     Non-Deduplicated Extent Count: 1,658
Non-Deduplicated Extent Space Used: 732,626
               Unique Extent Count: 189,791
          Unique Extent Space Used: 23,385,014,635
               Shared Extent Count: 178,712
      Shared Extent Data Protected: 26,575,010,669
          Shared Extent Space Used: 5,267,815,421
               Compression Savings: 5,267,815,421
            Compression Percentage: 62.93
           Compressed Extent Count: 352,498
         Uncompressed Extent Count: 17,663
      

Field descriptions

Date/Time
Displays the time and date that the data deduplication statistics are generated.
Storage Pool Name
The name of the storage pool.
Node Name
The name of the client node whose data is contained in the data deduplication statistics.
Filespace Name
The name of the file space.
FSID
The name of the file space identifier.
Type
The type of data. The following values are possible:
Arch
Data that has been archived.
Bkup
Data that has been backed up.
SpMg
Data that has been migrated from an IBM Spectrum Protect for Space Management client.
Total Data Protected (MB)
The logical amount of data, in megabytes, that is protected in the storage pool before data deduplication and compression. This value represents the sum of the Total Space Used (MB) and Total Space Saved (MB) values.
Total Space Used (MB)
The total amount of used space in the storage pool, in megabytes. This value is the physical amount of data that is backed up after data deduplication and compression.
Total Space Saved (MB)
The total amount of space, in megabytes, of data that is removed from the storage pool because of data deduplication and compression. This value represents the sum of the Deduplication Savings and Compression Savings values.
Total Saving Percentage
The percentage of data that is removed from the storage pool because of compression and data deduplication.
Deduplication Savings
The amount of used space that is saved in the storage pool because of data deduplication.
Deduplication Percentage
The percentage of data that is removed from the storage pool because of data deduplication.
Non-Deduplicated Extent Count
The number of data extents that are not deduplicated in the storage pool.
Non-Deduplicated Extent Space Used
The amount of space that is used by data extents that are not deduplicated in the storage pool. This value applies to containers that have a .ncf file type and that do not have deduplicated data.
Tip: Data extents that are not deduplicated consist of the following data or file types:
  • File metadata.
  • Files that are less than 2 KB.
  • Files that use client encryption.
Unique Extent Count
The number of data extents that are not shared by a node.
Unique Extent Space Used
The amount of space in the storage pool that is not shared by a node. This value applies to containers that have a .dcf file type and that do not have deduplicated data.
Shared Extent Count
The number of data extents that are used multiple times by the same node or by different nodes because of data deduplication.
Shared Extent Data Protected
The amount of space in the storage pool that is protected by shared data extents before data deduplication.
Shared Extent Space Used
The amount of space in the storage pool that is used by shared data extents after data deduplication.
Compression Savings
The amount of used space that is saved in the storage pool because of compression after data deduplication.
Compression Percentage
The percentage of data that is removed from the storage pool because of compression.
Compressed Extent Count
The number of data extents that are compressed.
Uncompressed Extent Count
The number of data extents that are uncompressed.

Related commands

Table 1. Commands related to QUERY DEDUPSTATS
Command Description
DELETE DEDUPSTATS Deletes data deduplication statistics.
GENERATE DEDUPSTATS Generates data deduplication statistics.