GPFS metrics

The following section lists all the GPFS™ metrics:

GPFSDisk
For each NSD in the system, for example myMachine|GPFSDisk|myCluster|myFilesystem|myNSD|gpfs_ds_bytes_read
  • gpfs_ds_bytes_read: Number of bytes read.
  • gpfs_ds_bytes_written: Number of bytes written.
  • gpfs_ds_max_disk_wait_rd: The longest time spent waiting for a disk read operation.
  • gpfs_ds_max_disk_wait_wr: The longest time spent waiting for a disk write operation.
  • gpfs_ds_max_queue_wait_rd: The longest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_ds_max_queue_wait_wr: The longest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_ds_min_disk_wait_rd: The shortest time spent waiting for a disk read operation.
  • gpfs_ds_min_disk_wait_wr: The shortest time spent waiting for a disk write operation.
  • gpfs_ds_min_queue_wait_rd: The shortest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_ds_min_queue_wait_wr: The shortest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_ds_read_ops: Number of read operations.
  • gpfs_ds_tot_disk_wait_rd: The total time in seconds spent waiting for disk read operations.
  • gpfs_ds_tot_disk_wait_wr: The total time in seconds spent waiting for disk write operations.
  • gpfs_ds_tot_queue_wait_rd: The total time spent between being enqueued for a read operation and the completion of that operation.
  • gpfs_ds_tot_queue_wait_wr: The total time spent between being enqueued for a write operation and the completion of that operation.
  • gpfs_ds_write_ops: Number of write operations.
GPFSFileset
For each independent fileset in the file system: Cluster name - GPFSFileset - filesystem name - fileset name.
For example: myCluster|GPFSFileset|myFilesystem|myFileset|gpfs_fset_maxInodes.
  • gpfs_fset_maxInodes: Maximum number of inodes for this independent fileset.
  • gpfs_fset_freeInodes: Number of free inodes available for this independent fileset.
  • gpfs_fset_allocInodes: Number of inodes allocated for this independent fileset.
GPFSFileSystem
For each file system, for example myMachine|GPFSFilesystem|myCluster|myFilesystem|gpfs_fs_bytes_read
  • gpfs_fs_bytes_read: Number of bytes read.
  • gpfs_fs_bytes_written: Number of bytes written.
  • gpfs_fs_disks: Number of disks in the file system.
  • gpfs_fs_max_disk_wait_rd: The longest time spent waiting for a disk read operation.
  • gpfs_fs_max_disk_wait_wr: The longest time spent waiting for a disk write operation.
  • gpfs_fs_max_queue_wait_rd: The longest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_fs_max_queue_wait_wr: The longest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_fs_min_disk_wait_rd: The shortest time spent waiting for a disk read operation.
  • gpfs_fs_min_disk_wait_wr: The shortest time spent waiting for a disk write operation.
  • gpfs_fs_min_queue_wait_rd: The shortest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_fs_min_queue_wait_wr: The shortest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_fs_read_ops: Number of read operations
  • gpfs_fs_tot_disk_wait_rd: The total time in seconds spent waiting for disk read operations.
  • gpfs_fs_tot_disk_wait_wr: The total time in seconds spent waiting for disk write operations.
  • gpfs_fs_tot_queue_wait_rd: The total time spent between being enqueued for a read operation and the completion of that operation.
  • gpfs_fs_tot_queue_wait_wr: The total time spent between being enqueued for a write operation and the completion of that operation.
  • gpfs_fs_write_ops: Number of write operations.
GPFSFileSystemAPI
These metrics gives the following information for each file system (application view). For example: myMachine|GPFSFilesystemAPI|myCluster|myFilesystem|gpfs_fis_bytes_read.
  • gpfs_fis_bytes_read: Number of bytes read.
  • gpfs_fis_bytes_written: Number of bytes written.
  • gpfs_fis_close_calls: Number of close calls.
  • gpfs_fis_disks: Number of disks in the file system.
  • gpfs_fis_inodes_written: Number of inode updates to disk.
  • gpfs_fis_open_calls: Number of open calls.
  • gpfs_fis_read_calls: Number of read calls.
  • gpfs_fis_readdir_calls: Number of readdir calls.
  • gpfs_fis_write_calls: Number of write calls.
GPFSNSDDisk
These metrics gives the following information about each NSD disk on the NSD server. For example: myMachine|GPFSNSDDisk|myNSDDisk|gpfs_nsdds_bytes_read.
  • gpfs_nsdds_bytes_read: Number of bytes read.
  • gpfs_nsdds_bytes_written: Number of bytes written.
  • gpfs_nsdds_max_disk_wait_rd: The longest time spent waiting for a disk read operation.
  • gpfs_nsdds_max_disk_wait_wr: The longest time spent waiting for a disk write operation.
  • gpfs_nsdds_max_queue_wait_rd: The longest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_nsdds_max_queue_wait_wr: The longest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_nsdds_min_disk_wait_rd: The shortest time spent waiting for a disk read operation.
  • gpfs_nsdds_min_disk_wait_wr: The shortest time spent waiting for a disk write operation.
  • gpfs_nsdds_min_queue_wait_rd: The shortest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_nsdds_min_queue_wait_wr: The shortest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_nsdds_read_ops: Number of read operations.
  • gpfs_nsdds_tot_disk_wait_rd: The total time in seconds spent waiting for disk read operations.
  • gpfs_nsdds_tot_disk_wait_wr: The total time in seconds spent waiting for disk write operations.
  • gpfs_nsdds_tot_queue_wait_rd: The total time spent between being enqueued for a read operation and the completion of that operation.
  • gpfs_nsdds_tot_queue_wait_wr: The total time spent between being enqueued for a write operation and the completion of that operation.
  • gpfs_nsdds_write_ops: Number of write operations.
GPFSNSDFS
These metrics gives the following information for each filesystem served by a specific NSD server. For example: myMachine|GPFSNSDFS|myFilesystem|gpfs_nsdfs_bytes_read.
  • gpfs_nsdfs_bytes_read: Number of NSD bytes read, aggregated to the file system.
  • gpfs_nsdfs_bytes_written: Number of NSD bytes written, aggregated to the file system.
  • gpfs_nsdfs_read_ops: Number of NSD read operations, aggregated to the file system.
  • gpfs_nsdfs_write_ops: Number of NSD write operations, aggregated to the file system.
GPFSNSDPool
These metrics gives the following information for each filesystem and pool served by a specific NSD server. For example: myMachine|GPFSNSDPool|myFilesystem|myPool|gpfs_nsdpool_bytes_read.
  • gpfs_nsdpool_bytes_read: Number of NSD bytes read, aggregated to the file system.
  • gpfs_nsdpool_bytes_written: Number of NSD bytes written, aggregated to the file system.
  • gpfs_nsdpool_read_ops: Number of NSD read operations, aggregated to the file system.
  • gpfs_nsdpool_write_ops: Number of NSD write operations, aggregated to the file system.
GPFSNode
These metrics gives the following information for a particular node. For example: myNode|GPFSNode|gpfs_ns_bytes_read.
  • gpfs_ns_bytes_read: Number of bytes read.
  • gpfs_ns_bytes_written: Number of bytes written.
  • gpfs_ns_clusters: Number of clusters participating
  • gpfs_ns_disks: Number of disks in all mounted file systems
  • gpfs_ns_filesys: Number of mounted file systems
  • gpfs_ns_max_disk_wait_rd: The longest time spent waiting for a disk read operation.
  • gpfs_ns_max_disk_wait_wr: The longest time spent waiting for a disk write operation.
  • gpfs_ns_max_queue_wait_rd: The longest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_ns_max_queue_wait_wr: The longest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_ns_min_disk_wait_rd: The shortest time spent waiting for a disk read operation.
  • gpfs_ns_min_disk_wait_wr: The shortest time spent waiting for a disk write operation.
  • gpfs_ns_min_queue_wait_rd: The shortest time between being enqueued for a disk read operation and the completion of that operation.
  • gpfs_ns_min_queue_wait_wr: The shortest time between being enqueued for a disk write operation and the completion of that operation.
  • gpfs_ns_read_ops: Number of read operations.
  • gpfs_ns_tot_disk_wait_rd: The total time in seconds spent waiting for disk read operations.
  • gpfs_ns_tot_disk_wait_wr: The total time in seconds spent waiting for disk write operations.
  • gpfs_ns_tot_queue_wait_rd: The total time spent between being enqueued for a read operation and the completion of that operation.
  • gpfs_ns_tot_queue_wait_wr: The total time spent between being enqueued for a write operation and the completion of that operation.
  • gpfs_ns_write_ops: Number of write operations.
GPFSNodeAPI
These metrics gives the following information for a particular node from its application point of view. For example: myMachine|GPFSNodeAPI|gpfs_is_bytes_read.
  • gpfs_is_bytes_read: Number of bytes read.
  • gpfs_is_bytes_written: Number of bytes written.
  • gpfs_is_close_calls: Number of close calls.
  • gpfs_is_inodes_written: Number of inode updates to disk.
  • gpfs_is_open_calls: Number of open calls.
  • gpfs_is_readDir_calls: Number of readdir calls.
  • gpfs_is_read_calls: Number of read calls.
  • gpfs_is_write_calls: Number of write calls.
GPFSPool
For each pool in each file system: Cluster name - GPFSPool - filesystem name -pool name.
For example: myCluster|GPFSPool|myFilesystem|myPool|gpfs_pool_free_dataKBvalid*.
  • gpfs_pool_total_dataKB: Total capacity for data (in KB) in this pool.
  • gpfs_pool_free_dataKB: Free capacity for data (in KB) in this pool.
  • gpfs_pool_total_metaKB: Total capacity for metadata (in KB) in this pool.
  • gpfs_pool_free_metaKB: Free capacity for metadata (in KB) in this pool.
GPFSPoolIO
These metrics give the details about each cluster, filesystem and pool in the system, from the point of view of a specific node. For example: myMachine|GPFSPoolIO|myCluster|myFilesystem|myPool|gpfs_pool_bytes_rd
  • gpfs_pool_bytes_rd: Total size of all disks for this usage type.
  • gpfs_pool_bytes_wr: Total available disk space in full blocks for this usage type.
  • gpfs_pool_free_fragkb: Total available space in fragments for this usage type.
GPFSVFS

Some sensors, such as VFS, are not enabled by default even though they have associated predefined queries with the mmperfmon query command. This is so because the collector might display performance issues of its own if it is required to collect more than 1000000 metrics per second. To enable VFS sensors, use the mmfsadm vfsstats enable command on the node. To enable a sensor, set the period value to an integer greater than 0 and restart the sensors on that node by using the systemctl restart pmsensors command.

These metrics gives the following information about the virtual file operation statistics (count and time) for each node. For example, myMachine|GPFSVFS|gpfs_vfs_clear

  • gpfs_vfs_accesses: Number of accesses operations.
  • gpfs_vfs_accesses_t: Amount of time in seconds spent in accesses operations.
  • gpfs_vfs_aioread: Number of aioread operations.
  • gpfs_vfs_aioread_t: Amount of time in seconds spent in aioread operations.
  • gpfs_vfs_aiowrite: Number of aiowrite operations.
  • gpfs_vfs_aiowrite_t: Amount of time in seconds spent in aiowrite operations.
  • gpfs_vfs_clear: Number of clear operations.
  • gpfs_vfs_clear_t: Amount of time in seconds spent in clear operations.
  • gpfs_vfs_close: Number of close operations.
  • gpfs_vfs_close_t: Amount of time in seconds spent in close operations.
  • gpfs_vfs_create: Number of create operations.
  • gpfs_vfs_create_t: Amount of time in seconds spent in create operations.
  • gpfs_vfs_decodeFh: Number of decodeFh operations.
  • gpfs_vfs_decodeFh_t: Amount of time in seconds spent in decodeFh operations.
  • gpfs_vfs_detDentry: Number of detDentry operations.
  • gpfs_vfs_encodeFh: Number of encodeFh operations.
  • gpfs_vfs_encodeFh_t: Amount of time in seconds spent in encodeFh operations.
  • gpfs_vfs_flock: Number of flock operations.
  • gpfs_vfs_flock_t: Amount of time in seconds spent in flock operations.
  • gpfs_vfs_fsync: Number of fsync operations.
  • gpfs_vfs_fsyncRange: Number of fsyncRange operations.
  • gpfs_vfs_fsyncRange_t: Amount of time in seconds spent in fsyncRange operations.
  • gpfs_vfs_fsync_t: Amount of time in seconds spent in fsync operations.
  • gpfs_vfs_ftrunc: Number of ftrunc operations.
  • gpfs_vfs_ftrunc_t: Amount of time in seconds spent in ftrunc operations.
  • gpfs_vfs_getDentry_t: Amount of time in seconds spent in getDentry operations.
  • gpfs_vfs_getParent: Number of getParent operations.
  • gpfs_vfs_getParent_t: Amount of time in seconds spent in getParent operations.
  • gpfs_vfs_getattr: Number of getattr operations.
  • gpfs_vfs_getattr_t: Amount of time in seconds spent in getattr operations.
  • gpfs_vfs_getxattr: Number of getxattr operations.
  • gpfs_vfs_getxattr_t: Amount of time in seconds spent in getxattr operations.
  • gpfs_vfs_link: Number of link operations.
  • gpfs_vfs_link_t: Amount of time in seconds spent in link operations.
  • gpfs_vfs_listxattr: Number of listxattr operations.
  • gpfs_vfs_listxattr_t: Amount of time in seconds spent in listxattr operations.
  • gpfs_vfs_lockctl: Number of lockctl operations.
  • gpfs_vfs_lockctl_t: Amount of time in seconds spent in lockctl operations.
  • gpfs_vfs_lookup: Number of lookup operations.
  • gpfs_vfs_lookup_t: Amount of time in seconds spent in lookup operations.
  • gpfs_vfs_mapLloff: Number of mapLloff operations.
  • gpfs_vfs_mapLloff_t: Amount of time in seconds spent in mapLloff operations.
  • gpfs_vfs_mkdir: Number of mkdir operations.
  • gpfs_vfs_mkdir_t: Amount of time in seconds spent in mkdir operations.
  • gpfs_vfs_mknod: Number of mknod operations.
  • gpfs_vfs_mknod_t: Amount of time in seconds spent in mknod operations.
  • gpfs_vfs_mmapread: Number of mmapread operations.
  • gpfs_vfs_mmapread_t: Amount of time in seconds spent in mmapread operations.
  • gpfs_vfs_mmapwrite: Number of mmapwrite operations.
  • gpfs_vfs_mmapwrite_t: Amount of time in seconds spent in mmapwrite operation.
  • gpfs_vfs_mount: Number of mount operations.
  • gpfs_vfs_mount_t: Amount of time in seconds spent in mount operations.
  • gpfs_vfs_open: Number of open operations.
  • gpfs_vfs_open_t: Amount of time in seconds spent in open operations.
  • gpfs_vfs_read: Number of read operations.
  • gpfs_vfs_read_t: Amount of time in seconds spent in read operations.
  • gpfs_vfs_readdir: Number of readdir operations.
  • gpfs_vfs_readdir_t: Amount of time in seconds spent in readdir operations.
  • gpfs_vfs_readlink: Number of readlink operations.
  • gpfs_vfs_readlink_t: Amount of time in seconds spent in readlink operations
  • gpfs_vfs_readpage: Number of readpage operations.
  • gpfs_vfs_readpage_t: Amount of time in seconds spent in readpage operations.
  • gpfs_vfs_remove: Number of remove operations.
  • gpfs_vfs_remove_t: Amount of time in seconds spent in remove operations.
  • gpfs_vfs_removexattr: Number of removexattr operations.
  • gpfs_vfs_removexattr_t: Amount of time in seconds spent in removexattr operations.
  • gpfs_vfs_rename: Number of rename operations.
  • gpfs_vfs_rename_t: Amount of time in seconds spent in rename operations.
  • gpfs_vfs_rmdir: Number of rmdir operations.
  • gpfs_vfs_rmdir_t: Amount of time in seconds spent in rmdir operations.
  • gpfs_vfs_setacl: Number of setacl operations.
  • gpfs_vfs_setacl_t: Amount of time in seconds spent in setacl operations.
  • gpfs_vfs_setattr: Number of setattr operations.
  • gpfs_vfs_setattr_t: Amount of time in seconds spent in setattr operations.
  • gpfs_vfs_setxattr: Number of setxattr operations.
  • gpfs_vfs_setxattr_t: Amount of time in seconds spent in setxattr operations.
  • gpfs_vfs_statfs: Number of statfs operations.
  • gpfs_vfs_statfs_t: Amount of time in seconds spent in statfs operations.
  • gpfs_vfs_symlink: Number of symlink operations.
  • gpfs_vfs_symlink_t: Amount of time in seconds spent in symlink operations.
  • gpfs_vfs_sync: Number of sync operations.
  • gpfs_vfs_sync_t: Amount of time in seconds spent in sync operations.
  • gpfs_vfs_tsfattr: Number of tsfsattr operation.
  • gpfs_vfs_tsfattr_t: Amount of time in seconds spent in tsfattr operations.
  • gpfs_vfs_tsfsattr: Number of tsfattr operations.
  • gpfs_vfs_tsfsattr_t: Amount of time in seconds spent in tsfsattr operations.
  • gpfs_vfs_unmap: Number of unmap operations.
  • gpfs_vfs_unmap_t: Amount of time in seconds spent in unmap operations.
  • gpfs_vfs_vget: Number of vget operations.
  • gpfs_vfs_vget_t: Amount of time in seconds spent in vget operations.
  • gpfs_vfs_write: Number of write operations.
  • gpfs_vfs_write_t: Amount of time in seconds spent in write operations.
  • gpfs_vfs_writepage: Number of writepage operations.
  • gpfs_vfs_writepage_t: Amount of time in seconds spent in writepage operations.
GPFSWaiters
For each independent fileset in the file system: Node- GPFSWaiters - waiters_time_threshold (all, 0.1s, 0.2s, 0.5s, 1.0s, 30.0s, 60.0s).
Note: Here 'all' implies a waiting time greater than or equal to 0 seconds.
For example: myNode|GPFSWaiters|all|gpfs_wt_count_all.
  • gpfs_wt_count_all : Count of all threads with waiting time greater than or equal to waiters_time_threshold seconds.
  • gpfs_wt_count_local_io: Count of threads waiting for local I/O with waiting time greater than or equal to waiters_time_threshold seconds.
  • gpfs_wt_count_network_io: Count of threads waiting for network I/O with waiting time greater than or equal to waiters_time_threshold seconds.
  • gpfs_wt_count_thcond: Count of threads waiting for a GPFS condition variable to be signaled with waiting time greater than or equal to waiters_time_threshold seconds.
  • gpfs_wt_count_thmutex: Count of threads waiting to lock a GPFS mutex with waiting time greater than or equal to waiters_time_threshold seconds.
  • gpfs_wt_count_delay: Count of threads waiting for delay interval expiration with waiting time greater than or equal to waiters_time_threshold seconds.
  • gpfs_wt_count_syscall: Count of threads waiting for system call completion with waiting time greater than or equal to waiters_time_threshold seconds.
Computed Metrics
These metrics can only be used only through the mmperfmon query command. The following metrics are computed for GPFS:
  • gpfs_write_avg_lat (latency): gpfs_vfs_write_t / gpfs_vfs_write
  • gpfs_read_avg_lat (latency): gpfs_vfs_read_t / gpfs_vfs_read
  • gpfs_create_avg_lat (latency): gpfs_vfs_create_t / gpfs_vfs_create
  • gpfs_remove_avg_lat (latency): gpfs_vfs_remove_t / gpfs_vfs_remove