lsmon

Displays load information for LSF hosts and periodically updates the display.

Synopsis

lsmon [-N | -E] [-n num_hosts] [-R res_req] [-I index_list] [-i interval] [-L file_name] [host_name ...]
lsmon [-h | -V]

Description

Note: This command is deprecated and might be removed in a future version of LSF.

The lsmon command is a full-screen LSF monitoring utility that displays and updates load information for hosts in a cluster.

By default, displays load information for all hosts in the cluster, up to the number of lines that fit on-screen.

By default, displays raw load indices.

By default, load information is sorted according to CPU and paging load.

By default, load information is updated every 10 seconds.

Options

-N
Displays normalized CPU run queue length load indices.
-E
Displays effective CPU run queue length load indices. Options -N and -E are mutually exclusive.
-n num_hosts
Displays only load information for the requested number of hosts. Information for up to num_hosts hosts that best satisfy resource requirements is displayed.
-R res_req
Displays only load information for hosts that satisfy the specified resource requirements.

Load information for the hosts is sorted according to load on the specified resources.

If resource requirement res_req contains special resource names, only load information for hosts that provide these resources is displayed. Use the lshosts command to find out what resources are available on each host.

If one or more host names are specified, only load information for the hosts that satisfy the resource requirements is displayed.

-I index_list
Displays only load information for the specified load indices. Load index names must be separated by a colon (for example, r1m:pg:ut).

If the index list index_list is too long to fit in the screen, the output is truncated. For example, if the invoker's screen is 80 characters wide, then up to 10 load indices are displayed.

-i interval
Sets how often load information is updated on-screen, in seconds.
-L file_name
Saves load information in the specified file while it is displayed on-screen.

If you do not want load information to be displayed on your screen at the same time, use lsmon -L file_name < /dev/null. The format of the file is described in the lim.acct file reference.

host_name ...
Displays only load information for the specified hosts.
-h
Prints command usage to stderr and exits.
-V
Prints LSF release version to stderr and exits.

Usage

You can use the following commands while lsmon is running:

[^L | i | n | N | E | R | q]

^L
Refreshes the screen.
i
Prompts you to input a new update interval.
n
Prompts you to input a new number of hosts to display.
N
Toggles between displaying raw CPU run queue length load indices and normalized CPU run queue length load indices.
E
Toggles between displaying raw CPU run queue length load indices and effective CPU run queue length load indices.
R
Prompts you to input new resource requirements.
q
Quits lsmon.

Output

The following fields are displayed by default.
HOST_NAME
Name of specified hosts for which load information is displayed, or if resource requirements were specified, name of hosts that satisfied the specified resource requirement and for which load information is displayed.
status
Status of the host. A minus sign (-) can precede the status, indicating that the Remote Execution Server (RES) on the host is not running.
The following statuses are supported:
ok
The host is in normal load-sharing state and can accept remote jobs.
busy
The host is overloaded because some load indices exceed configured thresholds. Load index values that caused the host to be busy are preceded by an asterisk (*. Built-in load indices include r15s, r1m, r15m, ut, pg, io, ls, it, swp, mem, and tmp. External load indices are configured in the file lsf.cluster.cluster_name.
lockW
The host is locked by its run window. Run windows for a host are specified in the lsf.conf file and can be displayed by the lshosts command. A locked host does not accept load shared jobs from other hosts.
lockU
The host is locked by the LSF administrator or root.
unavail
The host is down or the Load Information Manager (LIM) on the host is not running.
unlicensed
The host does not have a valid LSF license.
r15s
The 15 second exponentially averaged CPU run queue length.
r1m
The 1 minute exponentially averaged CPU run queue length.
r15m
The 15 minute exponentially averaged CPU run queue length.
ut
The CPU utilization exponentially averaged over the last minute, 0 - 1.
pg
The memory paging rate exponentially averaged over the last minute, in pages per second.
ls
The number of current login users.
it
On UNIX, the idle time of the host (the keyboard was not touched on all logged in sessions), in minutes.

On Windows, the it index is based on the time a screen saver is active on a particular host.

tmp
The amount of free space in /tmp, in MBytes.
swp
The amount of currently available swap space, in MB.
mem
The amount of currently available memory, in MB.

Diagnostics

Specifying an incorrect resource requirement string with the R option while the lsmon command is running causes the lsmon command to exit with an error message.

lsmon exits if it does not receive a reply from LIM within the update interval.

See also

lshosts, lsinfo, lsload, lslockhost, lim.acct, ls_load