Host status
Host status describes the ability of a host to accept and run batch jobs in terms of daemon states, load levels, and administrative controls. The bhosts and lsload commands display host status.
bhosts
Displays the current status of the host:
STATUS | Description |
---|---|
ok | Host is available to accept and run new batch jobs. |
unavail | Host is down, or LIM and sbatchd are unreachable. |
unreach | LIM is running but sbatchd is unreachable. |
closed | Host does not accept new jobs. Use bhosts -l to display the reasons. |
bhosts -l
Displays the reasons why a host is closed (for details, see the bhosts command reference). A closed host does not accept new batch jobs:
bhosts
HOST_NAME STATUS JL/U MAX NJOBS RUN SSUSP USUSP RSV
hostA ok - 55 2 2 0 0 0
hostB closed - 20 16 16 0 0 0
...
bhosts -l hostB
HOST hostB
STATUS CPUF JL/U MAX NJOBS RUN SSUSP USUSP RSV DISPATCH_WINDOW
closed_Adm 23.10 - 55 2 2 0 0 0 -
CURRENT LOAD USED FOR SCHEDULING:
r15s r1m r15m ut pg io ls it tmp swp mem slots
Total 1.0 -0.0 -0.0 4% 9.4 148 2 3 4231M 698M 233M 8
Reserved 0.0 0.0 0.0 0% 0.0 0 0 0 0M 0M 0M 8
LOAD THRESHOLD USED FOR SCHEDULING:
r15s r1m r15m ut pg io ls it tmp swp mem
loadSched - - - - - - - - - - -
loadStop - - - - - - - - - - -
cpuspeed bandwidth
loadSched - -
loadStop - -
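When many hosts are listed, it can help to pick out the closed ones programmatically before running bhosts -l on each. The following is a minimal Python sketch, assuming the summary output has the whitespace-separated column layout shown above. The function names parse_bhosts and closed_hosts are illustrative and not part of LSF.

```python
def parse_bhosts(text):
    """Parse bhosts summary output into a list of dicts keyed by column name."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header = lines[0].split()
    rows = []
    for line in lines[1:]:
        fields = line.split()
        if len(fields) != len(header):
            continue  # skip anything that is not a complete host row
        rows.append(dict(zip(header, fields)))
    return rows


def closed_hosts(rows):
    """Return names of hosts whose STATUS starts with 'closed'.

    startswith() covers both the plain 'closed' shown by bhosts and
    suffixed forms such as 'closed_Adm' shown by bhosts -l.
    """
    return [r["HOST_NAME"] for r in rows if r["STATUS"].startswith("closed")]


# Sample text copied from the bhosts output above.
SAMPLE = """\
HOST_NAME STATUS JL/U MAX NJOBS RUN SSUSP USUSP RSV
hostA ok - 55 2 2 0 0 0
hostB closed - 20 16 16 0 0 0
"""

rows = parse_bhosts(SAMPLE)
print(closed_hosts(rows))  # ['hostB']
```

On a live cluster, the text could instead be captured with subprocess.run(["bhosts"], capture_output=True, text=True).stdout, and each returned name passed to bhosts -l to see why that host is closed.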
lsload
Displays the current state of the host:
Status | Description |
---|---|
ok | Host is available to accept and run batch jobs and remote tasks. |
-ok | LIM is running but RES is unreachable. |
busy | Does not affect batch jobs; used only for remote task placement (such as lsrun). The value of a load index exceeded a threshold (configured in lsf.cluster.cluster_name, displayed by lshosts -l). Indices that exceed thresholds are identified with an asterisk (*). |
lockW | Does not affect batch jobs; used only for remote task placement (such as lsrun). Host is locked by a run window (configured in lsf.cluster.cluster_name, displayed by lshosts -l). |
lockU | Does not accept new batch jobs or remote tasks. An LSF administrator or root explicitly locked the host by using lsadmin limlock, or an exclusive batch job (bsub -x) is running on the host. Running jobs are not affected. Use lsadmin limunlock to unlock LIM on the local host. |
unavail | Host is down, or LIM is unavailable. |
lsload
HOST_NAME status r15s r1m r15m ut pg ls it tmp swp mem
hostA ok 0.0 0.0 0.0 4% 0.4 0 4316 10G 302M 252M
hostB ok 1.0 0.0 0.0 4% 8.2 2 14 4231M 698M 232M
...
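As the busy row of the table notes, load indices that exceed their thresholds are marked with an asterisk. The following is a minimal Python sketch for finding those indices in lsload output, assuming the whitespace-separated layout shown above. The function name parse_lsload is illustrative, and the hostC and hostD rows in the sample are hypothetical, added only to exercise the busy and unavail cases.

```python
def parse_lsload(text):
    """Parse lsload output into {host: {"status", "indices", "exceeded"}}."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    # Columns after HOST_NAME and status are the load index names.
    index_names = lines[0].split()[2:]
    hosts = {}
    for line in lines[1:]:
        fields = line.split()
        if len(fields) < 2:
            continue  # not a host row
        name, status, values = fields[0], fields[1], fields[2:]
        # zip() stops at the shorter sequence, so rows that carry no load
        # values (assumed here for unavail hosts) get an empty dict.
        indices = dict(zip(index_names, values))
        # An asterisk anywhere in a value marks an exceeded threshold.
        exceeded = [k for k, v in indices.items() if "*" in v]
        hosts[name] = {"status": status, "indices": indices, "exceeded": exceeded}
    return hosts


# hostA and hostB are copied from the output above.
# hostC and hostD are hypothetical rows for illustration only.
SAMPLE = """\
HOST_NAME status r15s r1m r15m ut pg ls it tmp swp mem
hostA ok 0.0 0.0 0.0 4% 0.4 0 4316 10G 302M 252M
hostB ok 1.0 0.0 0.0 4% 8.2 2 14 4231M 698M 232M
hostC busy *6.2 2.0 1.9 95% 4.5 1 0 500M 100M 80M
hostD unavail
"""

hosts = parse_lsload(SAMPLE)
for name, info in hosts.items():
    if info["exceeded"]:
        print(name, info["status"], info["exceeded"])  # hostC busy ['r15s']
```

The sketch only reports which indices are flagged. The thresholds themselves are configured in lsf.cluster.cluster_name and displayed by lshosts -l, as described in the table.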