Host status

Host status describes the ability of a host to accept and run batch jobs in terms of daemon states, load levels, and administrative controls. The bhosts and lsload commands display host status.

bhosts

Displays the current status of the host:


STATUS     Description
ok         Host is available to accept and run new batch jobs.
unavail    Host is down, or LIM and sbatchd are unreachable.
unreach    LIM is running but sbatchd is unreachable.
closed     Host does not accept new jobs. Use bhosts -l to display the reasons.
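
The status values above can be read as a small decision procedure. The following Python sketch is illustrative only: the function name and its boolean inputs are hypothetical and are not part of LSF. It covers only the cases the table states, and it assumes that the daemon checks take precedence over the closed state, which the table does not spell out.

```python
from typing import Optional


def bhosts_status(host_up: bool, lim_reachable: bool,
                  sbatchd_reachable: bool,
                  accepts_new_jobs: bool) -> Optional[str]:
    """Return the bhosts STATUS implied by the table, or None if not covered.

    All parameters are hypothetical inputs chosen for illustration.
    """
    # unavail: host is down, or LIM and sbatchd are both unreachable.
    if not host_up or (not lim_reachable and not sbatchd_reachable):
        return "unavail"
    # unreach: LIM is running but sbatchd is unreachable.
    if lim_reachable and not sbatchd_reachable:
        return "unreach"
    # LIM unreachable while sbatchd is reachable: not described in the table.
    if not lim_reachable:
        return None
    # Both daemons reachable: ok or closed (assumed order of precedence).
    return "ok" if accepts_new_jobs else "closed"


if __name__ == "__main__":
    print(bhosts_status(True, True, False, True))  # unreach
```

Returning None for the uncovered combination keeps the sketch from guessing at behavior the table does not document.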


bhosts -l

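Displays the reasons that a host is closed (for details, see the bhosts command reference). A closed host does not accept new batch jobs. In the following example, bhosts reports hostB as closed, and bhosts -l hostB reports the detailed status closed_Adm: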

bhosts
HOST_NAME          STATUS       JL/U    MAX  NJOBS   RUN  SSUSP  USUSP   RSV
hostA              ok            -      55    2      2    0      0       0
hostB              closed        -      20    16     16   0      0       0 
...
 
bhosts -l hostB
HOST  hostB
STATUS           CPUF  JL/U    MAX  NJOBS    RUN  SSUSP  USUSP    RSV DISPATCH_WINDOW
closed_Adm      23.10     -     55      2      2      0      0      0      -
CURRENT LOAD USED FOR SCHEDULING:
             r15s   r1m  r15m    ut    pg    io   ls    it   tmp   swp   mem   slots
Total         1.0  -0.0  -0.0    4%   9.4   148    2     3 4231M  698M  233M       8
Reserved      0.0   0.0   0.0    0%   0.0     0    0     0    0M    0M    0M       8
LOAD THRESHOLD USED FOR SCHEDULING:
          r15s   r1m  r15m   ut      pg    io   ls    it    tmp    swp    mem
loadSched   -     -     -     -       -     -    -     -     -      -      - 
loadStop    -     -     -     -       -     -    -     -     -      -      - 
                 cpuspeed    bandwidth
loadSched          -            -
loadStop           -            -
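
To find closed hosts in a script, the bhosts summary output can be parsed as whitespace-separated columns. The following Python sketch is a minimal example under stated assumptions: it works on sample text copied from the output above instead of invoking bhosts, the function names are hypothetical, and it assumes that a detailed status such as closed_Adm consists of a base status and a reason suffix joined by an underscore.

```python
# Sample text copied from the bhosts summary output shown above.
BHOSTS_SAMPLE = """\
HOST_NAME          STATUS       JL/U    MAX  NJOBS   RUN  SSUSP  USUSP   RSV
hostA              ok            -      55    2      2    0      0       0
hostB              closed        -      20    16     16   0      0       0
...
"""


def parse_bhosts(text):
    """Parse bhosts summary output into a list of dicts keyed by column name."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header = lines[0].split()
    rows = []
    for line in lines[1:]:
        fields = line.split()
        if len(fields) != len(header):
            continue  # skip lines that do not match the header, such as "..."
        rows.append(dict(zip(header, fields)))
    return rows


def hosts_not_ok(rows):
    """Return (host, status) pairs for every host whose STATUS is not ok."""
    return [(r["HOST_NAME"], r["STATUS"]) for r in rows if r["STATUS"] != "ok"]


def split_status(status):
    """Split a detailed status into (base, reason); reason is None if absent.

    Assumes the reason follows the first underscore, as in closed_Adm.
    """
    base, _, reason = status.partition("_")
    return base, reason or None


if __name__ == "__main__":
    print(hosts_not_ok(parse_bhosts(BHOSTS_SAMPLE)))  # [('hostB', 'closed')]
    print(split_status("closed_Adm"))                 # ('closed', 'Adm')
```

The sketch does not interpret the reason suffix; the bhosts command reference remains the authority on what each closed reason means.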

lsload

Displays the current state of the host:


Status     Description
ok         Host is available to accept and run batch jobs and remote tasks.
-ok        LIM is running but RES is unreachable.
busy       Does not affect batch jobs; used only for remote task placement (for example, lsrun). The value of a load index exceeded a threshold (configured in lsf.cluster.cluster_name, displayed by lshosts -l). Indices that exceed thresholds are identified with an asterisk (*).
lockW      Does not affect batch jobs; used only for remote task placement (for example, lsrun). Host is locked by a run window (configured in lsf.cluster.cluster_name, displayed by lshosts -l).
lockU      Does not accept new batch jobs or remote tasks. An LSF administrator or root explicitly locked the host by using lsadmin limlock, or an exclusive batch job (bsub -x) is running on the host. Running jobs are not affected. Use lsadmin limunlock to unlock LIM on the local host.
unavail    Host is down, or LIM is unavailable.


lsload
HOST_NAME       status  r15s   r1m  r15m   ut    pg  ls    it   tmp   swp   mem
hostA               ok   0.0   0.0   0.0   4%   0.4   0  4316   10G  302M  252M
hostB               ok   1.0   0.0   0.0   4%   8.2   2    14 4231M  698M  232M
...
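
The lsload output can be processed in the same column-oriented way. The following Python sketch is a hedged illustration: it parses sample text copied from the output above, its names are hypothetical, and it records only the batch-job effects that the lsload status table states explicitly (None marks statuses for which the table says nothing about batch jobs). It treats any load value that contains an asterisk as having exceeded its threshold; the exact position of the asterisk within a field is an assumption.

```python
# Sample text copied from the lsload output shown above.
LSLOAD_SAMPLE = """\
HOST_NAME       status  r15s   r1m  r15m   ut    pg  ls    it   tmp   swp   mem
hostA               ok   0.0   0.0   0.0   4%   0.4   0  4316   10G  302M  252M
hostB               ok   1.0   0.0   0.0   4%   8.2   2    14 4231M  698M  232M
...
"""

# True: host does not accept new batch jobs. False: batch jobs are unaffected.
# None: the status table does not state the effect on batch jobs.
BLOCKS_NEW_BATCH_JOBS = {
    "ok": False,
    "busy": False,
    "lockW": False,
    "lockU": True,
    "-ok": None,
    "unavail": None,
}


def parse_lsload(text):
    """Parse lsload output into dicts with host, status, and exceeded indices."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header = lines[0].split()
    hosts = []
    for line in lines[1:]:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip lines without a host and status, such as "..."
        row = dict(zip(header, fields))  # short rows keep only leading columns
        exceeded = [name for name, value in row.items()
                    if name not in ("HOST_NAME", "status") and "*" in value]
        hosts.append({
            "host": row["HOST_NAME"],
            "status": row["status"],
            "exceeded": exceeded,
            "blocks_new_batch_jobs": BLOCKS_NEW_BATCH_JOBS.get(row["status"]),
        })
    return hosts


if __name__ == "__main__":
    for entry in parse_lsload(LSLOAD_SAMPLE):
        print(entry)
```

Because busy and lockW affect only remote task placement, a batch-oriented monitoring script built on this sketch would act only on entries where blocks_new_batch_jobs is True or None.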