Viewing LSF cluster information

Use the lsid, badmin, bparams, and lsclusters commands to find information about the LSF cluster.

Procedure

Cluster information includes the cluster management host, cluster name, cluster resource definitions, cluster administrator, and other details.

Table 1. LSF commands to view cluster information
View Command
Version of LSF lsid
Cluster name lsid
Current management host lsid
Cluster administrators lsclusters
Configuration parameters bparams
LSF system runtime information badmin showstatus

Viewing LSF version, cluster name, and current management host

Use the lsid command to display the version of LSF, the name of your cluster, and the current management host.

Procedure

The lsid command displays cluster version management host information.
lsid
LSF 10.1.0, Jan 5 2016 
© Copyright IBM Corp. 1992, 2022. 
US Governmant Users Restricted Rights - Use, duplication or disclosure restricted
  by GSA ADP Schedule Contract with IBM Corp.
My cluster name is lsf10 
My master name is hosta.company.com

Viewing cluster administrators

Use the lsclusters command to find out who your cluster administrator is and see a summary of your cluster.

Procedure

The lsclusters command summarizes current cluster status:
lsclusters
CLUSTER_NAME   STATUS   MASTER_HOST    ADMIN        HOSTS     SERVERS
cluster1       ok       hostA          lsfadmin     6         6

If you are using the IBM® Spectrum LSF multicluster capability, you can see one line for each of the clusters that your local cluster is connected to in the output of the lsclusters command.

Viewing configuration parameters

Use the bparams command to display the generic configuration parameters of LSF.

Procedure

  1. The bparams command shows default queues, job dispatch interval, job checking interval, and job acceptance interval.
    bparams
    Default Queues:  normal idle
    MBD_SLEEP_TIME used for calculations: 20 seconds
    Job Checking Interval:  15 seconds
    Job Accepting Interval:  20 seconds
    
  2. Use the bparams -l command to display the information in long format, which gives a brief description of each parameter and the name of the parameter as it appears in the lsb.params file.
    bparams -l
    System default queues for automatic queue selection:
        DEFAULT_QUEUE = normal idle
    Amount of time in seconds used for calculating parameter values:
        MBD_SLEEP_TIME = 20 (seconds)
    The interval for checking jobs by server batch daemon:
        SBD_SLEEP_TIME = 15 (seconds)
    The interval for a host to accept two batch jobs subsequently:
        JOB_ACCEPT_INTERVAL = 1 (* MBD_SLEEP_TIME)
    The idle time of a host for resuming pg suspended jobs:
        PG_SUSP_IT = 180 (seconds)
    The amount of time during which finished jobs are kept in core:
        CLEAN_PERIOD = 3600 (seconds)
    The maximum number of finished jobs that are logged in current event file:
        MAX_JOB_NUM = 2000
    The maximum number of retries for reaching a server batch daemon:
        MAX_SBD_FAIL = 3
    The number of hours of resource consumption history:
        HIST_HOURS = 5
    The default project assigned to jobs.
        DEFAULT_PROJECT = default
    Sync up host status with management host LIM is enabled:
    LSB_SYNC_HOST_STAT_LIM = Y
    MBD child query processes will only run on the following CPUs:
    MBD_QUERY_CPUS=1 2 3
    
  3. Use the bparams -a command to display all configuration parameters and their values in the lsb.params file.
    For example,
    bparams -a
         MBD_SLEEP_TIME = 20
         SBD_SLEEP_TIME = 15
         JOB_ACCEPT_INTERVAL = 1
         SUB_TRY_INTERVAL = 60
         LSB_SYNC_HOST_STAT_LIM =  N
         MAX_JOBINFO_QUERY_PERIOD = 2147483647
         PEND_REASON_UPDATE_INTERVAL = 30
         ...
    

Viewing daemon parameter configuration

Use the badmin showconf mbd command and the lsadmin showconf command to show current cluster configuration settings.

Before you begin

Log on to a server host.

Procedure

  1. Display all configuration settings for running LSF daemons.
    • Use the lsadmin showconf command to display all configured parameters and their values in the lsf.conf or ego.conf file for LIM.
    • Use the badmin showconf mbd command or the badmin showconf sbd command to display all configured parameters and their values in the lsf.conf or ego.conf file for the mbatchd and sbatchd daemons.

    In IBM Spectrum LSF multicluster capability, the parameters apply to the local cluster only.

  2. Display mbatchd and root sbatchd daemon configuration.
    • Use the badmin showconf mbd command to display the parameters that are configured in the lsf.conf or ego.conf file that apply to the mbatchd daemon.
    • Use the badmin showconf sbd command to display the parameters that are configured in the lsf.conf or ego.conf file that apply to the root sbatchd daemon.

Example

  • Run the badmin showconf mbd command to show the mbatchd daemon configuration:
    badmin showconf mbd
    MBD configuration at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    
  • Run the badmin showconf sbd host_name command to show the sbatchd daemon configuration on a specific host:
    badmin showconf sbd hosta
    SBD configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF__DAEMON_CONTROL=N
         ...
    
  • Run the badmin showconf sbd all command to show the sbatchd daemon configuration for all hosts:
    badmin showconf sbd all
    SBD configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    SBD configuration for host <hostb> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    
  • Run the lsadmin showconf lim command to show the lim daemon configuration:
    lsadmin showconf lim
    LIM configuration at Fri Jun 8 10:27:52 CST 2010
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    
  • Run the lsadmin showconf lim host_name command to show the lim daemon configuration for a specific host:
    lsadmin showconf lim hosta
    LIM configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    
  • Run the lsadmin showconf lim all command to show the lim daemon configuration for all hosts:
    lsadmin showconf lim all
    LIM configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    LIM configuration for host <hostb> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/dev/lsf/user1/0604/work
         LSF_CONFDIR=/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    

Viewing runtime cluster summary information

Use the badmin showstatus command to display a summary of the current LSF runtime information.

Procedure

The badmin showstatus command displays information about hosts, jobs, users, user groups, and the mbatchd daemon startup and reconfiguration:
% badmin showstatus

LSF runtime mbatchd information
    Available local hosts (current/peak):
        Clients:                0/0
        Servers:                8/8
           CPUs:                14/14
           Cores:               50/50
           Slots:               50/50

    Number of servers:          8
        Ok:                     8
        Closed:                 0
        Unreachable:            0
        Unavailable:            0

    Number of jobs:             7
        Running:                0
        Suspended:              0
        Pending:                0
        Finished:               7

    Number of users:            3
    Number of user groups:      1
    Number of active users:     0

    Latest mbatchd start:       Thu Nov 22 21:17:01 2012
    Active mbatchd PID:         26283
 
    Latest mbatchd reconfig:    Thu Nov 22 21:18:06 2012
 
    mbatchd restart information
        New mbatchd started:    Thu Nov 22 21:18:21 2012
        New mbatchd PID:        27474