LSF daemons and processes

Multiple LSF processes run on each host in the cluster. The type and number of running processes depend on whether the host is a management host or a compute host.

Management host daemon processes

LSF hosts run various daemon processes, depending on their role in the cluster.


Daemon     Role
mbatchd    Job requests and dispatch
mbschd     Job scheduling
sbatchd    Job execution
res        Job execution
lim        Host information
pim        Job process information
elim       Dynamic load indexes
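As a quick reference, the table can be expressed as a small lookup that also records which daemons to expect on each kind of host. This is only a sketch: the names MANAGEMENT_ONLY, EVERY_SERVER_HOST, and expected_daemons are hypothetical, and the grouping follows the daemon descriptions in this section (elim is left out because it is an optional, site-defined executable file).

```python
# Daemon names and roles as listed in the table above.
DAEMON_ROLES = {
    "mbatchd": "Job requests and dispatch",
    "mbschd": "Job scheduling",
    "sbatchd": "Job execution",
    "res": "Job execution",
    "lim": "Host information",
    "pim": "Job process information",
    "elim": "Dynamic load indexes",
}

# Hypothetical grouping, based on the descriptions in this section:
# the management host is also a server host, so it runs both groups.
MANAGEMENT_ONLY = ("mbatchd", "mbschd")
EVERY_SERVER_HOST = ("sbatchd", "res", "lim", "pim")


def expected_daemons(is_management_host):
    """Return the daemons expected on a host of the given kind."""
    daemons = list(EVERY_SERVER_HOST)
    if is_management_host:
        daemons = list(MANAGEMENT_ONLY) + daemons
    return daemons


if __name__ == "__main__":
    for name in expected_daemons(is_management_host=True):
        print(name, "-", DAEMON_ROLES[name])
```

Comparing this list with the processes that are actually running on a host is one simple way to spot a daemon that failed to start.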

mbatchd

Management batch daemon that runs on the management host. Responsible for the overall state of jobs in the system.

Receives job submission and information query requests. Manages jobs that are held in queues. Dispatches jobs to hosts as determined by mbschd.

mbschd

Management batch scheduler daemon that runs on the management host. Works with mbatchd.

Makes scheduling decisions based on job requirements, policies, and resource availability. Sends scheduling decisions to mbatchd.

sbatchd

Server batch daemon that runs on each server host, including the management host. Receives the request to run the job from mbatchd and manages local execution of the job. Responsible for enforcing local policies and maintaining the state of jobs on the host.

sbatchd forks a child sbatchd for every job. The child sbatchd runs an instance of res to create the execution environment in which the job runs. The child sbatchd exits when the job is complete.
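The one-child-per-job pattern can be illustrated with a loose analogy in Python. This is not how sbatchd is implemented: it only shows a parent process that starts a separate child for each job and collects each child's exit code when the job completes. The function name run_jobs and the job IDs are hypothetical.

```python
import subprocess
import sys


def run_jobs(jobs):
    """Start one child process per job and wait for each child to exit.

    jobs maps a (hypothetical) job ID to the argv list of the job command.
    Returns a dict that maps each job ID to the exit code of its child.
    """
    # Start every child first so that the jobs run concurrently.
    children = {job_id: subprocess.Popen(argv) for job_id, argv in jobs.items()}
    # Each child exits when its job is complete; wait() returns its exit code.
    return {job_id: child.wait() for job_id, child in children.items()}


if __name__ == "__main__":
    demo_jobs = {
        101: [sys.executable, "-c", "pass"],
        102: [sys.executable, "-c", "raise SystemExit(3)"],
    }
    print(run_jobs(demo_jobs))
```

Keeping one child per job means that a failing job affects only its own child process, while the parent continues to track the remaining jobs.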

res

Remote execution server (RES) running on each server host. Accepts remote execution requests to provide transparent and secure remote execution of jobs and tasks.

lim

Load information manager (LIM) running on each server host. Collects host load and configuration information and forwards it to the parent LIM running on the management host. Reports the information that is displayed by lsload and lshosts.

Static indexes are reported when the LIM starts up or when the number of CPUs (ncpus) changes.

Parent LIM

The LIM running on the management host. Receives load information from the LIMs running on hosts in the cluster.

Forwards load information to mbatchd, which forwards this information to mbschd to support scheduling decisions. If the management host LIM becomes unavailable, a LIM on a management candidate automatically takes over.
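The takeover behavior can be pictured with a minimal sketch. It assumes that management candidates are tried in a configured order and that the first available one takes over; that ordering is an assumption made for illustration and is not stated in this section. The function name and host names are hypothetical.

```python
def pick_management_host(candidates, available):
    """Return the first candidate that is currently available, or None.

    candidates: host names in the assumed order of preference.
    available:  set of host names whose LIM is currently reachable.
    """
    for host in candidates:
        if host in available:
            return host
    return None


if __name__ == "__main__":
    candidates = ["hostA", "hostB", "hostC"]  # hypothetical host names
    # hostA is down, so the next available candidate is chosen.
    print(pick_management_host(candidates, {"hostB", "hostC"}))
```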

PIM

Process information manager (PIM) running on each server host. Started by LIM, which periodically checks on PIM and restarts it if it dies.

Collects information about the job processes that run on the host, such as the CPU and memory that each job uses, and reports the information to sbatchd.
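The kind of roll-up this involves can be sketched as follows: per-process samples are summed into per-job totals before they are reported. The data layout (job ID, CPU seconds, memory in KB) and the function name are hypothetical and chosen only for illustration; they do not describe the actual format that PIM uses.

```python
from collections import defaultdict


def summarize_job_usage(process_samples):
    """Sum per-process usage into per-job totals.

    process_samples: iterable of (job_id, cpu_seconds, mem_kb) tuples,
    one tuple per process that belongs to a job.
    """
    totals = defaultdict(lambda: {"cpu_seconds": 0.0, "mem_kb": 0})
    for job_id, cpu_seconds, mem_kb in process_samples:
        totals[job_id]["cpu_seconds"] += cpu_seconds
        totals[job_id]["mem_kb"] += mem_kb
    return dict(totals)


if __name__ == "__main__":
    samples = [
        (101, 1.5, 2048),  # two processes that belong to job 101
        (101, 0.5, 1024),
        (102, 3.0, 4096),  # one process that belongs to job 102
    ]
    print(summarize_job_usage(samples))
```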

ELIM

External LIM (ELIM) is a site-definable executable file that collects and tracks custom dynamic load indexes. An ELIM can be a shell script or a compiled binary program that returns the values of the dynamic resources you define. The ELIM executable file must be named elim.anything, where anything is a name that you choose, and must be located in LSF_SERVERDIR.
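A minimal ELIM might look like the following sketch, written here as an executable Python script instead of a shell script. It assumes the reporting convention in which an ELIM writes one line per update to standard output: the number of indexes, followed by name and value pairs. Check that convention against the ELIM documentation for your LSF version. The index name scratch_free_gb, the 30-second interval, and the function names are hypothetical, and any index that is reported must also be defined as a resource in the cluster configuration.

```python
#!/usr/bin/env python3
import shutil
import sys
import tempfile
import time


def collect_indexes():
    """Return the current value of each custom index (names are hypothetical)."""
    free_bytes = shutil.disk_usage(tempfile.gettempdir()).free
    return {"scratch_free_gb": round(free_bytes / 1024**3, 1)}


def format_report(indexes):
    """Build one report line: the index count, then name and value pairs."""
    fields = [str(len(indexes))]
    for name, value in indexes.items():
        fields += [name, str(value)]
    return " ".join(fields)


def run(interval_seconds=30, iterations=None, stream=sys.stdout):
    """Write a report every interval; iterations=None means run until stopped."""
    count = 0
    while iterations is None or count < iterations:
        stream.write(format_report(collect_indexes()) + "\n")
        stream.flush()  # deliver each report immediately instead of buffering
        count += 1
        if iterations is None or count < iterations:
            time.sleep(interval_seconds)
```

To use the sketch as an ELIM, add a call to run() at the end of the file, save it under a name such as elim.scratch (a hypothetical name) in LSF_SERVERDIR, and make it executable. Flushing after every line matters because a buffered report would not reach the reader until the buffer fills.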