bapp

Displays information about application profile configuration.

Synopsis

bapp [-alloc] [-l | -w] [application_profile ...]
bapp [-h | -V]

Description

Displays information about application profiles that are configured in the lsb.applications file.

Returns application name, task statistics, and job state statistics for all application profiles.

In the LSF multicluster capability, returns the information about all application profiles in the local cluster.

Returns job slot statistics if the -alloc option is used.

CPU time is normalized.

Options

-alloc
Shows counters for slots in RUN, SSUSP, USUSP, and RSV state. The slot allocation is different depending on whether the job is an exclusive job or not.
-w

Wide format. Fields are displayed without truncation.

-l

Long format with additional information.

Displays the following additional information about the application profile:
  • Description
  • Profile characteristics and statistics
  • Parameters
  • Resource usage limits
  • Associated commands
  • Binding policy
  • NICE value
  • Job controls
  • Pending time limits
  • Eligible pending time limits
  • APS priority factors
application_profile ...

Displays information about the specified application profile.

-h

Prints command usage to stderr and exits.

-V

Prints product release version to stderr and exits.

Default output format

Displays the following fields:

APPLICATION_NAME

The name of the application profile. Application profiles are named to correspond to the type of application that usually runs within them.

NJOBS

The total number of tasks for all jobs that are currently held in the application profile. The total includes tasks in pending, running, and suspended jobs.

If the -alloc option is used, total is the sum of the counters for RUN, SSUSP, USUSP, and RSV states.

PEND

The number of tasks for all pending jobs in the application profile. If used with the -alloc option, output is 0, because pending jobs do not have slot allocation.

RUN

The number of tasks for all running jobs in the application profile. If the -alloc option is used, the total is allocated slots for the jobs in the application profile.

SUSP

The number of tasks for all suspended jobs in the application profile. If the -alloc option is used, the total is allocated slots for the jobs in the application profile.

Long output format with the -l option

In addition to the default output fields, the -l option displays the following fields:

Description

A description of the typical use of the application profile.

PARAMETERS/STATISTICS
SSUSP

The number of tasks for all jobs in the application profile that are suspended by LSF because of load levels or run windows.

USUSP

The number of tasks for all jobs in the application profile that are suspended by the job submitter or by the LSF administrator.

RSV

The number of tasks that reserve slots for pending jobs in the application profile.

ENV_VARS

The name-value pairs that are defined by the application-specific environment variables.

Per-job resource usage limits

The soft resource usage limits that are imposed on the jobs that are associated with the application profile. These limits are imposed on a per-job and a per-process basis.

The following per-job limits are supported:
CPULIMIT

The maximum CPU time a job can use, in minutes, relative to the CPU factor of the named host. The CPULIMIT is scaled by the CPU factor of the execution host so that jobs are allowed more time on slower hosts.

MEMLIMIT

The maximum running set size (RSS) of a process.

By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).

MEMLIMIT_TYPE

A memory limit is the maximum amount of memory a job is allowed to consume. Jobs that exceed the level are killed. You can specify different types of memory limits to enforce, based on PROCESS, TASK, or JOB (or any combination of the three).

PROCESSLIMIT

The maximum number of concurrent processes that are allocated to a job.

SWAPLIMIT

The swap space limit that a job can use.

By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).

TASKLIMIT

The maximum number of tasks that are allocated to a job.

THREADLIMIT

The maximum number of concurrent threads that are allocated to a job.

Per-process resource usage limits
The following UNIX per-process resource limits are supported:
CORELIMIT

The maximum size of a core file.

By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).

DATALIMIT

The maximum size of the data segment of a process, in KB. This limit restricts the amount of memory a process can allocate.

FILELIMIT

The maximum file size a process can create, in KB.

RUNLIMIT

The maximum wall clock time a process can use, in minutes. RUNLIMIT is scaled by the CPU factor of the execution host.

STACKLIMIT

The maximum size of the stack segment of a process. This limit restricts the amount of memory a process can use for local variables or recursive function calls.

By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).

BIND_JOB
The processor binding policy for sequential and parallel job processes enabled in the application profile. Displays one of the following values: NONE, BALANCE, PACK, ANY, USER, or USER_CPU_LIST.
bapp -l app1
APPLICATION NAME: app1
 -- test processor binding options
...
PARAMETERS:
BIND_JOB: ANY

For compatibility with an earlier versions, the bapp -l command displays Y or N if the BIND_JOB parameter is defined with those values in the application profile.

CHKPNT_DIR

The checkpoint directory, if automatic checkpointing is enabled for the application profile.

CHKPNT_INITPERIOD

The initial checkpoint period in minutes. The periodic checkpoint does not happen until the initial period elapses.

CHKPNT_PERIOD

The checkpoint period in minutes. The running job is checkpointed automatically every checkpoint period.

CHKPNT_METHOD

The checkpoint method.

MIG

The migration threshold in minutes. A value of 0 (zero) specifies that a suspended job is to be migrated immediately.

Where a host migration threshold is also specified, and is lower than the job value, the host value is used.

PRE_EXEC

The job-based pre-execution command for the application profile. The PRE_EXEC command runs on the execution host before the job that is associated with the application profile is dispatched to the execution host (or to the first host selected for a parallel batch job).

POST_EXEC

The job-based post-execution command for the application profile. The POST_EXEC command runs on the execution host after the job finishes.

HOST_PRE_EXEC

The host-based pre-execution command for the application profile. The HOST_PRE_EXEC command runs on all execution hosts before the job that is associated with the application profile is dispatched to the execution hosts. If a job-based pre-execution PRE_EXEC command was defined at the queue-level/application-level/job-level, the HOST_PRE_EXEC command runs before the PRE_EXEC command of any level. The host-based pre-execution command cannot be run on Windows systems.

HOST_POST_EXEC

The host-based post-execution command for the application profile. The HOST_POST_EXEC command runs on the execution hosts after the job finishes. If a job-based post-execution POST_EXEC command was defined at the queue-level/application-level/job-level, the HOST_POST_EXEC command runs after the POST_EXEC command of any level. The host-based post-execution command cannot be run on Windows systems.

LOCAL_MAX_PREEXEC_RETRY_ACTION

The action to take on a job when the number of times to attempt its pre-execution command on the local cluster (the value of the LOCAL_MAX_PREEXEC_RETRY parameter) is reached.

JOB_INCLUDE_POSTPROC

If the JOB_INCLUDE_POSTPROC=Y parameter is defined, post-execution processing of the job is included as part of the job.

JOB_POSTPROC_TIMEOUT

Timeout in minutes for job post-execution processing. If post-execution processing takes longer than the timeout, the sbatchd daemon reports that post-execution failed (POST_ERR status). On UNIX, it kills the process group of the job's post-execution processes. On Windows, only the parent process of the pre-execution command is killed when the timeout expires, the child processes of the pre-execution command are not killed.

REQUEUE_EXIT_VALUES

Jobs that exit with these values are automatically requeued.

RES_REQ

Resource requirements of the application profile. Only the hosts that satisfy these resource requirements can be used by the application profile.

JOB_STARTER

An executable file that runs immediately before the batch job, taking the batch job file as an input argument. All jobs that are submitted to the application profile are run through the job starter, which is used to create a specific execution environment before LSF processes the jobs themselves.

RERUNNABLE

If the RERUNNABLE field displays yes, jobs in the application profile are automatically restarted or rerun if the execution host becomes unavailable. However, a job in the application profile is not restarted if you use bmod to remove the rerunnable option from the job.

RESUME_CONTROL

The configured actions for the resume job control.

The configured actions are displayed in the format [action_type, command] where action_type is RESUME.

SUSPEND_CONTROL

The configured actions for the suspend job control.

The configured actions are displayed in the format [action_type, command] where action_type is SUSPEND.

TERMINATE_CONTROL

The configured actions for terminate job control.

The configured actions are displayed in the format [action_type, command] where action_type is TERMINATE.

NO_PREEMPT_INTERVAL

The configured uninterrupted running time (minutes) that must pass before a preemptable job can be preempted.

MAX_TOTAL_TIME_PREEMPT

The configured maximum total preemption time (minutes) after which preemption cannot take place.

NICE

The relative scheduling priority at which jobs from the application run.

Current working directory (CWD) information
JOB_CWD

The current working directory for the job in the application profile. The path can be absolute or relative to the submission directory, and includes dynamic patterns.

JOB_CWD_TTL

The time to live for the current working directory for a job. LSF cleans the created CWD after a job finishes based on the TTL value.

JOB_SIZE_LIST

A list of job sizes (number of tasks) allowed on this application, including the default job size that is assigned if the job submission does not request a job size. Configured in the lsb.applications file.

PEND_TIME_LIMIT

The pending time limit for a job in the application profile. If a job remains pending for longer than this specified time limit, IBM Spectrum LSF RTM (LSF RTM) triggers the alarm and other actions. Configured in the lsb.applications file.

ELIGIBLE_PEND_TIME_LIMIT

The eligible pending time limit for a job in the application profile. If a job remains in an eligible pending state for longer than this specified time limit, IBM Spectrum LSF RTM (LSF RTM) triggers the alarm and other actions. Configured in the lsb.applications file.

PRIORITY

The APS priority factor for the application profile. PRIORITY is defined by the PRIORITY parameter in the configuration file lsb.applications.

See also

lsb.applications, lsb.queues, bsub, bjobs, badmin, mbatchd