About resource preemption
Preemptive scheduling and resource preemption
Resource preemption is a special type of preemptive scheduling. It is similar to job slot preemption.
Job slot preemption and resource preemption
If you enable preemptive scheduling, job slot preemption is always enabled. Resource preemption is optional. With resource preemption, you can configure preemptive scheduling that is based on other resources in addition to job slots.
- Other Resources
Resource preemption works for any custom shared numeric resource (except increasing dynamic resources). To preempt on a host-based resource, such as memory, you could configure a custom resource "shared" on only one host.
Multiple resource preemption
If multiple resources are required, LSF can preempt multiple jobs until sufficient resources are available. For example, one or more jobs might be preempted for a job that needs:
Multiple job slots
Multiple resources, such as a job slots and memory
More of a resource than can be obtained by preempting just one job
Use resource preemption
To allow your job to participate in resource preemption, you must use resource reservation to reserve the preemption resource (the cluster might be configured so that this occurs automatically). For dynamic resources, you must specify a duration also.
Resource reservation is part of resource requirement, which can be specified at the job level or at the queue level or application level.
You can use a task file to associate specific resource requirements with specific applications.
Dynamic resources
- Specify duration
If the preemption resource is dynamic, you must specify the duration part of the resource reservation string when you submit a preempting or preemptable job.
- Resources outside the control of LSF
If an ELIM is needed to determine the value of a dynamic resource, LSF preempts jobs as necessary, then waits for ELIM to report that the resources are available before starting the high-priority job. By default, LSF waits 300 seconds (5 minutes) for resources to become available. This time can be increased (PREEMPTION_WAIT_TIME in lsb.params).
If the preempted jobs do not release the resources, or the resources have been intercepted by a non-LSF user, the ELIM does not report any more of the resource becoming available, and LSF might preempt more jobs to get the resources.