Exclusive job re-queuing

You can configure automatic job re-queue so that a failed job is not rerun on the same host.

Limitations

  • If mbatchd is restarted, this feature might not work properly, since LSF forgets which hosts have been excluded. If a job ran on a host and exited with an exclusive exit code before mbatchd was restarted, the job could be dispatched to the same host again after mbatchd is restarted.
  • Exclusive job re-queue does not work for multicluster jobs or parallel jobs
  • A job terminated by a signal is not re-queued