Enforcing IBM® Spectrum LSF job memory and swap with Linux cgroups
Enable cgroup memory and swap enforcement in LSF.
About this task
Linux control groups (cgroups) limit, account, and isolate resource usage (CPU, memory, swap space, disk I/O, and other resources) of process groups by aggregating and partitioning sets of tasks and all their future children, into hierarchical groups with specialized behavior. cgroup support is a feature of the Linux kernel since kernel version 2.6.24.
All LSF job processes are controlled by the Linux cgroup system. If job processes on a host use more memory than the defined limit, the job will be immediately killed by the Linux cgroup memory sub-system.
Since Linux kernel 2.6.34, an eventfd-based generic API notification about changing status of a cgroup was introduced. With eventfd, LSF is able to be notified when job processes used more memory than the limit, then LSF will kill all processes of the job and provide specific termination reason that will be written into the LSF job accounting file and be displayed through bjobs –l.
The following steps enable cgroup memory and swap enforcement in LSF.