-gpu
Shows information on GPU job allocation when used with bjobs -l or bjobs -UF. The reported fields are listed under Description.
Categories
filter
Synopsis
bjobs -l | -UF [-gpu]
Conflicting options
Use only with the -l or -UF option.
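As a minimal sketch of the rule above, the following Python helper builds a bjobs command line that always pairs -gpu with either -l or -UF. The function names, the `unfolded` parameter, and the job ID 1234 are hypothetical and used only for illustration. Running the query itself assumes an LSF environment where bjobs is on the PATH.

```python
import subprocess


def build_bjobs_gpu_command(job_id, unfolded=False):
    """Return the argument list for a bjobs GPU allocation query.

    -gpu is only valid together with -l or -UF, so one of the two
    is always included. The helper name and signature are illustrative.
    """
    detail_flag = "-UF" if unfolded else "-l"
    return ["bjobs", detail_flag, "-gpu", str(job_id)]


def query_gpu_allocation(job_id, unfolded=False):
    """Run bjobs and return its raw text output.

    Assumes an LSF environment; raises CalledProcessError if bjobs fails.
    """
    result = subprocess.run(
        build_bjobs_gpu_command(job_id, unfolded),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

For a hypothetical job 1234, `build_bjobs_gpu_command(1234)` yields the arguments for `bjobs -l -gpu 1234`, and passing `unfolded=True` switches the detail option to -UF.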
Description
- Host Name: The name of the host.
- GPU IDs on the host: Each GPU is shown on a separate line.
- TASK and ID: The list of job tasks and IDs that use the GPU (comma-separated if the GPU is used by multiple tasks).
- MODEL: The GPU brand name and model type name.
- MTOTAL: The total GPU memory size.
- GPU Compute Capability
- MRSV: The GPU memory reserved by the job.
- SOCKET: The ID of the socket where the GPU is located.
- NVLINK: Indicates whether the GPU has NVLink connections with other GPUs allocated for the job (ranked by GPU ID and including itself). The connection flag of each GPU is a character separated by “/” with the next GPU:
If the job exited abnormally due to a GPU-related error or warning, the error or warning message displays. If LSF could not get GPU usage information from DCGM, a hyphen (-) displays.
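The NVLINK field described above holds one connection flag per allocated GPU, ranked by GPU ID and separated by "/". The following Python sketch shows one way to pair each flag with its GPU ID. The function name and the sample field value are hypothetical, and the sketch does not interpret what each flag character means.

```python
def map_nvlink_flags(nvlink_field, gpu_ids):
    """Pair each "/"-separated connection flag with a GPU ID.

    The flags are ranked by GPU ID and include the GPU itself, so the
    allocated IDs are sorted before pairing. Illustrative helper only.
    """
    flags = nvlink_field.strip().split("/")
    ranked_ids = sorted(gpu_ids)
    if len(flags) != len(ranked_ids):
        raise ValueError(
            f"expected {len(ranked_ids)} flags, got {len(flags)}"
        )
    return dict(zip(ranked_ids, flags))


# Hypothetical field value for a job that was allocated GPUs 0, 1, and 2.
connections = map_nvlink_flags("a/b/c", [2, 0, 1])
```

The placeholder characters a, b, and c stand in for real flag values. Sorting the IDs first keeps the mapping correct even when the allocated GPUs are supplied out of order.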