Improving GPU utilization with resource bin packing
In Kubernetes, resource bin packing is the process of scheduling pods onto nodes to maximize your resource utilization and minimize the number of nodes required. This is especially useful when you have a limited number of nodes that can support specific workloads, such as GPU-based workloads.
If you plan to install services that require GPUs, you can enable bin packing through the node scoring configuration of the scheduling service.
- Who needs to complete this task?
  A cluster administrator must complete this task.
- When do you need to complete this task?
  This task applies only if you plan to install services that require GPU.
Before you begin
To use node scoring, the scheduling service must be installed. To check whether it is installed, run the following command:
oc get scheduling -A
- If the scheduling service is installed, the command returns information about the project where the scheduling service is installed and the version that is installed.
- If the scheduling service is not installed, the command returns an empty response.
To install the scheduling service, see Installing shared cluster components for IBM Software Hub.
Ensure that you source the environment variables before you run the commands in this task.
About this task
Use the MoreGPURequest setting to enable bin packing. When you enable bin packing, the scheduling service schedules GPU-based pods on the fewest GPU nodes possible.
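To illustrate the idea, the following Python sketch simulates one simple bin packing policy. It is not the scheduling service's implementation. It assumes that scoring favors the node with the most GPUs already requested that can still fit the pod, which is what the setting name suggests. The Node class, the function names, the node names, and the GPU counts are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical GPU node: total GPUs and GPUs already requested by pods."""
    name: str
    gpu_capacity: int
    gpu_requested: int = 0

    def free(self) -> int:
        return self.gpu_capacity - self.gpu_requested


def pick_node_bin_packing(nodes, pod_gpus):
    """Return the feasible node with the most GPUs already requested.

    Assumption: this mimics a "more GPU requested scores higher" policy.
    Returns None if no node has enough free GPUs.
    """
    feasible = [n for n in nodes if n.free() >= pod_gpus]
    if not feasible:
        return None
    # max() returns the first maximal element, so ties go to the earlier node.
    return max(feasible, key=lambda n: n.gpu_requested)


def schedule(nodes, pod_requests):
    """Place each pod in order and record which node it landed on."""
    placements = {}
    for pod, gpus in pod_requests.items():
        node = pick_node_bin_packing(nodes, gpus)
        if node is None:
            placements[pod] = None  # pod stays unscheduled
            continue
        node.gpu_requested += gpus
        placements[pod] = node.name
    return placements


# Three hypothetical nodes with 4 GPUs each, and four pods requesting 6 GPUs in total.
nodes = [Node("gpu-node-a", 4), Node("gpu-node-b", 4), Node("gpu-node-c", 4)]
pods = {"pod-1": 1, "pod-2": 2, "pod-3": 1, "pod-4": 2}

placements = schedule(nodes, pods)
used_nodes = {name for name in placements.values() if name is not None}

print(placements)
print(f"GPU nodes in use: {len(used_nodes)}")
```

In this sketch, the first three pods fill gpu-node-a and the fourth pod goes to gpu-node-b, so gpu-node-c stays free of GPU pods. A policy that spreads pods evenly would instead place pods on all three nodes.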