Enabling event-driven automatic scaling on GPUs for an instance of IBM Software Hub
If you install the Red Hat® OpenShift® Custom Metrics Autoscaler, you can enable event-driven automatic scaling based on inferencing requests. To enable event-driven scaling, you must configure the custom metrics autoscaler that is running in the instance project to query OpenShift Container Platform metrics.
- Installation phase
-
Setting up a client workstation
Setting up a cluster
Collecting required information
Preparing to run installs in a restricted network
Preparing to run installs from a private container registry
Preparing the cluster for IBM Software Hub
Preparing to install an instance of IBM Software Hub
Installing an instance of IBM Software Hub
Setting up the control plane
Installing solutions and services
- Who needs to complete this task?
-
Cluster administrator A cluster administrator must complete this task.
- When do you need to complete this task?
-
This task is optional. Complete this task if the following statements are true:
- You want to enable event-driven scaling on GPU for this instance of IBM Software Hub.
- You plan to install one or more of the following serviced in this instance of IBM Software
Hub:
- IBM Knowledge Catalog Premium *
- IBM Knowledge Catalog Standard *
- Watson Speech services *
- watsonx.ai™
- watsonx Assistant *
- Watsonx BI
- watsonx Code Assistant™
- watsonx Code Assistant for Red Hat Ansible® Lightspeed
- watsonx Code Assistant for Z Agentic
- watsonx Code Assistant for Z Understand
- watsonx.data™ Premium
- watsonx.data integration *
- watsonx.data intelligence *
- watsonx™ Orchestrate *
An asterisk (*) indicates that the service uses Inference foundation models in some situations.
Repeat as needed Repeat this task for each instance of IBM Software Hub where the preceding statements are true.
Before you begin
If you want to enable event-driven scaling based on inference requests, you must install the Red Hat OpenShift Custom Metrics Autoscaler.
Ensure that you source the environment variables before you run the commands in this task.
About this task
- Create a service account
- Create a role
- Bind the service account to the role
- Create a trigger authentication for the service account token
Procedure
To configure the custom metrics autoscaler to use OpenShift Container Platform metrics: