Automatically scaling resources for Assistant Builder in watsonx Orchestrate

watsonx™ Orchestrate supports auto scaling, which automatically adjusts resources based on workload demand. You can enable or disable auto scaling by using the oc patch command.

Note: Auto scaling is supported only for Assistant Builder services. Other watsonx Orchestrate services do not support auto scaling.

Enable auto scaling

To enable auto scaling for your watsonx Orchestrate instance, run:
oc patch wo wo --namespace=${PROJECT_CPD_INST_OPERANDS} --type='merge' -p '{"spec":{"autoScaleConfig":true}}'
This command sets the autoScaleConfig field to true in the watsonx Orchestrate custom resource, which allows the Assistant Builder services to scale automatically according to system utilization.

Disable auto scaling

To disable auto scaling and return to fixed or manual scaling, run:
oc patch wo wo --namespace=${PROJECT_CPD_INST_OPERANDS} --type='merge' -p '{"spec":{"autoScaleConfig":false}}'
Setting the autoScaleConfig value to false prevents watsonx Orchestrate from automatically adjusting resource usage.
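
The enable and disable commands differ only in the value of autoScaleConfig, so they can be wrapped in one small shell function. The following is a minimal sketch, not part of the product: the function name set_auto_scale is hypothetical, and it assumes that oc is logged in to the cluster and that PROJECT_CPD_INST_OPERANDS is exported in the current shell.

```shell
# Hypothetical helper (not shipped with watsonx Orchestrate or oc).
# Usage: set_auto_scale true|false
set_auto_scale() {
  local value="$1"

  # Accept only the two values that this document uses for autoScaleConfig.
  case "$value" in
    true|false) ;;
    *) echo "usage: set_auto_scale true|false" >&2; return 2 ;;
  esac

  # Stop early if the project variable is unset or empty, so that the
  # patch is never sent without a namespace.
  if [ -z "${PROJECT_CPD_INST_OPERANDS:-}" ]; then
    echo "PROJECT_CPD_INST_OPERANDS is not set" >&2
    return 1
  fi

  # Same merge patch as the commands above, with the value substituted.
  oc patch wo wo --namespace="${PROJECT_CPD_INST_OPERANDS}" --type='merge' \
    -p "{\"spec\":{\"autoScaleConfig\":${value}}}"
}
```

For example, set_auto_scale true enables auto scaling and set_auto_scale false disables it. Any other argument is rejected before oc is called.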

Verify the setting

To verify the current auto scaling configuration, run:
oc get wo wo --namespace=${PROJECT_CPD_INST_OPERANDS} -o yaml | grep autoScaleConfig
This command displays the current auto scaling value (true or false) from the watsonx Orchestrate custom resource.
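
As an alternative to grep, the value can be read directly from the spec section with the jsonpath output format of oc get. This is a sketch under assumptions: the function name get_auto_scale is hypothetical, and it treats an empty result as meaning that the field is absent from the custom resource.

```shell
# Hypothetical helper (not shipped with watsonx Orchestrate or oc).
# Prints true, false, or unset for spec.autoScaleConfig.
get_auto_scale() {
  local value

  # Read only spec.autoScaleConfig instead of searching the whole YAML output.
  value=$(oc get wo wo --namespace="${PROJECT_CPD_INST_OPERANDS}" \
    -o jsonpath='{.spec.autoScaleConfig}') || return 1

  # Assumption: empty output means that the field is not present in the spec.
  echo "${value:-unset}"
}
```

Unlike grep, which matches the field name wherever it appears in the YAML output, this approach returns only the value under spec, which makes it easier to use in scripts.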