Reasoning Service parameters
Update the custom resource YAML file to provide the configuration values for the Reasoning Service component in your IBM Content Cortex AI Services deployment.
| Parameters | Description | Default or example values | Required |
|---|---|---|---|
| reasoning_service_secret_name | Specifies the secret that contains provider credentials and configuration for the Reasoning Service. | ibm-providers-config-secret | Yes |
| reasoning_service_ext_tls_secret_name | Specifies an existing certificate secret for automatic creation of OpenShift routes. | "" | No |
| reasoning_service_auth_ca_secret_name | Specifies the certificate authority secret that signs the external TLS secret for automatic creation of OpenShift routes. | "" | No |
| route_ingress_annotations | Specifies custom annotations for generated routes or ingress resources. | haproxy.router.openshift.io/balance: roundrobin |
No |
| replica_count | Specifies the number of Reasoning Service replicas. | 2 | No |
rolling_update
|
Rolling update settings for Reasoning Service deployment. The max_unavailable parameter specifies the maximum number or percentage of unavailable pods during a rolling update. The max_surge parameter specifies the maximum number of extra pods that can be created during a rolling update. |
|
No |
| arch.amd64 | Specifies the architecture preference for Linux on x86 clusters. | 3 - Most preferred | No |
| image.repository | Specifies the container image repository for the Reasoning Service image. | cp.icr.io/cp/cp4a/fncm/reasoning-services | Yes |
| image.tag | Specifies the image tag for the Reasoning Service container image. | 26.0.0 | No |
| image.pull_policy | Specifies the image pull policy for the Reasoning Service container image. | IfNotPresent | No |
| image.digest | Specifies the image digest as an alternative to the image tag. | sha256:... |
No |
| resources.requests.cpu | Specifies the requested CPU resources for the Reasoning Service pods. | 500m | No |
| resources.requests.memory | Specifies the requested memory resources for the Reasoning Service pods. | 1Gi | No |
| resources.requests.ephemeral_storage | Specifies the requested ephemeral storage for the Reasoning Service pods. | 1Gi | No |
| resources.limits.cpu | Specifies the CPU resource limit for the Reasoning Service pods. | 2000m | No |
| resources.limits.memory | Specifies the memory resource limit for the Reasoning Service pods. | 4Gi | No |
| resources.limits.ephemeral_storage | Specifies the ephemeral storage limit for the Reasoning Service pods. | 2Gi | No |
| auto_scaling.enabled | Specifies whether horizontal pod autoscaling is enabled for the Reasoning Service. | false | No |
| auto_scaling.max_replicas | Specifies the maximum number of replicas when autoscaling is enabled. | <Required> | No |
| auto_scaling.min_replicas | Specifies the minimum number of replicas when autoscaling is enabled. | <Required> | No |
| auto_scaling.target_cpu_average_utilization | Specifies the target average CPU utilization percentage for autoscaling. | <Required> | No |
| auto_scaling.target_memory_average_utilization | Specifies the target average memory utilization percentage for autoscaling. | <Required> | No |
| auto_scaling.scaleup.policies_pods_value | Specifies the maximum number of pods that the horizontal pod autoscaler can add during each scale-up interval. | <Optional> | No |
| auto_scaling.scaleup.policies_pods_period_seconds | Specifies how often the horizontal pod autoscaler can apply a scale-up action. | <Optional> | No |
| auto_scaling.scaleup.stabilization_window_seconds | Specifies the cooldown window before a scale-up action is applied. | <Optional> | No |
| auto_scaling.scaledown.policies_pods_value | Specifies the maximum number of pods that the horizontal pod autoscaler can remove during each scale-down interval. | <Optional> | No |
| auto_scaling.scaledown.policies_pods_period_seconds | Specifies how often the horizontal pod autoscaler can apply a scale-down action. | <Optional> | No |
| auto_scaling.scaledown.stabilization_window_seconds | Specifies the cooldown window before a scale-down action is applied. | <Optional> | No |
| node_affinity.custom_node_selector_match_expression | Specifies custom node selector match expressions for node affinity. | [] | No |
| custom_annotations | Specifies custom annotations that are added to generated pods. | { } | No |
| custom_labels | Specifies custom labels that are added to generated pods. | { } | No |
| tolerations | Specifies tolerations that allow pods to be scheduled on tainted nodes. | [] | No |
| security_context.supplemental_groups | Specifies supplemental group IDs for the container security context. | [] | No |
| security_context.selinux_options | Specifies SELinux options for the container security context. | {} | No |
| security_context.fs_groupchangepolicy | Specifies how ownership and permissions are changed for mounted volumes. | "" | No |
| custom_configmap | Specifies custom ConfigMaps to mount into the Reasoning Service pods. | Commented out | No |
| reasoning_service_production_setting.log_level | Specifies the general application logging level. | info | No |
| reasoning_service_production_setting.mcp_client | Specifies the logging level for MCP client communication. | info | No |
| reasoning_service_production_setting.basic_search | Specifies the logging level for the basic search agent. | info | No |
| reasoning_service_production_setting.navigator_core | Specifies the logging level for the watsonx search agent. | info | No |
| reasoning_service_production_setting.api | Specifies the logging level for API endpoint processing. | info | No |
| reasoning_service_production_setting.agent_registry | Specifies the logging level for agent registry operations. | info | No |
| reasoning_service_production_setting.thread_manager | Specifies the logging level for thread management. | info | No |
| reasoning_service_production_setting.tool_registry | Specifies the logging level for tool registry operations. | info | No |
| reasoning_service_production_setting.auth | Specifies the logging level for authentication processing. | warning | No |
| reasoning_service_production_setting.console_output | Specifies whether console logging is enabled. | false | No |
| reasoning_service_production_setting.file_output | Specifies whether file-based logging is enabled. | true | No |
| reasoning_service_production_setting.file_mcp_client_path | Specifies the log file path for MCP client logs. | /app/logs/mcp_client.log | No |
| reasoning_service_production_setting.file_navigator_core_path | Specifies the log file path for watsonx search agent logs. | /app/logs/navigator_core_agent.log | No |
| reasoning_service_production_setting.file_api_path | Specifies the log file path for API logs. | /app/logs/api.log | No |
| reasoning_service_production_setting.file_agent_registry_path | Specifies the log file path for agent registry logs. | /app/logs/api.log | No |
| reasoning_service_production_setting.file_thread_manager_path | Specifies the log file path for thread manager logs. | /app/logs/api.log | No |
| reasoning_service_production_setting.file_tool_registry_path | Specifies the log file path for tool registry logs. | /app/logs/utils.log | No |
| reasoning_service_production_setting.file_auth_path | Specifies the log file path for authentication logs. | /app/logs/api.log | No |
| reasoning_service_production_setting.startup.workers | Specifies the number of startup workers. | 4 | No |
| reasoning_service_production_setting.startup.limit_concurrency | Specifies the startup concurrency limit. | 4 | No |
| reasoning_service_production_setting.startup.limit_max_requests | Specifies the maximum number of requests before worker recycling. | 2000 | No |
| reasoning_service_production_setting.startup.timeout_keep_alive | Specifies the keep-alive timeout, in seconds. | 600 | No |
| datavolume.existing_pvc_for_reasoning_service_cfgstore.name | Specifies the persistent volume claim name for configuration storage. | reasoning-service-config-pvc | No |
| datavolume.existing_pvc_for_reasoning_service_cfgstore.size | Specifies the persistent volume claim size for configuration storage. | 5Gi | No |
| datavolume.existing_pvc_for_reasoning_service_logstore.name | Specifies the persistent volume claim name for log storage. | reasoning-service-logs-pvc | No |
| datavolume.existing_pvc_for_reasoning_service_logstore.size | Specifies the persistent volume claim size for log storage. | 10Gi | No |
probe.readiness
|
The behavior of readiness probes to know when the containers are ready to start accepting traffic. |
|
No |
probe.liveness
|
The behavior of liveness probes to know when to restart a container. |
|
No |
probe.startup
|
The behavior of startup probes to know when the container is started. |
|
No |