Core MCP Server parameters

Update the custom resource YAML file to provide the configuration values for the Core MCP Server component in your IBM Content Cortex AI Services deployment.
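As a rough sketch of what such an update can look like, the fragment below sets a few of the parameters from Table 1 to their listed default or example values. It makes two assumptions that are not confirmed here: that the section is keyed by `core_mcp_server_configuration` (taken from the table title), and that dotted parameter names such as `image.repository` map to nested YAML keys. Where the section sits inside the custom resource is deliberately not shown.

```yaml
# Hypothetical fragment: the parent location of this section in the
# custom resource is an assumption and is omitted here.
core_mcp_server_configuration:
  replica_count: 2
  image:
    # Assumption: dotted names in Table 1 map to nested keys.
    repository: cp.icr.io/cp/cp4a/fncm/core-mcp-server
    tag: "26.0.0"              # quoted so that it is read as a string
    pull_policy: IfNotPresent
  resources:
    requests:
      cpu: 250m
      memory: 512Mi
      ephemeral_storage: 1Gi
    limits:
      cpu: 1000m
      memory: 2Gi
      ephemeral_storage: 2Gi
```

Only `image.repository` is marked as required in the table, so the other keys in this sketch could likely be left out to accept the defaults.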

Table 1. Core MCP Server configuration parameters: core_mcp_server_configuration
Parameters Description Default or example values Required
core_mcp_ext_tls_secret_name Specifies an existing certificate secret for automatic creation of OpenShift routes. "" No
core_mcp_auth_ca_secret_name Specifies the secret that contains the certificate authority (CA) certificate used to sign the external TLS certificate, for automatic creation of OpenShift routes. "" No
route_ingress_annotations Specifies custom annotations for generated routes or ingress resources. haproxy.router.openshift.io/balance: roundrobin No
arch.amd64 Specifies the architecture preference for Linux on x86 clusters. 3 - Most preferred No
replica_count Specifies the number of Core MCP Server replicas. 2 No
rolling_update Specifies the rolling update settings for the Core MCP Server deployment. The max_unavailable parameter specifies the maximum number or percentage of pods that can be unavailable during a rolling update. The max_surge parameter specifies the maximum number of extra pods that can be created during a rolling update. max_unavailable: 25%, max_surge: 1 No
image.repository Specifies the container image repository for the Core MCP Server image. cp.icr.io/cp/cp4a/fncm/core-mcp-server Yes
image.tag Specifies the image tag for the Core MCP Server container image. 26.0.0 No
image.pull_policy Specifies the image pull policy for the Core MCP Server container image. IfNotPresent No
image.digest Specifies the image digest as an alternative to the image tag. sha256:... No
resources.requests.cpu Specifies the requested CPU resources for the Core MCP Server pods. 250m No
resources.requests.memory Specifies the requested memory resources for the Core MCP Server pods. 512Mi No
resources.requests.ephemeral_storage Specifies the requested ephemeral storage for the Core MCP Server pods. 1Gi No
resources.limits.cpu Specifies the CPU resource limit for the Core MCP Server pods. 1000m No
resources.limits.memory Specifies the memory resource limit for the Core MCP Server pods. 2Gi No
resources.limits.ephemeral_storage Specifies the ephemeral storage limit for the Core MCP Server pods. 2Gi No
auto_scaling.enabled Specifies whether horizontal pod autoscaling is enabled for the Core MCP Server. false No
auto_scaling.max_replicas Specifies the maximum number of replicas when autoscaling is enabled. <Required> No
auto_scaling.min_replicas Specifies the minimum number of replicas when autoscaling is enabled. <Required> No
auto_scaling.target_cpu_average_utilization Specifies the target average CPU utilization percentage for autoscaling. <Required> No
auto_scaling.target_memory_average_utilization Specifies the target average memory utilization percentage for autoscaling. <Required> No
auto_scaling.scaleup.policies_pods_value Specifies the maximum number of pods that the horizontal pod autoscaler can add during each scale-up interval. <Optional> No
auto_scaling.scaleup.policies_pods_period_seconds Specifies how often the horizontal pod autoscaler can apply a scale-up action. <Optional> No
auto_scaling.scaleup.stabilization_window_seconds Specifies the cooldown window before a scale-up action is applied. <Optional> No
auto_scaling.scaledown.policies_pods_value Specifies the maximum number of pods that the horizontal pod autoscaler can remove during each scale-down interval. <Optional> No
auto_scaling.scaledown.policies_pods_period_seconds Specifies how often the horizontal pod autoscaler can apply a scale-down action. <Optional> No
auto_scaling.scaledown.stabilization_window_seconds Specifies the cooldown window before a scale-down action is applied. <Optional> No
node_affinity.custom_node_selector_match_expression Specifies custom node selector match expressions for node affinity. [] No
custom_annotations Specifies custom annotations that are added to generated pods. { } No
custom_labels Specifies custom labels that are added to generated pods. { } No
security_context.supplemental_groups Specifies supplemental group IDs for the container security context. [] No
security_context.selinux_options Specifies SELinux options for the container security context. {} No
security_context.fs_groupchangepolicy Specifies how ownership and permissions are changed for mounted volumes. "" No
custom_configmap Specifies custom ConfigMaps to mount into the Core MCP Server pods. Commented out No
core_mcp_production_setting.log_level Specifies the logging level for the Core MCP Server. info No
core_mcp_production_setting.rate_limit_rpm Specifies the request-per-minute limit for the Core MCP Server. 100 No
core_mcp_production_setting.rate_limit_burst Specifies the burst capacity for rate limiting. 150 No
core_mcp_production_setting.startup.workers Specifies the number of startup workers. 4 No
core_mcp_production_setting.startup.limit_concurrency Specifies the startup concurrency limit. 4 No
core_mcp_production_setting.startup.limit_max_requests Specifies the maximum number of requests before worker recycling. 2000 No
core_mcp_production_setting.startup.timeout_keep_alive Specifies the keep-alive timeout, in seconds. 600 No
probe.readiness Specifies the readiness probe settings, which determine when a container is ready to start accepting traffic. initial_delay_seconds: 10, period_seconds: 10, timeout_seconds: 5, failure_threshold: 3 No
probe.liveness Specifies the liveness probe settings, which determine when a container is restarted. initial_delay_seconds: 30, period_seconds: 30, timeout_seconds: 5, failure_threshold: 3 No
probe.startup Specifies the startup probe settings, which determine when a container has started. initial_delay_seconds: 20, period_seconds: 10, timeout_seconds: 5, failure_threshold: 20 No
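The grouped parameters in Table 1 (rolling_update, auto_scaling, and probe) are easier to read as nested YAML. The fragment below is a minimal sketch under the assumption that each group maps to a nested block under `core_mcp_server_configuration`. The rolling update and probe values are the defaults from the table. The autoscaling values are purely illustrative, because the table lists no defaults for them.

```yaml
# Hypothetical fragment: the nesting is inferred from the parameter names
# in Table 1 and is not confirmed by the source.
core_mcp_server_configuration:
  rolling_update:
    max_unavailable: "25%"     # number or percentage of pods
    max_surge: 1
  auto_scaling:
    enabled: true              # table default is false
    # Illustrative values only; Table 1 gives no defaults for these.
    min_replicas: 2
    max_replicas: 4
    target_cpu_average_utilization: 80
  probe:
    readiness:
      initial_delay_seconds: 10
      period_seconds: 10
      timeout_seconds: 5
      failure_threshold: 3
    liveness:
      initial_delay_seconds: 30
      period_seconds: 30
      timeout_seconds: 5
      failure_threshold: 3
    startup:
      initial_delay_seconds: 20
      period_seconds: 10
      timeout_seconds: 5
      failure_threshold: 20
```

With the table defaults, the startup probe allows 20 failures at 10-second intervals after a 20-second initial delay, so a container would have roughly 220 seconds to start before it is considered failed, assuming the settings carry the usual Kubernetes probe semantics.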