Creating Kubernetes resources for the MDM machine learning service
The file mdm-ml_service.yml creates the Kubernetes resources ConfigMap, Deployment, Service, and Ingress for the MDM machine learning deployment. The file mdm-ml_hpa.yml creates an Auto Scaler for the machine learning service.
Before you begin
See the topic YAML files and scripts for deploying InfoSphere MDM on OpenShift or Kubernetes to download an archive containing mdm-ml_service.yml and mdm-ml_hpa.yml. Save both files to your Docker working directory (/mdm).
Take note of the following implementation details:
- Auto Scaling works based on the percentage of CPU utilization.
- The minimum number of replicas is two and the maximum is four. Tip: You can edit the replica limits as needed for your deployment.
About this task
Before creating these Kubernetes resources, you must update the YAML files to provide a number of details corresponding to your deployment needs.
Important: When using Machine Learning containers in a production environment,
especially if you are using an HPA configuration in Kubernetes or OpenShift, be sure to follow these steps:
- Start a single ML service instance (one Kubernetes pod configuration) first and wait until it is ready. As part of starting up for the first time, the container creates a number of required database tables in Cassandra.
- After the first instance is ready, enable the HPA configuration to start further service instances for high availability and performance load balancing.