If your machine learning workload is large, mission critical, or both, you can configure your WMLz base core services for high availability. To achieve high availability of core services, consider configuring a WMLz base cluster. Each cluster can consist of two or more WMLz base instances that run either on a single LPAR or across different LPARs.
Procedure
- Complete the installation and configuration of the first WMLz base instance by completing all required tasks as described in Installation roadmap.
- Retrieve the following configuration information of your first WMLz base instance from the System Configuration page of the administration dashboard:
- Keystore. All WMLz base instances in a cluster must use the same keystore for secure connections and user authentication. If a RACF® keying-based keystore is used in the first WMLz base instance, make note of the keyring name, keyring owner, and certificate label. If a file-based keystore is used, retrieve the location of your SSL certificates and the password for the keystore.jks file.
- Metadata schema. All WMLz base instances in a cluster must use the same metadata schema. Write down the schema name as well as the metadata database, storage group, and buffer pool information.
- Core services port. All WMLz base instances in a cluster must use the same cluster host IP and WMLz core services port number. If the cluster is type 1 where all instances run on the same LPAR, write down the LPAR IP address and the SHAREPORT number. If the cluster is type 2 where the instances run across different LPARs, write down the sysplex IP address and the SD port number.
- Install and configure the second WMLz base instance into the cluster.
Follow instructions in Configuring WML for z/OS base to configure the new instance. When prompted, make sure that you specify the keystore, metadata schema, and core services port information that you collected in Step 2.
- On the Authentication page, specify the keystore type and related information used in the first WMLz base instance. If the keystore is RACF keying-based, specify the keystore name, keyring owner, and certificate label. If the keystore is file-based, specify the same set of certificate and key files. This ensures that all WMLz base instances in the cluster uses the same keystore to secure connections and authenticate users.
- On the Metadata repository page, specify the same metadata schema name, database, storage group, and buffer pool used in the first WMLz base instance. This ensures that all instances in the cluster use the same metadata objects.
- On the UI and core services page, specify the cluster host IP address and WMLz core services port number. If the cluster is type 1 where all instances run on the same LPAR, specify the LPAR IP as the cluster host IP and the SHAREPORT number as the WMLz core services port. If the cluster is type 2 where the instances run across different LPARs, specify the sysplex IP address as the cluster host IP and the SD port number as the core services port.
- Repeat Step 3 to install and configure any additional instance into the cluster.
- Complete the cluster setup and start the cluster.
- Repeat Steps 3 - 4 to complete the cluster setup.
- Verify that all WMLz base instances in the cluster are started and running.
- Verify that the runtime environment of one WMLz base instance is active.
- Configure the REST API of your machine learning application to call the host IP and core services port of your WMLz base cluster.
For cluster type 1, the cluster host IP is the LPAR IP address and the core services port is the SHAREPORT number. For cluster type 2, the cluster host IP is the sysplex distributor IP address and the core services port is the sysplex distributor (SD) port number.
- In the event that the active WMLz base instance in your cluster is down, activate the runtime environment of another instance by using the administration dashboard.
- Sign into the administration dashboard.
- From the sidebar, navigate to the
System Management - Runtime Environments
page.
- Select the runtime environment you want to activate and from the
ACTIONS
menu, click the Connection icon to connect and activate.
- Verify that the new runtime environment is active and the cluster is up and running.