Deploying a trained model

Deploy a trained model to get it ready to use within PowerAI Vision or a different program, such as IBM® PowerAI. Deploying a model creates a unique API endpoint based on that model for inference operations.

Note: PowerAI Vision assigns one GPU to each active training job or deployed deep learning API. For example, if a system has four GPUs and you have two deployed web APIs, there are two GPUs available for active training jobs.
To deploy the trained model, follow these steps:
  1. Click Models from the menu.
  2. Select the model you want to deploy and click Deploy.
  3. Specify a name for the model, and for models that were trained with the Optimized for speed (tiny YOLO v2) model, choose the accelerator to deploy to. You can choose GPU, CPU, or Xilinx FPGA - 16 bit (technology preview).
    Note: Deploying a model to a Xilinx FPGA requires the Xilinx Alveo U200 Accelerator card.
  4. Click Deploy. The Deployed Models page is displayed. When the model has been deployed, the status column displays Ready.
  5. Click the deployed model to get the API endpoint, to view details about the model, such as the owner and the accuracy, and to test other videos or images against the model. For information about using the API see Vision Service API documentation.
    Note: When using the API, the smaller confidence threshold you specify, the more results are returned. If you specify 0, many, many results will be returned because there is no filter based on the confidence level of the model.
  6. If necessary, you can delete a deployed model. To delete a deployed model, click Deployed Models. Next, select the model that you want to delete and click Delete. The trained model is not deleted from PowerAI Vision.