Deploying generative AI assets

Deploy generative AI assets to use them in production and monitor these deployed assets.

Types of deployable assets for generative AI applications

You can use watsonx.ai to deploy the following assets for your generative AI applications:

Prompt templates
Tuned models
Custom foundation models

Deploying prompt templates

After you save a prompt template as a project asset, you can promote it to a deployment space. From the deployment space, you can deploy your prompt template to production and get the endpoint for inferencing.

If you have the watsonx.governance service, you can also capture and track the deployment details for a prompt template to meet governance requirements.

For more information, see Deploying a prompt template.

Deploying tuned models

After you tune a foundation model and save the tuned model as a project asset, you can promote it to a deployments space. From the deployment space, you can test the tuned model and get the endpoint for inferencing.

For more information, see Deploying tuned models.

Deploying custom foundation models

In addition to working with foundation models that are curated by IBM, you can upload and deploy your own foundation models. After the models are deployed and registered with watsonx.ai, create prompts that inference the custom models from the Prompt Lab.

Deploying a custom foundation model provides the flexibility for you to implement the AI solutions that are right for your use case.

For more information, see Deploying a custom foundation model.

Deploying generative AI assets

Types of deployable assets for generative AI applications

Deploying prompt templates

Deploying tuned models

Deploying custom foundation models

Learn more