Setting up storage and uploading the model

To deploy a custom foundation model for inferencing with watsonx.ai, you must prepare a properly sized Persistent Volume Claim (PVC) on your cluster, upload the model to it, and then perform any required conversions. The detailed steps depend on where your model is located.
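As a rough aid for sizing the storage, the following Python sketch adds up the size of the model files in a local directory and rounds the result up to whole GiB. The function names and the headroom_factor default of 2.0 are illustrative assumptions, not values taken from this documentation; check the sizing guidance for your release before you create the claim.

```python
import math
import os


def directory_size_bytes(path: str) -> int:
    """Return the total size in bytes of all regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            # Skip symbolic links so linked files are not counted twice.
            if not os.path.islink(file_path):
                total += os.path.getsize(file_path)
    return total


def suggested_pvc_size_gib(model_bytes: int, headroom_factor: float = 2.0) -> int:
    """Round model size times headroom up to whole GiB (minimum 1).

    headroom_factor is a hypothetical allowance for converted copies of the
    model; it is an assumption, not an official recommendation.
    """
    gib = 1024 ** 3
    return max(1, math.ceil(model_bytes * headroom_factor / gib))
```

For example, suggested_pvc_size_gib(directory_size_bytes("/path/to/model")) returns a whole number of GiB that you could use as a starting point for the storage request.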

Prerequisites

Public model repositories might require you to set up an account. For example, to get your model from Hugging Face, you must have a Hugging Face account and an access token. If you do not have an account, create one on the Hugging Face website. Then generate a token by following Hugging Face's guide to creating a token.
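After you have a token, one way to use it is sketched below in Python. This is a minimal illustration, not the procedure that watsonx.ai prescribes: it assumes that the token is stored in the HF_TOKEN environment variable and that the huggingface_hub package is installed. The helper names, the repository ID, and the target directory are hypothetical.

```python
import os


def read_hf_token(env_var: str = "HF_TOKEN") -> str:
    """Read the Hugging Face token from the environment, failing early if unset."""
    token = os.environ.get(env_var, "").strip()
    if not token:
        raise RuntimeError(f"Set {env_var} to your Hugging Face token first.")
    return token


def download_model(repo_id: str, target_dir: str) -> str:
    """Download all files of a model repository and return the local path."""
    # Imported lazily so read_hf_token works even without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=repo_id,        # hypothetical, for example "your-org/your-model"
        local_dir=target_dir,   # hypothetical local or mounted directory
        token=read_hf_token(),
    )
```

Reading the token from the environment keeps it out of scripts and shell history. A call such as download_model("your-org/your-model", "/path/to/target") would then place the model files in the target directory.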

To set up storage and upload a model, find the scenario that matches your model: