Getting foundation model information

Get a list of the foundation models deployed in watsonx.ai and filter the list in useful ways.

Ways to develop

You must generate credentials to authenticate with watsonx.ai APIs. For details, see Generating a bearer token.

You can get information about the available foundation models programmatically by using the REST API.

Alternatively, you can view and filter the list of foundation models from the Resource hub in the watsonx.ai UI.

REST API

You can use the List the available foundation models method of the watsonx.ai API to get information about the available foundation models.

The model information that is returned includes the model ID, which you need to reference the model from your code.

List the available foundation models

The List the available foundation models method in the watsonx.ai API gets information about the foundation models that are deployed in your cluster and available for inferencing.

From a watsonx.ai lightweight engine installation, the request also returns information about any custom foundation models that are available.

curl -X GET \
  'https://cpd-<namespace-name>.apps.<OCP-domain>/ml/v1/foundation_model_specs?version=2024-05-01'
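The same request can be sketched in Python with only the standard library. This is a minimal illustration, not an official client: the helper names (build_specs_url, extract_model_ids, list_model_ids) are hypothetical, and the code assumes that the response lists models under a top-level "resources" array whose entries carry a "model_id" field. Check the response from your own cluster before you rely on that shape.

```python
import json
import urllib.request


def build_specs_url(base_url, version="2024-05-01"):
    """Return the URL of the foundation model specs endpoint."""
    return f"{base_url}/ml/v1/foundation_model_specs?version={version}"


def extract_model_ids(payload):
    """Collect model IDs from a parsed response body.

    Assumption: models are listed under a top-level "resources" array
    and each entry has a "model_id" field. Entries without one are skipped.
    """
    return [m["model_id"] for m in payload.get("resources", []) if "model_id" in m]


def list_model_ids(base_url, token=None):
    """Call the endpoint and return the model IDs that it reports.

    base_url is the cluster host from the curl example, for instance
    "https://cpd-<namespace-name>.apps.<OCP-domain>" with your own values.
    """
    request = urllib.request.Request(
        build_specs_url(base_url),
        headers={"Accept": "application/json"},
    )
    if token:
        # Bearer token generated as described earlier in this topic.
        request.add_header("Authorization", f"Bearer {token}")
    with urllib.request.urlopen(request) as response:
        return extract_model_ids(json.load(response))
```

Calling list_model_ids with your cluster URL and token would return a plain list of strings such as "ibm/granite-8b-code-instruct", which you can then filter in your own code.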

After you get the model ID, you can reference the model ID in your API request as follows:

curl --request POST 'https://cpd-<namespace-name>.apps.<OCP-domain>/ml/v1/text/generation?version=2023-05-02' \
-H "Authorization: Bearer ${TOKEN}" \
-H 'Content-Type: application/json' \
-H 'Accept: application/json' \
--data-raw '{
  "model_id": "ibm/granite-8b-code-instruct",
  "input": "Tell me a story",
  "project_id": "63dc4cf1-252f-424b-b52d-5cdd9814987f"
}'
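For comparison, the text generation request can be assembled in Python as follows. This is a sketch under the same assumptions as the curl command: the endpoint path, version date, and body fields are copied from it, while the function name build_generation_request is hypothetical. The function only builds the request object, so you can inspect it before sending.

```python
import json
import urllib.request


def build_generation_request(base_url, token, model_id, prompt, project_id,
                             version="2023-05-02"):
    """Build (but do not send) a POST request for the text generation endpoint."""
    body = {
        "model_id": model_id,      # ID returned by the foundation model specs call
        "input": prompt,
        "project_id": project_id,
    }
    return urllib.request.Request(
        f"{base_url}/ml/v1/text/generation?version={version}",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
        method="POST",
    )
```

To send the request, pass the returned object to urllib.request.urlopen and parse the response body with json.load.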

Model IDs for inferencing foundation models

The following list shows the values to use in the {model_id} parameter when you inference a foundation model from the API.

  • allam-1-13b-instruct

    sdaia/allam-1-13b-instruct
    
  • devstral-medium-2507

    mistralai/devstral-medium-2507
    
  • codestral-2501

    mistralai/codestral-2501
    
  • codestral-2508

    mistralai/codestral-2508
    
  • gpt-oss-20b

    openai/gpt-oss-20b
    
  • gpt-oss-120b

    openai/gpt-oss-120b
    
  • granite-4-h-tiny

    ibm/granite-4-h-tiny
    
  • granite-4-h-small

    ibm/granite-4-h-small
    
  • granite-4-h-micro

    ibm/granite-4-h-micro
    
  • granite-4-1b-speech

    ibm/granite-4-1b-speech
    
  • granite-3-2-8b-instruct

    ibm/granite-3-2-8b-instruct
    
  • granite-3-2b-instruct

    ibm/granite-3-2b-instruct
    
  • granite-3-8b-instruct

    ibm/granite-3-8b-instruct
    
  • granite-docling-258M

    ibm/granite-docling-258M
    
  • granite-guardian-3-2-5b

    ibm/granite-guardian-3-2-5b
    
  • granite-guardian-3-2b

    ibm/granite-guardian-3-2b
    
  • granite-guardian-3-8b

    ibm/granite-guardian-3-8b
    
  • granite-3b-code-instruct

    ibm/granite-3b-code-instruct
    
  • granite-8b-code-instruct

    ibm/granite-8b-code-instruct
    
  • granite-20b-code-instruct

    ibm/granite-20b-code-instruct
    
  • granite-34b-code-instruct

    ibm/granite-34b-code-instruct
    
  • granite-vision-3-3-2b

    ibm/granite-vision-3-3-2b
    
  • granite-vision-3-2-2b

    ibm/granite-vision-3-2-2b
    
  • ibm-defense-3-3-8b-instruct

    ibm/ibm-defense-3-3-8b-instruct
    
  • ibm-defense-4-0-micro

    ibm/ibm-defense-4-0-micro
    
  • jais-13b-chat

    core42/jais-13b-chat
    
  • llama-4-maverick-17b-128e-instruct-fp8

    meta-llama/llama-4-maverick-17b-128e-instruct-fp8
    
  • llama-4-maverick-17b-128e-instruct-int4

    redhatai/llama-4-maverick-17b-128e-instruct-int4
    
  • llama-4-scout-17b-16e-instruct-int4

    redhatai/llama-4-scout-17b-16e-instruct-int4
    
  • llama-3-3-70b-instruct

    meta-llama/llama-3-3-70b-instruct
    
  • llama-3-2-1b-instruct

    meta-llama/llama-3-2-1b-instruct
    
  • llama-3-2-3b-instruct

    meta-llama/llama-3-2-3b-instruct
    
  • llama-3-2-11b-vision-instruct

    meta-llama/llama-3-2-11b-vision-instruct
    
  • llama-3-2-90b-vision-instruct

    meta-llama/llama-3-2-90b-vision-instruct
    
  • llama-guard-3-11b-vision

    meta-llama/llama-guard-3-11b-vision
    
  • llama-3-1-8b-instruct

    meta-llama/llama-3-1-8b-instruct
    
  • llama-3-1-70b-instruct

    meta-llama/llama-3-1-70b-instruct
    
  • ministral-3b-instruct-2512

    mistralai/ministral-3b-instruct-2512
    
  • ministral-8b-instruct

    mistralai/ministral-8b-instruct
    
  • mistral-medium-2505

    mistralai/mistral-medium-2505
    
  • mistral-medium-2508

    mistralai/mistral-medium-2508
    
  • mistral-large-instruct-2411

    mistralai/mistral-large-instruct-2411
    
  • mistral-small-3-1-24b-instruct-2503

    mistralai/mistral-small-3-1-24b-instruct-2503
    
  • nvidia-nemotron-nano-12b-v2-vl-fp8

    nvidia/nvidia-nemotron-nano-12b-v2-vl-fp8
    
  • nvidia-nemotron-3-nano-30b-a3b-fp8

    nvidia/nvidia-nemotron-3-nano-30b-a3b-fp8
    
  • pixtral-12b

    mistralai/pixtral-12b
    
  • pixtral-large-instruct-2411

    mistralai/pixtral-large-instruct
    
  • voxtral-small-24b-2507

    mistralai/voxtral-small-24b-2507
    
  • voxtral-mini-2507

    mistralai/voxtral-mini-2507
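The model IDs in the preceding list all follow a provider/model-name pattern, which makes simple client-side filtering possible. The following sketch groups IDs by the part before the slash. The helper names split_model_id and group_by_provider are hypothetical, and the pattern is an observation about this list rather than a documented guarantee of the API.

```python
from collections import defaultdict


def split_model_id(model_id):
    """Split 'provider/model-name' into its two parts."""
    provider, separator, name = model_id.partition("/")
    if not separator or not provider or not name:
        raise ValueError(f"Unexpected model ID format: {model_id!r}")
    return provider, name


def group_by_provider(model_ids):
    """Map each provider prefix to the model names listed under it."""
    groups = defaultdict(list)
    for model_id in model_ids:
        provider, name = split_model_id(model_id)
        groups[provider].append(name)
    return dict(groups)


# A few IDs taken from the list above.
sample_ids = [
    "ibm/granite-3-8b-instruct",
    "mistralai/pixtral-12b",
    "ibm/granite-4-h-tiny",
]
grouped = group_by_provider(sample_ids)
```

Note that the prefix identifies the namespace of the ID, not necessarily the original model developer: the int4 Llama 4 models in the list, for example, use the redhatai prefix.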