Supported foundation models
You can work with third-party and IBM foundation models in IBM watsonx.ai. You can use foundation models that are provided by IBM and are ready to use immediately, or deploy foundation models on demand to use exclusively for your organization.
How to choose a model
To review factors that can help you to choose a model, such as supported tasks and languages, see Choosing a model and Foundation model benchmarks.
Provided foundation models that are ready to use
A collection of open source and IBM foundation models is deployed in IBM watsonx.ai. You can prompt these foundation models in the Prompt Lab or programmatically.
For details on metering for foundation model inference in watsonx.ai, see Billing rates for inferencing foundation models. For more information about the IBM watsonx.ai service description with various cloud providers, see:
You can work with the following types of provided foundation models:
IBM foundation models
The following table lists the supported IBM foundation models that IBM provides for inferencing.
You can also access some IBM foundation models from third-party repositories, such as Hugging Face. IBM foundation models that you obtain from a third-party repository are not indemnified by IBM. Only IBM foundation models that you access from watsonx.ai are indemnified by IBM. For more information about contractual protections related to IBM indemnification, see the IBM Client Relationship Agreement.
| Model name | API model ID | Input price (USD/1,000 tokens) | Output price (USD/1,000 tokens) | Context window (input + output tokens) | More information |
|---|---|---|---|---|---|
| granite-4-h-small | ibm/granite-4-h-small | $0.00006 | $0.00025 | 131,072 | • Model card • Website |
| granite-3-3-8b-instruct | ibm/granite-3-3-8b-instruct | $0.0002 | $0.0002 | 131,072 | • Model card • Website |
| granite-3-8b-instruct | ibm/granite-3-8b-instruct | $0.0002 | $0.0002 | 131,072 | • Model card • Website • Research paper |
| granite-3-2-8b-instruct | ibm/granite-3-2-8b-instruct | $0.0002 | $0.0002 | 131,072 | • Model card • Website • Research paper |
| granite-8b-code-instruct | ibm/granite-8b-code-instruct | $0.0006 | $0.0006 | 128,000 | • Model card • Website • Research paper |
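
As a rough illustration of how the per-1,000-token prices translate into a charge, the following Python sketch estimates the cost of a single request. The `estimate_cost` function and the `IBM_MODEL_PRICES` dictionary are hypothetical names used only for this example and are not part of any watsonx.ai SDK. The formula assumes that input and output tokens are billed independently at the listed rates; check the billing documentation for how usage is actually metered.

```python
from decimal import Decimal

# Prices in USD per 1,000 tokens (input, output), copied from the table above.
# Only a few models are included to keep the example short.
IBM_MODEL_PRICES = {
    "ibm/granite-4-h-small": (Decimal("0.00006"), Decimal("0.00025")),
    "ibm/granite-3-3-8b-instruct": (Decimal("0.0002"), Decimal("0.0002")),
    "ibm/granite-8b-code-instruct": (Decimal("0.0006"), Decimal("0.0006")),
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (assumed formula, see the text above)."""
    input_price, output_price = IBM_MODEL_PRICES[model_id]
    # Each price applies to a block of 1,000 tokens, so divide the total by 1,000.
    return (input_price * input_tokens + output_price * output_tokens) / 1000


# Example: 10,000 input tokens and 2,000 output tokens on granite-4-h-small.
cost = estimate_cost("ibm/granite-4-h-small", 10_000, 2_000)
print(f"Estimated cost: ${cost}")
```

`Decimal` is used instead of `float` so that very small per-token prices do not pick up binary rounding noise when many requests are summed.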
The following table lists the IBM time series foundation models. These models are priced per 1,000 data points rather than per 1,000 tokens.

| Model name | API model ID | Input price (USD/1,000 data points) | Output price (USD/1,000 data points) | Context length (minimum data points) | More information |
|---|---|---|---|---|---|
| granite-ttm-512-96-r2 | ibm/granite-ttm-512-96-r2 | $0.00013 | $0.00038 | 512 | • Model card • Website • Research paper |
| granite-ttm-1024-96-r2 | ibm/granite-ttm-1024-96-r2 | $0.00013 | $0.00038 | 1,024 | • Model card • Website • Research paper |
| granite-ttm-1536-96-r2 | ibm/granite-ttm-1536-96-r2 | $0.00013 | $0.00038 | 1,536 | • Model card • Website • Research paper |
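
Because each granite-ttm model requires a minimum number of historical data points, it can help to check the length of your series before you choose a model. The sketch below shows one possible heuristic: pick the model with the largest minimum context length that the available history still satisfies. The `pick_ttm_model` function and the selection rule are assumptions made for illustration. They are not an IBM recommendation, and a longer context is not guaranteed to produce better forecasts for your data.

```python
from typing import Optional

# Minimum number of data points per model, copied from the table above.
TTM_MIN_CONTEXT = {
    "ibm/granite-ttm-512-96-r2": 512,
    "ibm/granite-ttm-1024-96-r2": 1024,
    "ibm/granite-ttm-1536-96-r2": 1536,
}


def pick_ttm_model(history_length: int) -> Optional[str]:
    """Return the model with the largest minimum context that the history covers.

    Returns None when the history is shorter than every model's minimum.
    This selection rule is a hypothetical heuristic, not documented behavior.
    """
    candidates = [
        (min_points, model_id)
        for model_id, min_points in TTM_MIN_CONTEXT.items()
        if min_points <= history_length
    ]
    if not candidates:
        return None
    # max() compares the tuples by minimum context length first.
    return max(candidates)[1]


print(pick_ttm_model(600))
print(pick_ttm_model(100))
```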
Third-party foundation models
The following table lists the supported third-party foundation models for inferencing.
| Model name | API model ID | Provider | Input price (USD/1,000 tokens) | Output price (USD/1,000 tokens) | Context window (input + output tokens) | More information |
|---|---|---|---|---|---|---|
| allam-1-13b-instruct | sdaia/allam-1-13b-instruct | National Center for Artificial Intelligence and Saudi Authority for Data and Artificial Intelligence | $0.0018 | $0.0018 | 4,096 | • Model card |
| gpt-oss-120b | openai/gpt-oss-120b | OpenAI | $0.00015 | $0.0006 | 131,072 | • Model card • OpenAI blog |
| llama-4-maverick-17b-128e-instruct-fp8 | meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | Meta | $0.00035 | $0.0014 | 131,072 | • Model card • Meta AI blog |
| llama-3-3-70b-instruct | meta-llama/llama-3-3-70b-instruct | Meta | $0.00071 | $0.00071 | 131,072 | • Model card • Meta AI blog |
| llama-3-2-11b-vision-instruct | meta-llama/llama-3-2-11b-vision-instruct | Meta | $0.00035 | $0.00035 | 131,072 | • Model card • Meta AI blog • Research paper |
| llama-3-2-90b-vision-instruct | meta-llama/llama-3-2-90b-vision-instruct | Meta | $0.0020 | $0.0020 | 131,072 | • Model card • Meta AI blog • Research paper |
| llama-guard-3-11b-vision | meta-llama/llama-guard-3-11b-vision | Meta | $0.00035 | $0.00035 | 131,072 | • Model card • Meta AI blog • Research paper |
| llama-3-405b-instruct | meta-llama/llama-3-405b-instruct | Meta | $0.0050 | $0.016 | 16,384 | • Model card • Meta AI blog |
| mistral-large | mistralai/mistral-large | Mistral AI | $0.003 | $0.01 | 131,072 | • Model card • Blog post for Mistral Large 2 |
| mistral-medium-2505 | mistralai/mistral-medium-2505 | Mistral AI | $0.003 | $0.010 | 131,072 | • Model card • Blog post for Mistral Medium 3 |
| mistral-small-3-1-24b-instruct-2503 | mistralai/mistral-small-3-1-24b-instruct-2503 | Mistral AI | $0.0001 | $0.0003 | 131,072 | • Model card • Blog post for Mistral 3.1 |
| mt0-xxl-13b | bigscience/mt0-xxl | BigScience | $0.0018 | $0.0018 | 4,096 | • Model card • Research paper |
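
The context window column counts input and output tokens together, so a long prompt reduces the room that is left for generated text. The following Python sketch works through that arithmetic for a few of the models in the table. The `max_output_tokens` function and the `CONTEXT_WINDOW` dictionary are hypothetical names used for illustration, and the sketch assumes that you already know the token count of your prompt. A deployed model might also enforce a separate, lower limit on output tokens.

```python
# Context window sizes (input + output tokens), copied from the table above.
CONTEXT_WINDOW = {
    "openai/gpt-oss-120b": 131_072,
    "meta-llama/llama-3-405b-instruct": 16_384,
    "mistralai/mistral-large": 131_072,
    "bigscience/mt0-xxl": 4_096,
}


def max_output_tokens(model_id: str, input_tokens: int) -> int:
    """Return how many output tokens still fit in the model's context window.

    Returns 0 when the prompt alone fills or exceeds the window.
    """
    remaining = CONTEXT_WINDOW[model_id] - input_tokens
    return max(0, remaining)


# A 12,000-token prompt leaves 16,384 - 12,000 = 4,384 tokens for output.
print(max_output_tokens("meta-llama/llama-3-405b-instruct", 12_000))
# A 5,000-token prompt does not fit in a 4,096-token window at all.
print(max_output_tokens("bigscience/mt0-xxl", 5_000))
```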
Learn more
- IBM foundation models
- Third-party foundation models
- For more information about the foundation models that IBM provides for embedding and reranking text, see Supported encoder models.
- For a list of which models are provided in each regional data center, see Regional availability of foundation models.
- For details about foundation model pricing, see Billing details for generative AI assets.
- For information about pricing and rate limiting, see watsonx.ai Runtime plans.