Supported foundation models

You can work with third-party and IBM foundation models in IBM watsonx.ai.You can use foundation models that are provided by IBM and are ready to use immediately, or deploy foundation models on-demand to use exclusively for your organization.

How to choose a model

To review factors that can help you to choose a model, such as supported tasks and languages, see Choosing a model and Foundation model benchmarks.

Provided foundation models that are ready to use

A collection of open source and IBM foundation models are deployed in IBM watsonx.ai. You can prompt these foundation models in the Prompt Lab or programmatically.

For details on metering for foundation model inference in watsonx.ai, see Billing rates for inferencing foundation models. For more information about the IBM watsonx.ai service description with various cloud providers, see:

You can work with the following types of provided foundation models:

IBM foundation models
Third-party foundation models

IBM foundation models

The following table lists the supported IBM foundation models that IBM provides for inferencing.

You can also access some IBM foundation models from third-party repositories, such as Hugging Face. IBM foundation models that you obtain from a third-party repository are not indemnified by IBM. Only IBM foundation models that you access from watsonx.ai are indemnified by IBM. For more information about contractual protections related to IBM indemnification, see the IBM Client Relationship Agreement.

Attention:

If your watsonx region is the Dallas data center on IBM Cloud, you can follow the model card links. Otherwise, search for the model name in the Resource hub. The model might not be available in all regions or cloud platforms.

Table 2a. IBM foundation models provided with watsonx.ai for inferencing
Model name	API model ID	Input price (USD/1,000 tokens)	Output price (USD/1,000 tokens)	Context window (input + output tokens)	More information
granite-4-h-small	`ibm/granite-4-h-small`	$0.00006	$0.00025	131,072	• Model card • Website
granite-3-3-8b-instruct	`ibm/granite-3-3-8b-instruct`	$0.0002	$0.0002	131,072	• Model card • Website
granite-3-8b-instruct	`ibm/granite-3-8b-instruct`	$0.0002	$0.0002	131,072	• Model card • Website • Research paper
granite-3-2-8b-instruct	`ibm/granite-3-2-8b-instruct`	$0.0002	$0.0002	131,072	• Model card • Website • Research paper
granite-8b-code-instruct	`ibm/granite-8b-code-instruct`	$0.0006	$0.0006	128,000	• Model card • Website • Research paper

Table 2b. IBM foundation models provided with watsonx.ai for forecasting future values
Model name	API model ID	Input price (USD/1,000 data points)	Output price (USD/1,000 data points)	Context length Min data points	More information
granite-ttm-512-96-r2	`ibm/granite-ttm-512-96-r2`	$0.00013	$0.00038	512	• Model card • Website • Research paper
granite-ttm-1024-96-r2	`ibm/granite-ttm-1024-96-r2`	$0.00013	$0.00038	1,024	• Model card • Website • Research paper
granite-ttm-1536-96-r2	`ibm/granite-ttm-1536-96-r2`	$0.00013	$0.00038	1,536	• Model card • Website • Research paper

Third-party foundation models

The following table lists the supported third-party foundation models for inferencing.

Attention:

Table 3. Third-party foundation models supported in watsonx.ai
Model name	API model ID	Provider	Input price (USD/1,000 tokens)	Output price (USD/1,000 tokens)	Context window (input + output tokens)	More information
allam-1-13b-instruct	`sdaia/allam-1-13b-instruct`	National Center for Artificial Intelligence and Saudi Authority for Data and Artificial Intelligence	$0.0018	$0.0018	4,096	• Model card
gpt-oss-120b	`openai/gpt-oss-120b`	OpenAI	$0.00015	$0.0006	131,072	• Model card • OpenAI blog
llama-4-maverick-17b-128e-instruct-fp8	`meta-llama/llama-4-maverick-17b-128e-instruct-fp8`	Meta	$0.00035	$0.0014	131,072	• Model card • Meta AI blog
llama-3-3-70b-instruct	`meta-llama/llama-3-3-70b-instruct`	Meta	$0.00071	$0.00071	131,072	• Model card • Meta AI blog
llama-3-2-11b-vision-instruct	`meta-llama/llama-3-2-11b-vision-instruct`	Meta	$0.00035	$0.00035	131,072	• Model card • Meta AI blog • Research paper
llama-3-2-90b-vision-instruct	`meta-llama/llama-3-2-90b-vision-instruct`	Meta	$0.0020	$0.0020	131,072	• Model card • Meta AI blog • Research paper
llama-guard-3-11b-vision	`meta-llama/llama-guard-3-11b-vision`	Meta	$0.00035	$0.00035	131,072	• Model card • Meta AI blog • Research paper
llama-3-405b-instruct	`meta-llama/llama-3-405b-instruct`	Meta	$0.0050	$0.016	16,384	• Model card • Meta AI blog
mistral-large	`mistralai/mistral-large`	Mistral AI	$0.003	$0.01	131,072	• Model card • Blog post for Mistral Large 2
mistral-medium-2505	`mistralai/mistral-medium-2505`	Mistral AI	$0.003	$0.010	131,072	• Model card • Blog post for Mistral Medium 3
mistral-small-3-1-24b-instruct-2503	`mistralai/mistral-small-3-1-24b-instruct-2503`	Mistral AI	$0.0001	$0.0003	131,072	• Model card • Blog post for Mistral 3.1
mt0-xxl-13b	`bigscience/mt0-xxl`	BigScience	$0.0018	$0.0018	4,096	• Model card • Research paper

Learn more

IBM foundation models
Third-party foundation models
For more information about the foundation models that IBM provides for embedding and reranking text, see Supported encoder models.
For a list of which models are provided in each regional data center, see Regional availability of foundation models.
For details about foundation model pricing, see Billing details for generative AI assets.
For information about pricing and rate limiting, see watsonx.ai Runtime plans.