Supported foundation models in watsonx.ai

You can work with third-party and IBM foundation models in IBM watsonx.ai.

How to choose a model

To review factors that can help you to choose a model, such as supported tasks and languages, see Choosing a model and Foundation model benchmarks.

You can choose to deploy foundation models that are provided with watsonx.ai, tuned models, or custom foundation models that suits a specialized use case. To learn more about the various ways you can use to deploy models in watsonx.ai, see Foundation model deployment methods.

For more information about the foundation models provided with watsonx.ai for embedding and reranking text, see Supported encoder models.

Accessing models from other providers through the model gateway

You can securely access and interact with foundation models from multiple model providers through the model gateway. The model gateway provides an OpenAI-compatible API that routes requests to these foundation models. Use the model gateway to efficiently switch between multiple model providers by routing and formatting requests through a unified interface. You can build and deploy AI agents, RAG patterns, and more by using these models.

For more information, see Model gateway.

Provided foundation models that are ready to use

You can deploy foundation models from a collection of models curated by IBM in watsonx.ai. You can prompt these foundation models in the Prompt Lab or programmatically.

You can work with the following types of provided foundation models:

IBM foundation models
Third-party foundation models

All IBM foundation models in watsonx.ai are indemnified by IBM.

For information about the GPU requirements for the supported foundation models, see Foundation models in IBM watsonx.ai in the IBM Software Hub documentation.

IBM foundation models

You can inference the following supported foundation models that are provided by IBM. The foundation models must be deployed in your cluster by an administrator to be available for use. All IBM models are instruction-tuned.

Note:

A view of all foundation models is available in the Resource hub.

Latest IBM foundation models

The following table lists the latest IBM foundation models in watsonx.ai for inferencing.

Table 1. Latest IBM foundation models in watsonx.ai for inferencing
Model name	Context window (input + output tokens)	Supported tasks	More information
ibm-defense-4-0-small	131,072	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • function calling	• IBM Defense Model
ibm-defense-4-0-micro	131,072	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • function calling	• IBM Defense Model
granite-docling-258M	8,192	• extraction • generation • retrieval-augmented generation	• Granite Docling documentation
granite-4-h-tiny	131,072	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • code • extraction • translation • function calling	• Granite 4.0 documentation
granite-4-h-micro	131,072	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • code • extraction • translation • function calling • code	• Granite 4.0 documentation
granite-4-h-small	131,072	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • code • extraction • translation • function calling • code	• Granite 4.0 documentation
ibm-defense-3-3-8b-instruct	128,000	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • code • extraction • translation • function calling	• IBM Defense Model
granite-vision-3-3-2b	131,072	• question answering • generation	• Granite Vision documentation
granite-3-3-8b-instruct	131,072	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • code • extraction • translation • function calling	• Granite models blog
granite-guardian-3-2-5b	128,000	• classification • extraction • generation • question answering • summarization	• Granite Guardian documentation

Table 2. IBM foundation models provided with watsonx.ai for forecasting future values
Model name	Context length Minimum data points	More information
granite-ttm-512-96-r2	512	• Granite Time Series documentation • Research paper
granite-ttm-1024-96-r2	1,024	• Granite Time Series documentation • Research paper
granite-ttm-1536-96-r2	1,536	• Granite Time Series documentation • Research paper

Legacy IBM foundation models

The following table lists the legacy IBM foundation models in watsonx.ai for inferencing. When a newer version of a foundation model is released, the existing foundation models are moved to the legacy state. You can still access these legacy models with the same level of support as the latest versions, but switching to the newer models is recommended.

Table 3. Legacy IBM foundation models in watsonx.ai for inferencing
Model name	Context window (input + output tokens)	Supported tasks	More information
granite-3-2-8b-instruct	131,072	• classification • extraction • generation • question answering • summarization • retrieval-augmented generation • code • extraction • translation • function calling	• Website • Research paper
granite-3-2b-instruct	4,096	• classification • extraction • function calling • generation • question answering • summarization	• Website • Research paper
granite-3-8b-instruct	4,096	• classification • extraction • function calling • generation • question answering • summarization	• Website • Research paper
granite-guardian-3-2b	8,192	• classification • extraction • generation • question answering • summarization	• Website
granite-guardian-3-8b	8,192	• classification • extraction • generation • question answering • summarization	• Website
granite-20b-code-base-schema-linking	8,192	• code	• Research paper
granite-20b-code-base-sql-gen	8,192	• code	• Research paper
granite-8b-code-instruct	128,000	• code • classification • extraction • generation • question answering • summarization	• Website • Research paper
granite-vision-3-2-2b	131,072	• question answering	• Website • Research paper
granite-13b-instruct-v2	8,192	• classification • extraction • generation • question answering • summarization	• Website • Research paper

Third-party foundation models

You can inference the following supported third-party foundation models. An administrator must deploy the foundation models in your cluster before you can use these models.