Granite Embedding Models

Granite is IBM's suite of generative AI models that provides enterprise control and customization through accessible model weights and architectures.

Granite Embedding Models

The Granite Embedding collection delivers innovative sentence-transformer models purpose-built for retrieval-based applications. Featuring a bi-encoder architecture, these models generate high-quality embeddings for textual inputs such as queries, passages, and documents, enabling seamless comparison through cosine similarity. Built using retrieval oriented pretraining, contrastive finetuning, knowledge distillation, and model merging, the Granite Embedding lineup is optimized to ensure strong alignment between query and passage embeddings.

Built on a foundation of carefully curated, permissibly licensed public datasets, the Granite Embedding models set a high standard for performance, maintaining competitive scores not only on academic benchmarks such as BEIR, but also out-performing models of the same size on many enterprise use cases. Developed to meet enterprise-grade expectations, they are crafted transparently in accordance with IBM's AI Ethics principles and offered under the Apache 2.0 license for both research and commercial innovation.

Model
granite-embedding-30m-english
granite-embedding-125m-english
granite-embedding-107m-multilingual
granite-embedding-278m-multilingual

Embedding Size

384

768

384

768

Vocabulary Size

50265

50265

250002

250002

Max Sequence Length

512

512

512

512

No. of Parameters

30M

125M

107M

278M