Skip to main content

Documentation Index

Fetch the complete documentation index at: https://wwwpoc.ibm.com/llms.txt

Use this file to discover all available pages before exploring further.

This guide demonstrates using inference calls against a model hosted on Replicate. This guide will demonstrate a basic inference call using the replicate package as well as via LangChain. In both cases, you will provide a Replicate API Token. To see how you can use Ollama to host models locally instead, see how to build a VS Code Assistant with Granite.

Granite Code on Replicate

Jupyter notebook showing how to use Granite Code on Replicate