Models and AI Providers
GPT, Llama, and Qwen - oh my!
Last updated
GPT, Llama, and Qwen - oh my!
Last updated
Kiln can use essentially any LLM model from a number of providers:
Run locally with Ollama
Connect a cloud provider like OpenAI, Groq, OpenRouter, AWS, Azure, Gemini API, Anthropic, Fireworks and much more. You provide your own API keys and we never have access to your dataset.
Connect to any OpenAI compatible server, like LiteLLM or vLLM
When you first run Kiln, the app will prompt you to setup one or more AI providers. You need at least one for the core features of Kiln to function.
We currently support the following AI providers:
Ollama
OpenRouter
OpenAI
Groq
Fireworks.ai
Together.ai
AWS Bedrock
Anthropic
Gemini AI Studio / Gemini API
Google Vertex AI
Azure OpenAI
HuggingFace
Any OpenAI compatible API
If you want to add or remove providers after initial setup, open Settings > AI Providers & Models
.
Don't see you provider listed? Most providers offer an OpenAI compatible API that Kiln can connect to. This includes common open source projects like LiteLLM and vLLM. Search their docs, and connect via the "Custom API" option in Kiln.
Models come in several flavours, from very easy to use, to advanced methods for expert users:
Provider Specific Guidance
These are models that have been tested to work with Kiln's various features. These are the easiest to use, and generally won't result in errors.
To use these models simply connect any AI provider from the Settings page. Once connected, you can select these model from the model dropdown on the Run screen. The dropdown will warn if you attempt to use a model that doesn't support a feature (like structured output or synthetic data generation).
GPT 4o Mini
Azure OpenAI, OpenAI, OpenRouter
✅︎
✅︎
✅︎
GPT 4o
Azure OpenAI, OpenAI, OpenRouter
✅︎
✅︎
✅︎
GPT o3 Mini - Low
Azure OpenAI, OpenAI
✅︎
✅︎
GPT o3 Mini - Medium
Azure OpenAI, OpenAI
✅︎
✅︎
GPT o3 Mini - High
Azure OpenAI, OpenAI
✅︎
✅︎
GPT o1 - Low
Azure OpenAI, OpenAI
✅︎
✅︎
GPT o1 - Medium
Azure OpenAI, OpenAI
✅︎
✅︎
GPT o1 - High
Azure OpenAI, OpenAI
✅︎
✅︎
Claude 3.5 Haiku
Anthropic, Google Vertex AI, OpenRouter
✅︎
✅︎
Claude 3.5 Sonnet
Anthropic, Google Vertex AI, OpenRouter
✅︎
✅︎
Claude 3.7 Sonnet
Anthropic, OpenRouter
✅︎
✅︎
Claude 3.7 Sonnet Thinking
Anthropic, OpenRouter
✅︎
✅︎
✅︎
Gemini 1.5 Pro
Gemini API, Google Vertex AI, OpenRouter
✅︎
✅︎
Gemini 1.5 Flash
Gemini API, Google Vertex AI, OpenRouter
✅︎
✅︎
Gemini 1.5 Flash 8B
Gemini API, OpenRouter
✅︎
Gemini 2.0 Flash
Gemini API, Google Vertex AI, OpenRouter
✅︎
✅︎
Nemotron 70B
OpenRouter
Llama 3.1 8B
Amazon Bedrock, Fireworks AI, Groq, Ollama, OpenRouter, Together AI
✅︎ (some providers)
✅︎ (some providers)
✅︎
Llama 3.1 70B
Amazon Bedrock, Fireworks AI, Ollama, OpenRouter, Together AI
✅︎
✅︎ (some providers)
✅︎
Llama 3.1 405B
Amazon Bedrock, Fireworks AI, Ollama, OpenRouter, Together AI
✅︎
✅︎ (some providers)
✅︎
Mistral Nemo
OpenRouter
✅︎
✅︎
Mistral Large
Amazon Bedrock, Ollama, OpenRouter
✅︎
✅︎
Llama 3.2 1B
Groq, Hugging Face, Ollama, OpenRouter, Together AI
✅︎ (some providers)
✅︎ (some providers)
✅︎
Llama 3.2 3B
Groq, Hugging Face, Ollama, OpenRouter, Together AI
✅︎ (some providers)
✅︎
Llama 3.2 11B
Fireworks AI, Groq, Hugging Face, Ollama, OpenRouter, Together AI
✅︎ (some providers)
✅︎ (some providers)
Llama 3.2 90B
Fireworks AI, Groq, Ollama, OpenRouter, Together AI
✅︎ (some providers)
✅︎ (some providers)
Llama 3.3 70B
Fireworks AI, Google Vertex AI, Groq, Ollama, OpenRouter, Together AI
✅︎ (some providers)
✅︎ (some providers)
✅︎
Phi 3.5
Fireworks AI, Ollama, OpenRouter
Phi 4 - 14B
Ollama, OpenRouter
✅︎
✅︎ (some providers)
Phi 4 - 5.6B
OpenRouter
Phi 4 Mini - 3.8B
Ollama
✅︎
✅︎
Gemma 2 2B
Ollama
✅︎
Gemma 2 9B
Ollama, OpenRouter
✅︎ (some providers)
Gemma 2 27B
Ollama, OpenRouter
✅︎
Gemma 3 1B
Ollama, OpenRouter
Gemma 3 4B
Ollama, OpenRouter
✅︎
✅︎
Gemma 3 12B
Ollama, OpenRouter
✅︎
✅︎
Gemma 3 27B
Hugging Face, Ollama, OpenRouter
✅︎
✅︎
Mixtral 8x7B
Ollama, OpenRouter
✅︎
✅︎ (some providers)
QwQ 32B (Qwen Reasoning)
Fireworks AI, Groq, Ollama, OpenRouter, Together AI
✅︎
✅︎
✅︎
✅︎
Qwen 2.5 7B
Ollama, OpenRouter
✅︎
✅︎
Qwen 2.5 14B
Ollama, Together AI
✅︎
✅︎ (some providers)
✅︎
Qwen 2.5 72B
Fireworks AI, Ollama, OpenRouter, Together AI
✅︎ (some providers)
✅︎ (some providers)
✅︎
Mistral Small 3
Ollama, OpenRouter
✅︎
✅︎
DeepSeek V3
Fireworks AI, OpenRouter, Together AI
✅︎
✅︎ (some providers)
✅︎
DeepSeek R1
Fireworks AI, Ollama, OpenRouter, Together AI
✅︎
✅︎
✅︎
✅︎
DeepSeek R1 Distill Qwen 32B
Ollama, OpenRouter, Together AI
✅︎
✅︎
✅︎
DeepSeek R1 Distill Llama 70B
Ollama, OpenRouter, Together AI
✅︎
✅︎ (some providers)
✅︎ (some providers)
DeepSeek R1 Distill Qwen 14B
Ollama, OpenRouter, Together AI
✅︎
✅︎ (some providers)
✅︎ (some providers)
DeepSeek R1 Distill Llama 8B
Ollama, OpenRouter
✅︎
DeepSeek R1 Distill Qwen 7B
Ollama
✅︎
DeepSeek R1 Distill Qwen 1.5B
Ollama, OpenRouter, Together AI
✅︎ (some providers)
Dolphin 2.9 8x22B
Ollama, OpenRouter
✅︎
✅︎
Grok 2
OpenRouter
✅︎
✅︎
The following models can be fine tuned using a Fireworks.ai API key:
Any Ollama model you have installed on your server will be available to use in Kiln. To add models, simply install them with the Ollama CLI ollama pull <model_name>
.
Some Ollama models are included/tested, and will automatically appear in the model dropdown. Any untested Ollama models will still appear in the dropdown, but in the "Untested" section.
If you want to use a model that is not in the list but is supported by one of our AI providers, you can use a custom model.
To use a custom model, click "Add Model" in the "AI Providers & Models" section of Settings.
These will appear in the "untested" section of the model dropdown.
When using Azure OpenAI API, you need to deploy each model you want to use, manually through the Azure console. If you have not, you'll get deployment errors when trying to call a model.
Suggested - Deploy with Default Names: If you deploy with the default names, for example "gpt-4o"/"gpt-4o-mini", you can simply use the models using the dropdown in Kiln.
When using Azure AI Foundry, you need to deploy each model you want to use manually through the Azure console. If you have not, you'll get deployment errors when trying to call a model.
When using Vertex, many models need to be manually enabled through the console before using them (primarily Anthropic models). If you see errors when trying to run a model, open the vertex AI console for your project, go to the model garden, and enable that model.
Similarly, if you see quota errors you may need to manage/request quota from the Vertex console. Quota is specific to the model + region. Ensure you request quota in the region you specified when you connected Vertex AI to Kiln.
Hugging face errors are not always descriptive - if you get 400 errors, it's likely the model you've selected requires a Hugging Face Pro subscription. Try the same model in their UI for a more helpful error messages.
If you have an OpenAI compatible server (LiteLLM, vLLM, etc.), you can use it in Kiln.
To do this, add a "Custom API" in the "AI Providers & Models" section of Settings.
All models supported by this API will appear in the "untested" section of the model dropdown.
Notes:
The API must support the /v1/models
endpoint, so Kiln can access the list of models.
Many Kiln tasks produce structured (JSON) output. These can be hard to get working on custom servers, as each server/model pair usually needs some configuration to reliably produce structured output (tools vs json_mode vs json parsing vs json_schema, etc).
- Anthropic, Huggingface, VertexAI, TogetherAI, and more.
You can request we add models .
For "✅︎ (some providers)
", see for details of which providers are supported.
Deployments with Custom Names: If you have a non-standard depolyment name, you'll have to add each model as a , using the deployment name as the model name.
After deploying a model, you must add it to Kiln as a a , using the deployment name as the model name.
Hugging face has thousands of models. We've included a few of these common models in the Kiln built-in model list, but you can add any hugging face model via the option.