Overview
Open WebUI supports multiple LLM providers through OpenAI-compatible API endpoints. This includes native integrations with Anthropic Claude, Google Gemini, and many others.Supported Providers
Anthropic Claude
Claude 3 Opus, Sonnet, and Haiku models
Google AI
Gemini Pro, Gemini Ultra, and PaLM 2
OpenRouter
Access to 100+ models from multiple providers
Groq
Ultra-fast LLM inference
Mistral AI
Mistral, Mixtral, and specialized models
Together AI
Open source models with fast inference
Anthropic Claude
Configuration
Available Models
claude-3-opus-20240229- Most powerful modelclaude-3-sonnet-20240229- Balanced performance and speedclaude-3-haiku-20240307- Fastest and most affordableclaude-2.1- Previous generation
Open WebUI automatically detects Anthropic URLs and applies appropriate model mappings.File: backend/open_webui/utils/anthropic.py
Features
- System prompts
- Tool/function calling
- Vision capabilities (Claude 3)
- Long context (200K tokens)
Google AI (Gemini)
Configuration
Available Models
gemini-1.5-pro- Latest and most capablegemini-1.5-flash- Optimized for speedgemini-1.0-pro- Previous generation
Features
- Multimodal (text, image, video, audio)
- 2M context window (Gemini 1.5 Pro)
- Code execution
- Function calling
OpenRouter
Access 100+ models from multiple providers through a single API.Configuration
OpenRouter requires HTTP-Referer and X-Title headers. Open WebUI automatically adds these when it detects an OpenRouter URL.File: backend/open_webui/routers/openai.py:134
Popular Models
anthropic/claude-3-opusgoogle/gemini-pro-1.5meta-llama/llama-3-70b-instructmistralai/mixtral-8x7b-instructopenai/gpt-4-turbo
Groq
Ultra-fast inference for open source models.Configuration
Available Models
llama-3.1-70b-versatilellama-3.1-8b-instantmixtral-8x7b-32768gemma-7b-it
Features
- Extremely fast inference (500+ tokens/sec)
- Free tier available
- OpenAI-compatible API
Mistral AI
Configuration
Available Models
mistral-large-latest- Most capablemistral-medium-latest- Balancedmistral-small-latest- Fast and efficientmixtral-8x7b- Open source MoEcodestral-latest- Code generation specialist
Together AI
Open source models with fast inference.Configuration
Features
- 50+ open source models
- Fine-tuning support
- Fast inference
- Competitive pricing
Perplexity AI
Configuration
Models
llama-3.1-sonar-large-128k-online- With web searchllama-3.1-sonar-small-128k-online- Faster with web searchllama-3.1-70b-instruct- Base model
Cohere
Configuration
Models
command-r-plus- Most capablecommand-r- Balancedcommand-light- Fast and efficient
Hugging Face Inference
Configuration
Usage
Access any model on Hugging Face:Multiple Provider Setup
Configure multiple providers simultaneously:Provider-Specific Features
Vision Models
Providers with vision support:- OpenAI:
gpt-4-vision-preview,gpt-4-turbo - Anthropic:
claude-3-opus,claude-3-sonnet,claude-3-haiku - Google:
gemini-1.5-pro,gemini-1.5-flash
Function Calling
Providers with function/tool calling:- OpenAI: All GPT models
- Anthropic: Claude 3 models
- Google: Gemini models
- Mistral: Most models
Streaming
All providers support streaming responses through Server-Sent Events (SSE).Cost Optimization
Model Selection
Use smaller models (haiku, flash, small) for simple tasks
Provider Comparison
Compare costs across providers for equivalent capabilities
Caching
Enable model caching to reduce duplicate API calls
Free Tiers
Leverage free tiers from Groq, Hugging Face, etc.
Troubleshooting
Provider Not Responding
Provider Not Responding
- Verify API key is valid
- Check base URL is correct
- Ensure provider service is operational
- Check firewall/network restrictions
Models Not Appearing
Models Not Appearing
Some providers require specific model IDs in configuration:
Rate Limiting
Rate Limiting
Each provider has different rate limits:
- Use multiple API keys
- Implement exponential backoff
- Monitor usage through provider dashboard
Best Practices
- API Key Security: Store keys in environment variables
- Model Prefixing: Use prefixes to distinguish provider models
- Cost Monitoring: Track usage across providers
- Fallback Providers: Configure multiple providers for redundancy
- Model Tagging: Use tags to categorize models by capability
References
- Anthropic API: docs.anthropic.com
- Google AI: ai.google.dev
- OpenRouter: openrouter.ai/docs
- Groq: console.groq.com/docs
- Mistral: docs.mistral.ai