Overview
Open WebUI supports 9 vector database options for storing and retrieving document embeddings in RAG workflows. Choose based on your deployment requirements, scale, and features needed.Supported Vector Databases
ChromaDB
Default, embedded, zero-config
Qdrant
High-performance, production-ready
Milvus
Scalable, cloud-native
PGVector
PostgreSQL extension
Elasticsearch
Full-text + vector search
OpenSearch
Open source Elasticsearch alternative
Pinecone
Managed cloud service
S3Vector
S3-backed vector storage
Oracle 23ai
Oracle database with vector support
ChromaDB (Default)
Embedded vector database with zero configuration.Configuration
ChromaDB works out of the box with no configuration:Features
- Zero configuration
- Embedded mode (no separate server)
- Automatic persistence
- Metadata filtering
- Hybrid search
Use Cases
Single-server deployments
Development and testing
Small to medium datasets
Qdrant
High-performance vector database optimized for production.Installation
Configuration
Features
- High performance
- Distributed deployment
- HNSW indexing
- Filtering and payload support
- Cloud-managed option
Milvus
Scalable, cloud-native vector database.Installation
Configuration
Features
- Horizontal scalability
- Multiple index types (HNSW, IVF, etc.)
- GPU acceleration support
- Time travel queries
- Partition support
PGVector
PostgreSQL extension for vector similarity search.Installation
Configuration
Features
- SQL-based queries
- ACID compliance
- Existing PostgreSQL infrastructure
- Familiar tooling
- Cost-effective
Elasticsearch
Full-text search with vector capabilities.Installation
Configuration
Features
- Hybrid search (full-text + vector)
- Mature ecosystem
- Advanced analytics
- Scalable architecture
OpenSearch
Open source alternative to Elasticsearch.Installation
Configuration
Features
- k-NN plugin for vector search
- Apache 2.0 license
- AWS managed service available
- Compatible with Elasticsearch APIs
Pinecone
Managed cloud vector database.Installation
Configuration
Features
- Fully managed
- No infrastructure management
- Auto-scaling
- Low latency
- Free tier available
Oracle 23ai
Oracle Database with AI Vector Search.Installation
Configuration
Features
- Integrated with Oracle Database
- ACID transactions
- Enterprise features
- SQL-based vector search
Configuration Comparison
| Database | Setup Complexity | Performance | Scalability | Cost |
|---|---|---|---|---|
| ChromaDB | ⭐ Easy | ⭐⭐⭐ Good | ⭐⭐ Medium | Free |
| Qdrant | ⭐⭐ Medium | ⭐⭐⭐⭐⭐ Excellent | ⭐⭐⭐⭐ High | Free/Paid |
| Milvus | ⭐⭐⭐ Complex | ⭐⭐⭐⭐⭐ Excellent | ⭐⭐⭐⭐⭐ Highest | Free/Paid |
| PGVector | ⭐⭐ Medium | ⭐⭐⭐ Good | ⭐⭐⭐ High | Free |
| Elasticsearch | ⭐⭐⭐ Complex | ⭐⭐⭐⭐ Very Good | ⭐⭐⭐⭐ High | Free/Paid |
| OpenSearch | ⭐⭐⭐ Complex | ⭐⭐⭐⭐ Very Good | ⭐⭐⭐⭐ High | Free |
| Pinecone | ⭐ Easy | ⭐⭐⭐⭐ Very Good | ⭐⭐⭐⭐⭐ Highest | Paid |
| Oracle 23ai | ⭐⭐⭐ Complex | ⭐⭐⭐⭐ Very Good | ⭐⭐⭐⭐ High | Paid |
Switching Vector Databases
RAG Configuration
Embedding Models
Open WebUI uses embedding models to convert text to vectors:all-MiniLM-L6-v2(default, 384 dims)all-mpnet-base-v2(768 dims)text-embedding-3-small(OpenAI, 1536 dims)
Chunk Configuration
Control how documents are split:Retrieval Settings
Performance Tuning
Index Type
Choose appropriate index (HNSW, IVF, FLAT) based on dataset size
Dimension Reduction
Use smaller embedding models for better performance
Batch Processing
Index documents in batches for efficiency
Caching
Enable result caching for repeated queries
Troubleshooting
Connection Failed
Connection Failed
- Verify database is running:
docker psor check service status - Check connection string/URL format
- Verify credentials (API key, username, password)
- Check network connectivity and firewall rules
- Review database logs for errors
Slow Queries
Slow Queries
- Optimize index configuration
- Reduce embedding dimensions
- Decrease TOP_K value
- Add more resources to database
- Enable caching
Out of Memory
Out of Memory
- Use disk-based storage instead of in-memory
- Reduce batch size during indexing
- Scale database resources
- Archive old/unused collections
Best Practices
-
Choose Based on Scale:
- Small datasets: ChromaDB
- Medium: PGVector, Qdrant
- Large: Milvus, Elasticsearch
- Managed: Pinecone, Qdrant Cloud
-
Monitor Performance:
- Track query latency
- Monitor index size
- Watch memory usage
-
Backup Strategy:
- Regular database backups
- Document source preservation
- Embedding model versioning
-
Security:
- Use authentication
- Enable TLS/SSL
- Network isolation
- API key rotation
References
- ChromaDB: docs.trychroma.com
- Qdrant: qdrant.tech/documentation
- Milvus: milvus.io/docs
- PGVector: github.com/pgvector/pgvector