Overview
Open WebUI supports powerful image generation and editing capabilities through multiple engines, allowing you to create custom visuals directly in your chat conversations.
Supported Engines
- OpenAI
- Gemini
- ComfyUI
- AUTOMATIC1111
DALL·E Integration
- DALL·E 2: Fast, cost-effective
- DALL·E 3: Higher quality, more detailed
- GPT-IMAGE 1 & 1.5: Latest models
Image Generation
Basic Generation
Create images from text descriptions.
API Usage
Generate images programmatically.
Generation Parameters
Prompt
Text description of desired image
- Be specific and detailed
- Include style, mood, lighting
- Mention artistic techniques
Size
Image dimensions
- Standard: 512x512, 1024x1024
- Custom: WIDTHxHEIGHT format
- “auto”: Model-dependent sizing
Steps
Generation iterations
- More steps = higher quality, with diminishing returns at high counts
- More steps = slower generation
- Typical range: 20-100
Negative Prompt
What to avoid in the image
- “blurry, distorted”
- “low quality, watermark”
- Engine-dependent support
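The parameters above can be combined into a single request body. The following is a minimal sketch, not Open WebUI's actual client code: the function name and the JSON field names (`prompt`, `model`, `size`, `steps`, `negative_prompt`) are assumptions chosen to mirror the parameter names in this section, and the auto-size patterns are the ones listed under Auto-Size Support. Check the API reference of your Open WebUI version for the real endpoint and schema.

```python
import fnmatch
import json
import re

SIZE_PATTERN = re.compile(r"[1-9]\d*x[1-9]\d*")   # WIDTHxHEIGHT, e.g. 1024x1024
AUTO_SIZE_MODELS = ("gpt-image-*", "dall-e-*")    # patterns from "Auto-Size Support"


def build_generation_payload(prompt, model, size="1024x1024",
                             steps=None, negative_prompt=None):
    """Assemble a JSON-serializable body for an image generation request.

    All field names are illustrative assumptions, not a documented schema.
    """
    prompt = prompt.strip()
    if not prompt:
        raise ValueError("prompt must not be empty")

    if size == "auto":
        # "auto" is model-dependent, so reject it for models outside the patterns.
        if not any(fnmatch.fnmatchcase(model, p) for p in AUTO_SIZE_MODELS):
            raise ValueError(f"model {model!r} does not support size='auto'")
    elif SIZE_PATTERN.fullmatch(size) is None:
        raise ValueError(f"size must be 'auto' or WIDTHxHEIGHT, got {size!r}")

    payload = {"prompt": prompt, "model": model, "size": size}
    # Optional fields are sent only when set, because support is engine-dependent.
    if steps is not None:
        if steps < 1:
            raise ValueError("steps must be at least 1")
        payload["steps"] = steps
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    return payload


if __name__ == "__main__":
    body = build_generation_payload(
        "A lighthouse at dusk, watercolor, soft warm lighting",
        model="dall-e-3",
        steps=30,
        negative_prompt="blurry, distorted, watermark",
    )
    print(json.dumps(body, indent=2))
```

Validating the size string before sending keeps a malformed value from costing a round trip to the engine.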
Image Editing
Modify existing images with AI.
Configuration
Edit Workflow
Optional Parameters
- Background color for transparency
- Multiple image inputs (for Gemini)
- Size adjustments
Edit API
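As a rough illustration of what an edit request might carry, the sketch below builds a body from an input image, a prompt, and the optional parameters listed above. Everything here is an assumption made for illustration: the function name, the field names (`image`, `prompt`, `background`, `size`), and the choice to send images as base64 data URLs labeled as PNG. Consult the API reference of your Open WebUI version for the actual schema.

```python
import base64


def build_edit_payload(image_bytes, prompt, background=None, size=None,
                       extra_images=()):
    """Assemble a body for an image edit request (hypothetical field names)."""
    if not image_bytes:
        raise ValueError("an input image is required")
    if not prompt.strip():
        raise ValueError("prompt must not be empty")

    def to_data_url(raw):
        # Base64-encode the bytes so the image can travel inside a JSON body.
        # The PNG media type is assumed, not detected.
        return "data:image/png;base64," + base64.b64encode(raw).decode("ascii")

    images = [to_data_url(image_bytes)] + [to_data_url(raw) for raw in extra_images]
    payload = {
        "prompt": prompt.strip(),
        # One image is sent as a string, several as a list (e.g. for Gemini).
        "image": images[0] if len(images) == 1 else images,
    }
    if background is not None:
        payload["background"] = background
    if size is not None:
        payload["size"] = size
    return payload
```

Optional fields are left out entirely when unset, so engines that do not support them never see them.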
Supported Edit Operations
- OpenAI
- Gemini
- ComfyUI
DALL·E Edit Capabilities:
- Single image input
- Inpainting/outpainting
- Background replacement
- Style transfer
- Detail enhancement
ComfyUI Integration
Advanced users can leverage ComfyUI’s powerful workflow system.
Workflow Configuration
Design Workflow
Create workflow in ComfyUI interface:
- Load desired models
- Configure nodes
- Test generation
Workflow Node Types
- model: Checkpoint/model selector
- prompt: Positive/negative prompts
- size: Width/height dimensions
- steps: Sampling steps
- seed: Random seed control
- sampler: Sampling method
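To make the node types above concrete, here is a small sketch of how values could be written into a workflow exported in ComfyUI's API format, where each node ID maps to a `class_type` and an `inputs` object. The workflow is trimmed for illustration (links between nodes are omitted, so it would not run in ComfyUI as is), and `NODE_MAP` and `apply_values` are hypothetical names, not Open WebUI internals. The `size` type is split into separate `width` and `height` entries here.

```python
import copy

# Trimmed API-format workflow: node ID -> class_type + inputs.
WORKFLOW = {
    "3": {"class_type": "KSampler",
          "inputs": {"seed": 0, "steps": 20, "sampler_name": "euler"}},
    "4": {"class_type": "CheckpointLoaderSimple",
          "inputs": {"ckpt_name": "model.safetensors"}},
    "5": {"class_type": "EmptyLatentImage",
          "inputs": {"width": 512, "height": 512}},
    "6": {"class_type": "CLIPTextEncode", "inputs": {"text": ""}},
}

# Hypothetical mapping: node type -> list of (node ID, input key) targets.
NODE_MAP = {
    "model": [("4", "ckpt_name")],
    "prompt": [("6", "text")],
    "width": [("5", "width")],
    "height": [("5", "height")],
    "steps": [("3", "steps")],
    "seed": [("3", "seed")],
    "sampler": [("3", "sampler_name")],
}


def apply_values(workflow, node_map, values):
    """Return a copy of the workflow with each value written to its mapped node."""
    result = copy.deepcopy(workflow)  # leave the template untouched
    for node_type, value in values.items():
        for node_id, key in node_map[node_type]:
            if node_id not in result:
                raise KeyError(f"node {node_id} for {node_type!r} is not in the workflow")
            result[node_id]["inputs"][key] = value
    return result


if __name__ == "__main__":
    filled = apply_values(WORKFLOW, NODE_MAP,
                          {"prompt": "a red fox in snow", "steps": 30, "seed": 42})
    print(filled["6"]["inputs"], filled["3"]["inputs"])
```

Copying the template before writing means one exported workflow can be reused across requests.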
AUTOMATIC1111 Integration
Configuration
Custom Parameters
Pass additional parameters to AUTOMATIC1111.
Model Management
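In AUTOMATIC1111's own web API (available when the web UI is started with `--api`), extra parameters and the checkpoint choice can both travel in the body of a `/sdapi/v1/txt2img` request. The sketch below builds such a body. The helper name and its defaults are assumptions, and how Open WebUI forwards these values internally is not described in this section.

```python
def build_txt2img_payload(prompt, width=512, height=512, steps=20,
                          negative_prompt="", checkpoint=None, extra_params=None):
    """Build a request body for AUTOMATIC1111's /sdapi/v1/txt2img endpoint."""
    payload = {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "width": width,
        "height": height,
        "steps": steps,
    }
    # Custom parameters (e.g. cfg_scale, sampler_name) are merged last,
    # so they can override the defaults above.
    payload.update(extra_params or {})
    if checkpoint is not None:
        # override_settings is the txt2img field for per-request setting overrides;
        # sd_model_checkpoint selects the checkpoint to use.
        overrides = dict(payload.get("override_settings", {}))
        overrides["sd_model_checkpoint"] = checkpoint
        payload["override_settings"] = overrides
    return payload


if __name__ == "__main__":
    body = build_txt2img_payload(
        "a stone castle on a cliff, overcast light",
        steps=25,
        checkpoint="sd_xl_base_1.0.safetensors",  # example filename, adjust to your install
        extra_params={"cfg_scale": 7, "sampler_name": "Euler a"},
    )
    print(body)
```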
Switch models dynamically.
Image Storage
Generated and edited images are automatically stored in the file system.
Storage Flow
Upload to Storage
- Converted to standard format (PNG/JPEG)
- Assigned unique file ID
- Metadata stored (prompt, model, parameters)
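The storage steps above can be pictured with a short sketch that writes image bytes under a unique file ID and saves the metadata beside them. This is an illustration only: `store_image`, the directory layout, and the JSON sidecar file are assumptions, not Open WebUI's storage implementation, and format conversion is omitted because it would need an imaging library.

```python
import json
import uuid
from pathlib import Path


def store_image(image_bytes, storage_dir, prompt, model, parameters, extension="png"):
    """Write image bytes under a unique file ID and save metadata next to them."""
    storage_dir = Path(storage_dir)
    storage_dir.mkdir(parents=True, exist_ok=True)

    file_id = str(uuid.uuid4())  # random ID, independent of prompt or content
    image_path = storage_dir / f"{file_id}.{extension}"
    image_path.write_bytes(image_bytes)

    metadata = {
        "id": file_id,
        "filename": image_path.name,
        "prompt": prompt,
        "model": model,
        "parameters": parameters,
    }
    # Sidecar JSON file so the generation settings can be looked up by file ID.
    (storage_dir / f"{file_id}.json").write_text(json.dumps(metadata, indent=2))
    return metadata
```

Because the ID is random rather than derived from the prompt, two generations with identical settings are stored as separate files.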
Accessing Images
Permissions
Control who can generate images.
Best Practices
Prompt Engineering
Write effective prompts:
- Be specific about style and details
- Include lighting and mood
- Specify artistic techniques
- Use negative prompts to avoid unwanted elements
Quality vs Speed
Balance performance:
- More steps = better quality, slower
- Larger sizes = more detail, higher cost
- Test with low steps first
- Increase for final generation
Model Selection
Choose appropriate engine:
- OpenAI: Best quality, easiest setup
- Gemini: Good balance, Google ecosystem
- ComfyUI: Maximum control, self-hosted
- AUTOMATIC1111: Free, extensive models
Cost Management
Control expenses:
- Monitor API usage
- Set generation limits
- Use local engines for volume
- Cache common generations
Troubleshooting
Connection failed to image engine
Check:
- Base URL is correct and accessible
- API key is valid (if required)
- Firewall allows connection
- Engine is running and healthy
- Authentication credentials correct
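Several of the checks above can be run from a script. The sketch below sends one GET request to an engine URL and reports whether the engine is unreachable, rejecting the credentials, or responding normally. `check_engine` is a hypothetical helper, and the Bearer header is an assumption: some engines use other schemes (for example HTTP basic auth) or no authentication at all.

```python
import urllib.error
import urllib.request


def check_engine(base_url, api_key=None, timeout=5.0):
    """Return (ok, detail) describing whether the engine answers an HTTP GET."""
    try:
        request = urllib.request.Request(base_url)
        if api_key:
            # Assumed auth scheme; adjust for engines that use something else.
            request.add_header("Authorization", f"Bearer {api_key}")
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return True, f"HTTP {response.status}"
    except urllib.error.HTTPError as exc:
        # The server answered, so the URL is reachable; 401/403 point at credentials.
        hint = "check API key" if exc.code in (401, 403) else "check engine health"
        return False, f"HTTP {exc.code}: {hint}"
    except ValueError as exc:
        # Raised for malformed URLs such as a missing scheme.
        return False, f"invalid URL: {exc}"
    except OSError as exc:
        # DNS failure, refused connection, firewall drop, or timeout.
        return False, f"unreachable: {exc}"


if __name__ == "__main__":
    # Example address only; substitute the base URL configured for your engine.
    print(check_engine("http://localhost:7860/", timeout=3))
```

Separating "unreachable" from an HTTP error status narrows the problem to either the network path or the credentials.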
Poor image quality
Solutions:
- Increase step count
- Use larger image size
- Refine prompt with more details
- Try different model
- Adjust sampling parameters
Generation timeout
Fix:
- Reduce image size
- Lower step count
- Check engine performance
- Increase timeout settings
- Use faster sampler
ComfyUI workflow fails
Verify:
- Workflow JSON is valid
- Node IDs correctly mapped
- Required models installed
- ComfyUI API accessible
- Workflow tested in ComfyUI first
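The first two checks can be automated before a workflow is saved. The sketch below assumes the workflow was exported in ComfyUI's API format (a JSON object keyed by node ID, each node holding `class_type` and `inputs`). `validate_workflow` and the shape of the `node_ids` mapping are hypothetical, chosen for illustration.

```python
import json


def validate_workflow(workflow_json, node_ids):
    """Return a list of problems found in an API-format workflow export.

    node_ids maps a label such as "prompt" to the node ID it should target.
    An empty list means no problems were detected by these checks.
    """
    try:
        workflow = json.loads(workflow_json)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg} at line {exc.lineno}"]
    if not isinstance(workflow, dict):
        return ["workflow must be a JSON object keyed by node ID"]

    problems = []
    for node_id, node in workflow.items():
        if not isinstance(node, dict) or "class_type" not in node or "inputs" not in node:
            problems.append(f"node {node_id} lacks class_type or inputs")
    # Every node ID referenced by the mapping must exist in the workflow.
    for label, node_id in node_ids.items():
        if str(node_id) not in workflow:
            problems.append(f"{label} is mapped to missing node {node_id}")
    return problems
```

These checks cover structure only. Missing model files or an unreachable ComfyUI API still have to be verified against the running instance.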
Advanced Features
Auto-Size Support
Some models support automatic sizing. Auto-size is only supported by models matching the pattern:
gpt-image-*, dall-e-*
Batch Generation
Generate multiple variations.
Image Caching
Generated images are cached based on:
- Prompt hash
- Model configuration
- Generation parameters
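One way such a cache key could be derived is sketched below: the prompt is hashed, then combined with the model configuration and generation parameters into a canonical JSON string that is hashed again. The names `cache_key` and `ImageCache`, the use of SHA-256, and the in-memory dictionary are all assumptions for illustration, not a description of Open WebUI's cache.

```python
import hashlib
import json


def cache_key(prompt, model_config, parameters):
    """Derive a deterministic key from the prompt, model config, and parameters."""
    prompt_hash = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    # sort_keys makes the serialization independent of dict insertion order.
    canonical = json.dumps(
        {"prompt": prompt_hash, "model": model_config, "params": parameters},
        sort_keys=True,
        separators=(",", ":"),
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class ImageCache:
    """In-memory cache that generates an image only on a key miss."""

    def __init__(self):
        self._entries = {}

    def get_or_generate(self, prompt, model_config, parameters, generate):
        key = cache_key(prompt, model_config, parameters)
        if key not in self._entries:
            self._entries[key] = generate()  # the expensive engine call
        return self._entries[key]
```

Any change to a parameter, such as a different step count or size, produces a different key, so only exact repeats are served from the cache.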