DeepFloyd IF
Pixel-space diffusion model by DeepFloyd/Stability AI with strong text rendering.
Open SourceSelf HostedOffline CapableGPU Required (16GB+ VRAM)
0.0 (0)
About
DeepFloyd IF is a modular pixel-space diffusion model by DeepFloyd Lab (Stability AI). Uses T5-XXL text encoder for strong text rendering and prompt understanding. Three-stage cascaded architecture for up to 1024x1024 resolution. Requires GPU with 16+ GB VRAM. DeepFloyd IF license.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- Image Generation
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Advanced (4/5)
- Minimum VRAM
- 16 GB
- Added
- Apr 3, 2026
Similar Tools
Featured
Neural network architecture for adding spatial control to diffusion models.
Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Image prompt adapter for pre-trained text-to-image diffusion models.
Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Zero-shot identity-preserving image generation from a single face photo.
Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)