Tools/Image Generation/DeepFloyd IF

DeepFloyd IF

Pixel-space diffusion model by DeepFloyd/Stability AI with strong text rendering.

Open SourceSelf HostedOffline CapableGPU Required (16GB+ VRAM)
0.0 (0)

About

DeepFloyd IF is a modular pixel-space diffusion model by DeepFloyd Lab (Stability AI). Uses T5-XXL text encoder for strong text rendering and prompt understanding. Three-stage cascaded architecture for up to 1024x1024 resolution. Requires GPU with 16+ GB VRAM. DeepFloyd IF license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Advanced (4/5)
Minimum VRAM
16 GB
Added
Apr 3, 2026

Similar Tools

Featured

Neural network architecture for adding spatial control to diffusion models.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)

Image prompt adapter for pre-trained text-to-image diffusion models.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)

Zero-shot identity-preserving image generation from a single face photo.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)