HunyuanDiT

Bilingual text-to-image diffusion transformer by Tencent with Chinese and English support.

Open Source · Self Hosted · Offline Capable · GPU Required (12GB+ VRAM)

About

HunyuanDiT is a text-to-image diffusion transformer developed by Tencent. It natively supports prompts in both Chinese and English, and it pairs fine-grained text understanding with a multi-resolution architecture. Running it requires a GPU with at least 12 GB of VRAM. It is released under the Tencent Hunyuan Community License.
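As a rough illustration of local use, the sketch below checks the 12 GB VRAM minimum and then generates one image from a prompt. It assumes the Hugging Face `diffusers` library's `HunyuanDiTPipeline` and the `Tencent-Hunyuan/HunyuanDiT-Diffusers` checkpoint, neither of which is named in this listing; `meets_vram_requirement` and `generate` are hypothetical helper names used only for this example.

```python
MIN_VRAM_GB = 12  # minimum VRAM stated in this listing


def meets_vram_requirement(total_bytes: int, min_gb: int = MIN_VRAM_GB) -> bool:
    """Return True if a GPU reporting total_bytes of memory meets the minimum.

    Uses decimal gigabytes (10**9 bytes), because drivers usually report
    slightly less than the marketed capacity in binary units.
    """
    return total_bytes >= min_gb * 10**9


def generate(prompt: str, model_id: str = "Tencent-Hunyuan/HunyuanDiT-Diffusers"):
    """Generate one image from a Chinese or English prompt (assumed setup)."""
    # Heavy imports stay local so the helper above works without them.
    import torch
    from diffusers import HunyuanDiTPipeline  # assumption: diffusers is installed

    if not torch.cuda.is_available():
        raise RuntimeError("A CUDA-capable GPU is required.")

    total = torch.cuda.get_device_properties(0).total_memory
    if not meets_vram_requirement(total):
        raise RuntimeError(f"GPU has less than {MIN_VRAM_GB} GB of VRAM.")

    # Half precision keeps memory use down; the checkpoint name is an assumption.
    pipe = HunyuanDiTPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
    pipe.to("cuda")
    return pipe(prompt=prompt).images[0]
```

With a suitable GPU and the model weights downloaded, a call such as `generate("一只戴着宇航员头盔的猫").save("cat.png")` would write the result to disk; an English prompt can be passed the same way.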

Details

Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
License: Tencent Hunyuan Community License
Minimum VRAM: 12 GB
Added: Apr 3, 2026

Similar Tools

Neural network architecture for adding spatial control to diffusion models.

Open Source · Self Hosted · Offline · GPU 8GB+
Intermediate

Image prompt adapter for pre-trained text-to-image diffusion models.

Open Source · Self Hosted · Offline · GPU 8GB+
Intermediate

Zero-shot identity-preserving image generation from a single face photo.

Open Source · Self Hosted · Offline · GPU 8GB+
Intermediate