Show-1
Text-to-video model combining pixel and latent diffusion approaches.
About
Show-1 is a research text-to-video model from Show Lab at the National University of Singapore that combines pixel-based and latent-based video diffusion. It uses pixel diffusion to produce coherent low-resolution motion and latent diffusion to upscale, which improves text-video alignment over latent-only approaches. Base, interpolation, and super-resolution checkpoints are published on Hugging Face for self-hosted inference on a GPU.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- Video Generation
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Advanced (4/5)
- Minimum VRAM
- 16 GB
- Added
- Apr 3, 2026
Related Tools
Text-to-video generation framework with cascaded latent diffusion.
Turn text-to-image models into animation generators
AI animation tool for creating dynamic video content
Latest Open-Sora release with improved video generation quality.
AI video generation model from Stability AI
Updated CogVideo model by Zhipu AI with improved video quality.