Stable Video Diffusion XT
Extended Stable Video Diffusion model generating 25-frame videos.
About
Stable Video Diffusion XT by Stability AI is an image-to-video latent diffusion model fine-tuned from the base SVD to generate 25 frames at 576 by 1024 resolution from a single conditioning image, up from 14 frames in the base model, with a fine-tuned decoder for better temporal consistency. It produces short clips for novel-view and motion synthesis. Commercial use follows the Stability AI Community License.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- Video Generation
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Intermediate (3/5)
- License
- Stability Community
- Minimum VRAM
- 12 GB
- Added
- Apr 3, 2026
Related Tools
Text-to-video generation framework with cascaded latent diffusion.
Turn text-to-image models into animation generators
AI animation tool for creating dynamic video content
Latest Open-Sora release with improved video generation quality.
AI video generation model from Stability AI
Updated CogVideo model by Zhipu AI with improved video quality.