Stable Video Diffusion XT

Extended Stable Video Diffusion model generating 25-frame videos.

Open Source, Self Hosted, Offline Capable, GPU Required (12GB+ VRAM)

About

Stable Video Diffusion XT by Stability AI is an image-to-video latent diffusion model fine-tuned from the base SVD model. From a single conditioning image it generates 25 frames at 576×1024 resolution (height × width), up from 14 frames in the base model, and it includes a fine-tuned decoder for better temporal consistency. It produces short clips for novel-view and motion synthesis. Commercial use is governed by the Stability AI Community License.
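
Because the model expects a single conditioning image at a fixed 576×1024 size, most of the practical setup is getting the input into that shape. The sketch below is one possible way to do that and then request 25 frames through the Hugging Face Diffusers `StableVideoDiffusionPipeline`. It is an illustration under assumptions, not part of this listing: the model ID `stabilityai/stable-video-diffusion-img2vid-xt`, the helper names `center_crop_box` and `generate_clip`, and the `fps`, `seed`, and `decode_chunk_size` values are all choices made for the example. The heavy imports sit inside `generate_clip` so that the crop helper can be used without a GPU stack installed.

```python
from typing import Tuple

# Output geometry described above: 25 frames at 576 (height) x 1024 (width).
SVD_XT_WIDTH, SVD_XT_HEIGHT, SVD_XT_FRAMES = 1024, 576, 25
# Assumed Hugging Face model ID; check the model card before relying on it.
MODEL_ID = "stabilityai/stable-video-diffusion-img2vid-xt"


def center_crop_box(width: int, height: int,
                    target_w: int = SVD_XT_WIDTH,
                    target_h: int = SVD_XT_HEIGHT) -> Tuple[int, int, int, int]:
    """Largest centered box with the target aspect ratio: (left, top, right, bottom)."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Compare aspect ratios by cross-multiplication to avoid float rounding.
    if width * target_h > height * target_w:
        # Source is wider than the target: trim the left and right edges.
        crop_w = height * target_w // target_h
        left = (width - crop_w) // 2
        return (left, 0, left + crop_w, height)
    # Source is taller than (or equal to) the target: trim top and bottom.
    crop_h = width * target_h // target_w
    top = (height - crop_h) // 2
    return (0, top, width, top + crop_h)


def generate_clip(image_path: str, out_path: str = "clip.mp4",
                  fps: int = 7, seed: int = 42) -> str:
    """Hypothetical helper: crop, resize, run the pipeline, write an MP4."""
    # Imported lazily: these need torch, diffusers, Pillow, and a capable GPU.
    import torch
    from PIL import Image
    from diffusers import StableVideoDiffusionPipeline
    from diffusers.utils import export_to_video

    pipe = StableVideoDiffusionPipeline.from_pretrained(
        MODEL_ID, torch_dtype=torch.float16, variant="fp16"
    )
    # Offloading idle submodules to CPU lowers peak VRAM at some speed cost.
    pipe.enable_model_cpu_offload()

    image = Image.open(image_path).convert("RGB")
    image = image.crop(center_crop_box(*image.size))
    image = image.resize((SVD_XT_WIDTH, SVD_XT_HEIGHT))

    frames = pipe(
        image,
        height=SVD_XT_HEIGHT,
        width=SVD_XT_WIDTH,
        num_frames=SVD_XT_FRAMES,
        decode_chunk_size=8,  # decode a few frames at a time to save memory
        generator=torch.manual_seed(seed),
    ).frames[0]
    export_to_video(frames, out_path, fps=fps)
    return out_path
```

Calling `generate_clip("input.jpg")` would download the weights on first use. Actual memory use depends on the GPU, the precision, and the `decode_chunk_size` setting.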

Details

Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
License: Stability Community
Minimum VRAM: 12 GB
Added: Apr 3, 2026

Related Tools

Text-to-video generation framework with cascaded latent diffusion.

Open Source, Self Hosted, Offline, GPU 16GB+, Advanced

Turns text-to-image models into animation generators.

Open Source, Self Hosted, Offline, GPU 12GB+, Advanced

AI animation tool for creating dynamic video content.

Open Source, Self Hosted, Offline, GPU 8GB+, Advanced

Latest Open-Sora release with improved video generation quality.

Open Source, Self Hosted, Offline, GPU 16GB+, Advanced

AI video generation model from Stability AI.

Open Source, Self Hosted, Offline, GPU 16GB+, Advanced

Updated CogVideo model by Zhipu AI with improved video quality.

Open Source, Self Hosted, Offline, GPU 16GB+, Advanced