Tools/Video Generation/Stable Video Diffusion XT

Stable Video Diffusion XT

Extended Stable Video Diffusion model generating 25-frame videos.

Open SourceSelf HostedOffline CapableGPU Required (12GB+ VRAM)

0.0 (0)

Visit Website View on GitHub

About

Stable Video Diffusion XT by Stability AI is an image-to-video latent diffusion model fine-tuned from the base SVD to generate 25 frames at 576 by 1024 resolution from a single conditioning image, up from 14 frames in the base model, with a fine-tuned decoder for better temporal consistency. It produces short clips for novel-view and motion synthesis. Commercial use follows the Stability AI Community License.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: Video Generation
Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
License: Stability Community
Minimum VRAM: 12 GB
Added: Apr 3, 2026

Tags

video-generation image-to-video stability-ai 25-frames extended

Related Tools

Featured

HunyuanVideo

Video Generation

Open-source video generation model by Tencent with text and image conditioning.

Open SourceSelf HostedOfflineGPU 24GB+

Advanced

0.0 (0)

I2VGen-XL

Video Generation

Image-to-video generation model by Alibaba DAMO Academy.

Open SourceSelf HostedOfflineGPU 12GB+

Advanced

0.0 (0)

CogVideo 1.5

Video Generation

Updated CogVideo model by Zhipu AI with improved video quality.

Open SourceSelf HostedOfflineGPU 16GB+

Advanced

0.0 (0)

MuseV

Video Generation

Infinite-length music-driven video generation with visual conditioning.

Open SourceSelf HostedOfflineGPU 12GB+

Advanced

0.0 (0)

LaVie

Video Generation

Text-to-video generation framework with cascaded latent diffusion.

Open SourceSelf HostedOfflineGPU 16GB+

Advanced

0.0 (0)

CogVideoX

Video Generation

Open-source text-to-video model by Zhipu AI/Tsinghua with 2B and 5B variants.

Open SourceSelf HostedOfflineGPU 12GB+

Advanced

0.0 (0)

Browse all Video Generation tools