LaVie

Text-to-video generation framework with cascaded latent diffusion.

Open SourceSelf HostedOffline CapableGPU Required (16GB+ VRAM)
0.0 (0)

About

LaVie is a research text-to-video framework from Shanghai AI Lab and Vchitect that uses cascaded latent diffusion models with temporal super-resolution to produce short video clips at higher resolution than the base generator alone. The repository is the official PyTorch implementation accompanying the paper and ships pretrained model weights, an image-to-video companion (SEINE), and a Hugging Face demo space.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Advanced (4/5)
License
Apache-2.0
Minimum VRAM
16 GB
Added
Apr 3, 2026

Related Tools

Turn text-to-image models into animation generators

Open SourceSelf HostedOfflineGPU 12GB+
Advanced
0.0 (0)

AI animation tool for creating dynamic video content

Open SourceSelf HostedOfflineGPU 8GB+
Advanced
0.0 (0)

Latest Open-Sora release with improved video generation quality.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)
Featured

AI video generation model from Stability AI

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)

Open video generation model by Genmo focused on motion quality.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)

Updated CogVideo model by Zhipu AI with improved video quality.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)
Browse all Video Generation tools