Tools/Video Generation/ModelScope Text-to-Video

ModelScope Text-to-Video

Text-to-video generation model by Alibaba DAMO Academy on ModelScope.

Open SourceSelf HostedOffline CapableGPU Required (12GB+ VRAM)
0.0 (0)

About

ModelScope Text-to-Video is an early open text-to-video diffusion model from Alibaba DAMO Academy, distributed through the ModelScope model-as-a-service library. It generates short clips from English text prompts and was one of the first openly released models of its kind. The core ModelScope library provides unified interfaces for loading and running the model. Inference benefits from a GPU with 12 GB or more of VRAM.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Advanced (4/5)
Minimum VRAM
12 GB
Added
Apr 3, 2026

Related Tools

Text-to-video generation framework with cascaded latent diffusion.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)

Turn text-to-image models into animation generators

Open SourceSelf HostedOfflineGPU 12GB+
Advanced
0.0 (0)

AI animation tool for creating dynamic video content

Open SourceSelf HostedOfflineGPU 8GB+
Advanced
0.0 (0)

Latest Open-Sora release with improved video generation quality.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)
Featured

AI video generation model from Stability AI

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)

Updated CogVideo model by Zhipu AI with improved video quality.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)
Browse all Video Generation tools