ModelScope Text-to-Video
Text-to-video generation model by Alibaba DAMO Academy on ModelScope.
About
ModelScope Text-to-Video is an early open text-to-video diffusion model from Alibaba DAMO Academy, distributed through the ModelScope model-as-a-service library. It generates short clips from English text prompts and was one of the first openly released models of its kind. The core ModelScope library provides unified interfaces for loading and running the model. Inference benefits from a GPU with 12 GB or more of VRAM.
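As a rough illustration of the unified interface mentioned above, the sketch below wraps a ModelScope pipeline call in a small helper. It is a minimal sketch under stated assumptions, not the official usage: the task name `text-to-video-synthesis`, the model id `damo/text-to-video-synthesis`, and the `{"text": ...}` input shape are taken to match the public model card and may differ between library versions. The helper names `build_request`, `has_enough_vram`, and `generate_clip` are hypothetical and exist only for this example.

```python
from typing import Dict

# Assumptions (not confirmed by this listing): task name and hub model id.
TASK = "text-to-video-synthesis"
MODEL_ID = "damo/text-to-video-synthesis"
MIN_VRAM_GB = 12  # minimum VRAM figure from the listing


def build_request(prompt: str) -> Dict[str, str]:
    """Build the input dict the pipeline is assumed to accept (English prompt)."""
    text = prompt.strip()
    if not text:
        raise ValueError("prompt must not be empty")
    return {"text": text}


def has_enough_vram(total_bytes: int, min_gb: float = MIN_VRAM_GB) -> bool:
    """Compare a device's total memory in bytes against the minimum.

    Drivers often report slightly less than the nominal card size,
    so treat a False result near the threshold as a hint, not a verdict.
    """
    return total_bytes / 1024**3 >= min_gb


def generate_clip(prompt: str) -> str:
    """Run the model and return the path of the generated video file.

    modelscope is imported lazily so the helpers above work without it.
    The first call downloads the model weights.
    """
    from modelscope.outputs import OutputKeys
    from modelscope.pipelines import pipeline

    pipe = pipeline(TASK, model=MODEL_ID)
    result = pipe(build_request(prompt))
    return result[OutputKeys.OUTPUT_VIDEO]
```

With `modelscope` and its dependencies installed, `generate_clip("A panda eating bamboo on a rock.")` would be the entry point. Keeping the prompt validation and the VRAM check separate from the pipeline call lets both be exercised without a GPU.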
Details
- Category: Video Generation
- Price: Free
- Platform: Local/Desktop
- Difficulty: Advanced (4/5)
- Minimum VRAM: 12 GB
- Added: Apr 3, 2026
Related Tools
Text-to-video generation framework with cascaded latent diffusion.
Turn text-to-image models into animation generators
AI animation tool for creating dynamic video content
Latest Open-Sora release with improved video generation quality.
AI video generation model from Stability AI
Updated CogVideo model by Zhipu AI with improved video quality.