Tools/Video Generation/MuseV

MuseV

Infinite-length music-driven video generation with visual conditioning.

Open SourceSelf HostedOffline CapableGPU Required (12GB+ VRAM)

0.0 (0)

Visit Website View on GitHub

About

MuseV by Tencent is a diffusion-based model for generating long, even effectively unbounded, virtual-human videos through parallel denoising conditioned on music, images, and text. It is part of the Muse open-source series alongside MuseTalk and MusePose and targets high-fidelity talking and performing avatars. Inference benefits from a GPU with 12 GB or more of VRAM. Distributed as open-source research.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: Video Generation
Price: Free
Platform: Local/Desktop
Difficulty: Advanced (4/5)
Minimum VRAM: 12 GB
Added: Apr 3, 2026

Tags

video-generation music-driven tencent animation long-form

Related Tools

Featured

HunyuanVideo

Video Generation

Open-source video generation model by Tencent with text and image conditioning.

Open SourceSelf HostedOfflineGPU 24GB+

Advanced

0.0 (0)

I2VGen-XL

Video Generation

Image-to-video generation model by Alibaba DAMO Academy.

Open SourceSelf HostedOfflineGPU 12GB+

Advanced

0.0 (0)

CogVideo 1.5

Video Generation

Updated CogVideo model by Zhipu AI with improved video quality.

Open SourceSelf HostedOfflineGPU 16GB+

Advanced

0.0 (0)

LaVie

Video Generation

Text-to-video generation framework with cascaded latent diffusion.

Open SourceSelf HostedOfflineGPU 16GB+

Advanced

0.0 (0)

SkyReels V1

Video Generation

Open-source video generation model with controllable camera and subject motion.

Open SourceSelf HostedOfflineGPU 16GB+

Advanced

0.0 (0)

CogVideoX

Video Generation

Open-source text-to-video model by Zhipu AI/Tsinghua with 2B and 5B variants.

Open SourceSelf HostedOfflineGPU 12GB+

Advanced

0.0 (0)

Browse all Video Generation tools