DiffRhythm

Full-length song generation model using diffusion with lyrics and style conditioning.

Open Source | Self Hosted | Offline Capable | GPU Required (12GB+ VRAM)

About

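DiffRhythm is a diffusion-based model that generates full-length songs of up to 4 minutes 45 seconds. It is conditioned on lyrics and a musical style description, and it produces vocals and instrumentals together in a single pass rather than as separate tracks. Developed by ASLP@NPU, it requires a GPU with at least 12 GB of VRAM and is released under the Apache 2.0 license.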

Details

Price: Free
Platform: Local/Desktop
Difficulty: Advanced (4/5)
License: Apache-2.0
Minimum VRAM: 12 GB
Added: Apr 3, 2026

Related Tools

Fast music generation model producing full songs with lyrics in seconds.

Open Source | Self Hosted | Offline | GPU 8GB+
Intermediate

Open-source toolkit for audio, music, and speech generation research.

Open Source | Self Hosted | Offline | GPU 8GB+
Advanced

Original latent diffusion model for text-to-audio generation.

Open Source | Self Hosted | Offline | GPU 8GB+
Intermediate

State-of-the-art music source separation model by Meta for splitting tracks.

Open Source | Self Hosted | Offline
Easy

High-fidelity neural audio codec by Meta for audio compression and tokenization.

Open Source | Self Hosted | Offline
Intermediate

Updated music generation model with improved quality and longer generation.

Open Source | Self Hosted | Offline | GPU 8GB+
Intermediate