Tools/Voice Cloning & Voice Conversion/XTTS-v2 (Voice Cloning)
Featured Tool

XTTS-v2 (Voice Cloning)

Voice cloning mode of XTTS-v2 for creating custom voice replicas.

Open SourceSelf HostedOffline CapableGPU Required (4GB+ VRAM)
0.0 (0)

About

XTTS v2, shipped as part of the Coqui TTS library, is a multilingual voice-cloning model that reproduces a target voice from roughly six seconds of reference audio across 17 languages. It supports cross-language cloning, so a clip in one language can drive output in another, and inference runs on consumer GPUs with around 4 GB of VRAM. Distributed under the Coqui Public Model License within the broader MPL-2.0 codebase.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Easy (2/5)
License
CPML
Minimum VRAM
4 GB
Added
Apr 3, 2026

Related Tools

Zero-shot voice cloning mode of CosyVoice model by Alibaba.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)

Community fork of RVC with additional features and optimizations.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Updated zero-shot voice cloning by MyShell with improved quality.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Zero-shot voice cloning capabilities of Fish Speech model.

Open SourceSelf HostedOfflineGPU 6GB+
Easy
0.0 (0)
Featured

Easy-to-use voice conversion framework based on retrieval for real-time voice cloning.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Text-free one-shot voice conversion model requiring no text transcription.

Open SourceSelf HostedOfflineGPU 4GB+
Intermediate
0.0 (0)
Browse all Voice Cloning & Voice Conversion tools