Tools/Voice Cloning & Voice Conversion/Fish Speech (Voice Cloning)

Fish Speech (Voice Cloning)

Zero-shot voice cloning capabilities of Fish Speech model.

Open SourceSelf HostedOffline CapableGPU Required (6GB+ VRAM)
0.0 (0)

About

Fish Speech is a multilingual text-to-speech and zero-shot voice cloning system from Fish Audio that reproduces a target voice from a short reference clip. The current Fish Audio S2 generation uses a dual-autoregressive architecture with reinforcement-learning alignment, trained on a large multilingual corpus for natural, emotionally varied speech across many languages. The open-source release is distributed under the Apache 2.0 license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Easy (2/5)
License
Apache-2.0
Minimum VRAM
6 GB
Added
Apr 3, 2026

Related Tools

Zero-shot voice cloning mode of CosyVoice model by Alibaba.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)

Community fork of RVC with additional features and optimizations.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Updated zero-shot voice cloning by MyShell with improved quality.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)
Featured

Voice cloning mode of XTTS-v2 for creating custom voice replicas.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)
Featured

Easy-to-use voice conversion framework based on retrieval for real-time voice cloning.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Text-free one-shot voice conversion model requiring no text transcription.

Open SourceSelf HostedOfflineGPU 4GB+
Intermediate
0.0 (0)
Browse all Voice Cloning & Voice Conversion tools