Fish Speech (Voice Cloning)
Zero-shot voice cloning capabilities of the Fish Speech model.
About
Fish Speech is a multilingual text-to-speech and zero-shot voice cloning system from Fish Audio that reproduces a target voice from a short reference clip. The current Fish Audio S2 generation uses a dual-autoregressive architecture with reinforcement-learning alignment, trained on a large multilingual corpus for natural, emotionally varied speech across many languages. The open-source release is distributed under the Apache 2.0 license.
Details
- Category: Voice Cloning & Voice Conversion
- Price: Free
- Platform: Local/Desktop
- Difficulty: Easy (2/5)
- License: Apache-2.0
- Minimum VRAM: 6 GB
- Added: Apr 3, 2026
Related Tools
Zero-shot voice cloning mode of the CosyVoice model by Alibaba.
Community fork of RVC with additional features and optimizations.
Updated zero-shot voice cloning by MyShell with improved quality.
Voice cloning mode of XTTS-v2 for creating custom voice replicas.
Easy-to-use, retrieval-based voice conversion framework for real-time voice cloning.
One-shot voice conversion model that requires no text transcription.