Bark
Transformer-based text-to-audio model from Suno
About
Bark is a transformer text-to-audio model from Suno that generates speech, music, background noise, and short sound effects in many languages from a single text prompt. Unlike conventional TTS systems, it is fully generative and can produce nonverbal vocalizations such as laughing, sighing, and crying. Pretrained model checkpoints are released for research and commercial use; outputs can diverge from prompts in unpredictable ways.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- Audio & Speech
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Easy (2/5)
- License
- MIT
- Minimum VRAM
- 8 GB
- Added
- Jan 29, 2026
Related Tools
Free text-to-speech generator with multiple voices, accents, and languages. No signup required.
Qwen Chat is an AI assistant for everyone, powered by the Qwen series models. It’s free to use, open to all, and ready to help with creativity, collaboration, and endless possibilities.
CTranslate2-based Whisper with 4x faster transcription
Deep learning toolkit for text-to-speech synthesis
OpenAI's powerful speech recognition model
Fast, local neural text-to-speech for home automation