Tools/Audio & Speech/Bark

Bark

Transformer-based text-to-audio model from Suno

Open SourceSelf HostedOffline CapableGPU Required (8GB+ VRAM)

0.0 (0)

Visit Website View on GitHub Documentation

About

Bark is a transformer text-to-audio model from Suno that generates speech, music, background noise, and short sound effects in many languages from a single text prompt. Unlike conventional TTS systems, it is fully generative and can produce nonverbal vocalizations such as laughing, sighing, and crying. Pretrained model checkpoints are released for research and commercial use; outputs can diverge from prompts in unpredictable ways.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: Audio & Speech
Price: Free
Platform: Local/Desktop
Difficulty: Easy (2/5)
License: MIT
Minimum VRAM: 8 GB
Added: Jan 29, 2026

Tags

text-to-speech audio music sound-effects

Related Tools

Featured

TextSpeakPro

Free text-to-speech generator with multiple voices, accents, and languages. No signup required.

Beginner

5.0 (1)

Featured

faster-whisper

CTranslate2-based Whisper with 4x faster transcription

Open SourceSelf HostedOffline

Easy

0.0 (0)

BigVGAN

Universal neural vocoder from NVIDIA that converts mel spectrograms into waveforms up to 44 kHz.

Open SourceSelf HostedOfflineGPU

Intermediate

0.0 (0)

GLM-4-Voice

End-to-end Chinese and English spoken dialogue model from Zhipu AI with streaming speech output.

Open SourceSelf HostedOfflineGPU

Intermediate

0.0 (0)

Kimi-Audio

Audio foundation model unifying speech recognition, understanding, and conversation in one 7B model.

Open SourceSelf HostedOfflineGPU

Intermediate

0.0 (0)

Coqui TTS

Deep learning toolkit for text-to-speech synthesis

Open SourceSelf HostedOffline

Intermediate

0.0 (0)

Browse all Audio & Speech tools