F5-TTS

Diffusion transformer text-to-speech model using flow matching for fluent, faithful speech.

Open Source, Self Hosted, Offline Capable, GPU Required

About

F5-TTS is a non-autoregressive text-to-speech system that pairs flow matching with a Diffusion Transformer, using ConvNeXt V2 blocks for faster training and inference. Zero-shot voice cloning works from a short reference clip, with optional ASR transcription when the reference audio is unlabeled, and the repository bundles E2 TTS, a Flat-UNet Transformer variant, for comparison. Sway Sampling, an inference-time strategy introduced by the project, improves output quality without retraining. Trained on datasets including Emilia, WenetSpeech4TTS, LibriTTS, and LJSpeech, the models reach a real-time factor of about 0.04 on an L20 GPU. Inference options span a Gradio web UI with multi-style and multi-speaker support plus a voice chat mode driven by Qwen2.5-3B-Instruct, a command-line interface, Docker, and a TensorRT-LLM runtime. The code is MIT licensed while the pretrained checkpoints carry a CC-BY-NC license, a distinction that matters for commercial use.
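Two of the numbers above can be made concrete with a little arithmetic. The sketch below is illustrative only and is not taken from the F5-TTS codebase. It assumes the usual definition of real-time factor (synthesis time divided by audio duration), and it assumes Sway Sampling warps a uniform flow step u in [0, 1] as u + s * (cos(pi/2 * u) - 1 + u), which is the form recalled from the F5-TTS paper. The function names and the coefficient s = -1 are hypothetical choices made for this example.

```python
import math


def synthesis_seconds(audio_seconds: float, rtf: float = 0.04) -> float:
    """Estimate compute time, assuming RTF = synthesis time / audio duration."""
    return rtf * audio_seconds


def sway_sampling(u: float, s: float = -1.0) -> float:
    """Warp a uniform flow step u in [0, 1].

    Assumed form: u + s * (cos(pi/2 * u) - 1 + u).
    The endpoints 0 and 1 stay fixed; a negative s pulls steps toward 0.
    """
    return u + s * (math.cos(math.pi / 2 * u) - 1 + u)


def sway_schedule(num_steps: int, s: float = -1.0) -> list[float]:
    """Return num_steps + 1 warped time points from 0 to 1 (num_steps > 0)."""
    return [sway_sampling(i / num_steps, s) for i in range(num_steps + 1)]


if __name__ == "__main__":
    # At RTF 0.04, ten seconds of speech needs roughly 0.4 s of compute.
    print(f"10 s of speech: ~{synthesis_seconds(10.0):.2f} s of compute")
    # With s = -1 the schedule spends more of its steps near t = 0.
    print([round(t, 3) for t in sway_schedule(8)])
```

Setting s to 0 leaves the uniform schedule unchanged, which makes it easy to compare a warped schedule against the baseline. Because the warp only changes where the solver evaluates the flow, it is consistent with the claim above that no retraining is needed.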

Details

Category: Text-to-Speech (TTS)
Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
License: MIT (code); CC-BY-NC (pretrained checkpoints)
Added: May 7, 2026

Featured

Bark

Text-to-Speech (TTS)

Transformer-based text-to-audio model by Suno that generates speech, music, and sound effects.

Open Source, Self Hosted, Offline, GPU 4GB+

Intermediate

Mentioned in

Open-Weight Text to Speech Models in 2026: The XTTS Successors

A working developer's comparison of Kokoro, Zonos, Kyutai TTS, F5-TTS, Piper, Chatterbox and the rest:...

Billy C

Running F5-TTS Locally for Voice Cloning, A Setup Guide

Step-by-step on installing F5-TTS, prepping a clean reference clip, running CLI and Gradio inference, and a...

Billy C

Related Tools

Kokoro TTS

ChatTTS

CosyVoice

CosyVoice 2

EmotiVoice

Bark
