Tools/Text-to-Speech (TTS)/IndexTTS

IndexTTS

Zero-shot TTS model with high naturalness and speaker similarity.

Open SourceSelf HostedOffline CapableGPU Required (6GB+ VRAM)

0.0 (0)

Visit Website View on GitHub

About

IndexTTS is a zero-shot text-to-speech model that synthesizes speech matching a short reference clip's voice. The IndexTTS2 release adds a method for controlling the duration of generated speech in autoregressive synthesis, which helps tasks like video dubbing that need audio-visual synchronization, alongside emotionally expressive output. It supports multiple languages and runs on a GPU. Released as an open-source model with inference code.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: Text-to-Speech (TTS)
Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
Minimum VRAM: 6 GB
Added: Apr 3, 2026

Tags

tts zero-shot high-quality naturalness

Related Tools

Featured

Kokoro TTS

Text-to-Speech (TTS)

Lightweight and expressive TTS model with 82M parameters for fast local inference.

Open SourceSelf HostedOffline

Easy

4.0 (1)

ChatTTS

Text-to-Speech (TTS)

Conversational TTS model optimized for dialogue and chat applications.

Open SourceSelf HostedOfflineGPU 4GB+

Intermediate

0.0 (0)

CosyVoice

Text-to-Speech (TTS)

Multilingual large voice generation model with full-stack inference, training, and deployment.

Open SourceSelf HostedOfflineGPU

Intermediate

0.0 (0)

CosyVoice 2

Text-to-Speech (TTS)

Large-scale multilingual TTS model by Alibaba with zero-shot voice cloning.

Open SourceSelf HostedOfflineGPU 8GB+

Advanced

0.0 (0)

EmotiVoice

Text-to-Speech (TTS)

Emotion-controllable TTS engine by NetEase with 2000+ voices.

Open SourceSelf HostedOfflineGPU 4GB+

Intermediate

0.0 (0)

Featured

Bark

Text-to-Speech (TTS)

Transformer-based text-to-audio model by Suno that generates speech, music, and sound effects.

Open SourceSelf HostedOfflineGPU 4GB+

Intermediate

0.0 (0)

Browse all Text-to-Speech (TTS) tools