Tools/Text-to-Speech (TTS)/StyleTTS 2

StyleTTS 2

Style diffusion and adversarial training for human-level TTS with style transfer.

Open SourceSelf HostedOffline CapableGPU Required (6GB+ VRAM)

0.0 (0)

Visit Website View on GitHub

About

StyleTTS 2 is a text-to-speech model that uses style diffusion and adversarial training to achieve human-level speech synthesis quality. Supports style transfer, allowing control over speaking style via reference audio. Developed by Columbia University researchers. Requires GPU with 6+ GB VRAM. MIT license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: Text-to-Speech (TTS)
Price: Free
Platform: Local/Desktop
Difficulty: Advanced (4/5)
License: MIT
Minimum VRAM: 6 GB
Added: Apr 3, 2026

Tags

tts style-transfer diffusion adversarial research

Related Tools

Featured

Kokoro TTS

Text-to-Speech (TTS)

Lightweight and expressive TTS model with 82M parameters for fast local inference.

Open SourceSelf HostedOffline

Easy

4.0 (1)

Chatterbox TTS (Resemble)

Text-to-Speech (TTS)

Open-source TTS model by Resemble AI with emotion and accent control.

Open SourceSelf HostedOfflineGPU 4GB+

Easy

0.0 (0)

Chatterbox TTS

Text-to-Speech (TTS)

Expressive zero-shot TTS model by Resemble AI with emotion and accent control.

Open SourceSelf HostedOfflineGPU 4GB+

Intermediate

0.0 (0)

So-VITS-SVC

Text-to-Speech (TTS)

Singing voice conversion model based on VITS and SoftVC for voice-to-voice transfer.

Open SourceSelf HostedOfflineGPU 6GB+

Advanced

0.0 (0)

IndexTTS

Text-to-Speech (TTS)

Zero-shot TTS model with high naturalness and speaker similarity.

Open SourceSelf HostedOfflineGPU 6GB+

Intermediate

0.0 (0)

Featured

Bark

Text-to-Speech (TTS)

Transformer-based text-to-audio model by Suno that generates speech, music, and sound effects.

Open SourceSelf HostedOfflineGPU 4GB+

Intermediate

0.0 (0)

Browse all Text-to-Speech (TTS) tools