Featured Tool

Whisper

General-purpose speech recognition model by OpenAI trained on 680K hours of multilingual audio.

Open SourceSelf HostedOffline Capable
0.0 (0)

About

Whisper is an open-source automatic speech recognition model by OpenAI. Trained on 680,000 hours of multilingual audio, it handles transcription and translation across 99 languages. Available in multiple sizes (tiny to large-v3) for different accuracy/speed tradeoffs. Runs locally on CPU or GPU. MIT license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Easy (2/5)
License
MIT
Added
Apr 3, 2026

Related Tools

Whisper extension providing word-level timestamps for transcription.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Multilingual ASR model by NVIDIA supporting 4 languages with translation.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)

Convolution-augmented transformer for speech recognition in ESPnet toolkit.

Open SourceSelf HostedOfflineGPU 8GB+
Advanced
0.0 (0)

Pre-trained speech models for STT, TTS, and VAD with simple PyTorch integration.

Open SourceSelf HostedOffline
Beginner
0.0 (0)

CLI tool that transcribes audio 10x faster using pipeline optimizations.

Open SourceSelf HostedOfflineGPU 6GB+
Easy
0.0 (0)

Open-source speaker diarization and voice activity detection toolkit.

Open SourceSelf HostedOfflineGPU 4GB+
Intermediate
0.0 (0)
Browse all Speech-to-Text / Speech Recognition tools