DeepSpeech
End-to-end speech recognition engine by Mozilla using TensorFlow.
About
DeepSpeech is an open-source speech-to-text engine by Mozilla, based on Baidu Deep Speech research paper. Uses a TensorFlow-trained model to convert audio to text. Supports real-time streaming transcription. Runs on CPU. Now in maintenance mode. MPL 2.0 license.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Intermediate (3/5)
- License
- MPL-2.0
- Added
- Apr 3, 2026
Related Tools
Whisper extension providing word-level timestamps for transcription.
Multilingual ASR model by NVIDIA supporting 4 languages with translation.
Convolution-augmented transformer for speech recognition in ESPnet toolkit.
Pre-trained speech models for STT, TTS, and VAD with simple PyTorch integration.
CLI tool that transcribes audio 10x faster using pipeline optimizations.
Open-source speaker diarization and voice activity detection toolkit.