Kaldi
Established speech recognition toolkit used in research and production systems.
About
Kaldi is a widely-used speech recognition toolkit written in C++ for research and production ASR systems. Supports GMM, DNN, and neural network acoustic models. Used extensively in academic research and industry. Requires Linux and significant setup. Apache 2.0 license.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Expert (5/5)
- License
- Apache-2.0
- Added
- Apr 3, 2026
Related Tools
Whisper extension providing word-level timestamps for transcription.
Multilingual ASR model by NVIDIA supporting 4 languages with translation.
Convolution-augmented transformer for speech recognition in ESPnet toolkit.
Pre-trained speech models for STT, TTS, and VAD with simple PyTorch integration.
CLI tool that transcribes audio 10x faster using pipeline optimizations.
Open-source speaker diarization and voice activity detection toolkit.