Voxtral
Speech understanding model by Mistral AI for transcription and analysis.
Open SourceSelf HostedOffline CapableGPU Required (8GB+ VRAM)
0.0 (0)
About
Voxtral by Mistral AI is a speech understanding model handling transcription, translation, and spoken content analysis. Part of the Mistral model family with strong multilingual capabilities.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Intermediate (3/5)
- Minimum VRAM
- 8 GB
- Added
- Apr 3, 2026
Related Tools
Whisper extension providing word-level timestamps for transcription.
Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)
Multilingual ASR model by NVIDIA supporting 4 languages with translation.
Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Convolution-augmented transformer for speech recognition in ESPnet toolkit.
Open SourceSelf HostedOfflineGPU 8GB+
Advanced
0.0 (0)
Pre-trained speech models for STT, TTS, and VAD with simple PyTorch integration.
Open SourceSelf HostedOffline
Beginner
0.0 (0)
CLI tool that transcribes audio 10x faster using pipeline optimizations.
Open SourceSelf HostedOfflineGPU 6GB+
Easy
0.0 (0)
Open-source speaker diarization and voice activity detection toolkit.
Open SourceSelf HostedOfflineGPU 4GB+
Intermediate
0.0 (0)