Insanely Fast Whisper

CLI tool that transcribes audio 10x faster using pipeline optimizations.

Open Source · Self Hosted · Offline Capable · GPU Required (6GB+ VRAM)

About

Insanely Fast Whisper is a command-line tool that transcribes audio on-device. It combines Hugging Face Transformers, Optimum, and Flash Attention 2 with batched inference, and can transcribe around 2.5 hours of audio in under a couple of minutes on an A100. It is a community-driven CLI wrapper around optimized Whisper inference, released under the MIT license.
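
The batched approach described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual source: the model ID, the 30-second chunk length, the batch size of 24, and the `cuda:0` device are all assumptions chosen for the example, and the helper names `plan_batches`, `speedup`, and `transcribe` are hypothetical. The chunk estimate ignores the overlap the pipeline adds between neighbouring chunks, so real chunk counts will be somewhat higher.

```python
import math


def plan_batches(audio_seconds: float, chunk_seconds: float = 30.0, batch_size: int = 24) -> dict:
    """Estimate how many chunks and batched forward passes a recording needs.

    Assumes fixed-length chunks with no overlap (an approximation).
    """
    chunks = math.ceil(audio_seconds / chunk_seconds)
    batches = math.ceil(chunks / batch_size)
    return {"chunks": chunks, "batches": batches}


def speedup(audio_seconds: float, wall_seconds: float) -> float:
    """Seconds of audio processed per second of wall-clock time."""
    return audio_seconds / wall_seconds


def transcribe(path: str, model_id: str = "openai/whisper-large-v3", batch_size: int = 24):
    """Chunked, batched Whisper inference via the Transformers ASR pipeline.

    Needs a CUDA GPU plus the torch, transformers, and flash-attn packages.
    """
    # Heavy imports stay inside the function so the helpers above run anywhere.
    import torch
    from transformers import pipeline

    pipe = pipeline(
        "automatic-speech-recognition",
        model=model_id,  # assumed checkpoint; any Whisper checkpoint should work
        torch_dtype=torch.float16,
        device="cuda:0",
        # Drop this argument to fall back to the default attention implementation.
        model_kwargs={"attn_implementation": "flash_attention_2"},
    )
    # Long audio is split into 30 s chunks that are decoded batch_size at a time.
    return pipe(path, chunk_length_s=30, batch_size=batch_size, return_timestamps=True)


if __name__ == "__main__":
    # 2.5 hours of audio is 9000 seconds.
    print(plan_batches(9000))
    # Finishing in 120 s would correspond to this real-time speedup.
    print(speedup(9000, 120))
```

Under these assumptions, 2.5 hours of audio splits into a few hundred chunks that are decoded in a small number of batched passes, which is where most of the speedup over sequential decoding would come from.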

Details

Price: Free
Platform: Local/Desktop
Difficulty: Easy (2/5)
License: MIT
Minimum VRAM: 6 GB
Added: Apr 3, 2026

Related Tools

Whisper extension providing word-level timestamps for transcription.

Open Source · Self Hosted · Offline · GPU 4GB+
Easy

Multilingual ASR model by NVIDIA supporting 4 languages with translation.

Open Source · Self Hosted · Offline · GPU 8GB+
Intermediate

Convolution-augmented transformer for speech recognition in ESPnet toolkit.

Open Source · Self Hosted · Offline · GPU 8GB+
Advanced

Pre-trained speech models for STT, TTS, and VAD with simple PyTorch integration.

Open Source · Self Hosted · Offline
Beginner

Self-supervised speech representation model by Meta for ASR.

Open Source · Self Hosted · Offline · GPU 8GB+
Advanced

Open-source speaker diarization and voice activity detection toolkit.

Open Source · Self Hosted · Offline · GPU 4GB+
Intermediate