NeMo
NVIDIA framework for building and training speech AI models including ASR, TTS, and speech LLMs.
About
NVIDIA NeMo is a framework for researchers and PyTorch developers building speech AI models. It covers automatic speech recognition, text-to-speech, and speech-focused large language models, leveraging pretrained checkpoints. NeMo targets audio, speech, and multimodal LLM applications and is installable via pip on systems with Python 3.12+ and PyTorch 2.6+.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- AI Frameworks & Libraries
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Advanced (4/5)
- License
- Apache-2.0
- Added
- May 7, 2026
Related Tools
Tensor library for machine learning on commodity hardware
Structured output extraction from LLMs with Pydantic
Deploy LangChain runnables as REST APIs
Unified system for large-scale distributed training and inference.
High-level deep learning library making neural nets accessible with best practices.
Open-source machine learning framework by Meta with dynamic computation graphs.