Megatron-LM
NVIDIA framework for training multi-billion parameter transformer models.
About
Megatron-LM by NVIDIA is a GPU-optimized framework for training transformer models at very large scale. It ships Megatron Core, a composable library of transformer building blocks with tensor, pipeline, data, expert, and context parallelism plus mixed precision, alongside reference training scripts. It has been used to train many of the largest open models. Released under a custom NVIDIA open-source license.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- AI Frameworks & Libraries
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Expert (5/5)
- License
- Custom
- Minimum VRAM
- 16 GB
- Added
- Apr 3, 2026
Related Tools
Tensor library for machine learning on commodity hardware
Structured output extraction from LLMs with Pydantic
Deploy LangChain runnables as REST APIs
Unified system for large-scale distributed training and inference.
High-level deep learning library making neural nets accessible with best practices.
Open-source machine learning framework by Meta with dynamic computation graphs.