CTranslate2
Fast inference engine for Transformer models, built on a custom C++ runtime.
Open Source · Self Hosted · Offline Capable
About
CTranslate2 by SYSTRAN is a fast inference engine for Transformer models. It supports int8, int16, and float16 quantization, and is described as running up to 4x faster while using up to 4x less memory than PyTorch. It handles translation, text generation, and speech models, and is released under the MIT license.
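To make the quantization claim concrete, here is a minimal sketch of symmetric int8 quantization in plain Python. It is an illustration only and is not CTranslate2's implementation: the helper names `quantize_int8` and `dequantize` and the sample weights are hypothetical. It shows where a roughly 4x storage saving can come from when 4-byte float32 values are stored as 1-byte integers plus one shared scale.

```python
from array import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization: each weight w is stored as q with w ≈ q * scale."""
    max_abs = max(abs(w) for w in weights)
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from the int8 codes."""
    return [v * scale for v in q]


# Hypothetical sample weights, stored as 32-bit floats.
weights = array("f", [0.5, -1.27, 0.003, 0.9, -0.25, 1.0])

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

bytes_fp32 = len(weights) * weights.itemsize  # 4 bytes per value
bytes_int8 = len(q) * q.itemsize              # 1 byte per value
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"float32: {bytes_fp32} bytes, int8: {bytes_int8} bytes")
print(f"largest rounding error: {max_error:.5f} (scale = {scale:.5f})")
```

The rounding error per weight is bounded by half the scale, which is the accuracy cost paid for the smaller footprint. Real engines typically quantize per layer or per channel and keep some tensors in higher precision, so actual savings and speedups depend on the model and hardware.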
Details
- Category: LLM Inference & Serving
- Price: Free
- Platform: Local/Desktop
- Difficulty: Intermediate (3/5)
- License: MIT
- Added: Apr 3, 2026
Similar Tools
Featured
Desktop application for discovering, downloading, and running local LLMs.
Self Hosted · Offline
Beginner
Open-source ChatGPT alternative that runs 100% offline on your computer.
Open Source · Self Hosted · Offline
Beginner
Open-source ecosystem for running LLMs locally on consumer hardware.
Open Source · Self Hosted · Offline
Beginner