CTranslate2

Fast inference engine for Transformer models, built on a custom C++ runtime.

Open Source, Self Hosted, Offline Capable

About

CTranslate2 by SYSTRAN is a fast inference engine for Transformer models. It supports int8, int16, and float16 quantization, and can run up to 4x faster while using up to 4x less memory than PyTorch. It handles translation, text generation, and speech models, and is released under the MIT license.
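
To make the memory claim concrete, the following is a minimal, generic sketch of symmetric int8 quantization in plain Python. It is not CTranslate2's implementation and does not use its API. The function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical, chosen only for illustration. It shows how storing each weight in 8 bits instead of a 32-bit float cuts storage to a quarter, at the cost of a small rounding error.

```python
from array import array


def quantize_int8(weights):
    """Symmetric int8 quantization of a list of floats (illustrative only).

    Returns (quantized, scale) such that weight ~= quantized / scale.
    """
    max_abs = max((abs(w) for w in weights), default=0.0)
    # Map the largest magnitude to 127; use 1.0 when every weight is zero.
    scale = 127.0 / max_abs if max_abs > 0 else 1.0
    # array("b") stores signed 8-bit integers, one byte per value.
    quantized = array("b", (round(w * scale) for w in weights))
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float weights from their int8 representation."""
    return [q / scale for q in quantized]


# Hypothetical weights, stored once as 32-bit floats and once as int8.
weights = [0.5, -1.0, 0.25, 0.0]
float32_weights = array("f", weights)
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

float32_bytes = float32_weights.itemsize * len(float32_weights)
int8_bytes = q.itemsize * len(q)
print("float32 bytes:", float32_bytes, "int8 bytes:", int8_bytes)
print("restored weights:", restored)
```

The restored values differ from the originals by at most about half a quantization step, which is the usual accuracy trade-off behind reduced-precision inference.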

Details

Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
License: MIT
Added: Apr 3, 2026

Similar Tools

Desktop application for discovering, downloading, and running local LLMs.

Self Hosted, Offline
Beginner

Open-source ChatGPT alternative that runs 100% offline on your computer.

Open Source, Self Hosted, Offline
Beginner

Open-source ecosystem for running LLMs locally on consumer hardware.

Open Source, Self Hosted, Offline
Beginner