Tools/LLM Inference & Serving/Text Generation Inference (TGI)

Text Generation Inference (TGI)

Production-ready LLM serving toolkit by Hugging Face.

Open SourceSelf HostedOffline CapableGPU Required (8GB+ VRAM)
0.0 (0)

About

TGI (Text Generation Inference) by Hugging Face is a production-ready toolkit for serving LLMs. Features continuous batching, tensor parallelism, Flash Attention, quantization (GPTQ, AWQ, EETQ), and streaming. Powers Hugging Face Inference Endpoints. Apache 2.0 license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
Apache-2.0
Minimum VRAM
8 GB
Added
Apr 3, 2026

Similar Tools

Featured

Desktop application for discovering, downloading, and running local LLMs.

Self HostedOffline
Beginner
0.0 (0)

Open-source ChatGPT alternative that runs 100% offline on your computer.

Open SourceSelf HostedOffline
Beginner
0.0 (0)

Open-source ecosystem for running LLMs locally on consumer hardware.

Open SourceSelf HostedOffline
Beginner
0.0 (0)