GPTQ (Quantization)

Post-training quantization method for compressing large language models.

Open SourceSelf HostedOffline CapableGPU Required (8GB+ VRAM)
0.0 (0)

About

GPTQ is a one-shot post-training quantization method for large language models. Compresses models to 4-bit or 3-bit precision with minimal quality loss. Enables running large models on consumer GPUs. By IST Austria.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
Apache-2.0
Minimum VRAM
8 GB
Added
Apr 3, 2026

Similar Tools

Featured

Library for training LLMs with reinforcement learning (RLHF, DPO, PPO).

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Featured

All-in-one framework for fine-tuning 100+ LLMs with web UI.

Open SourceSelf HostedOfflineGPU 8GB+
Easy
0.0 (0)

Low-code framework for building custom AI models by Predibase.

Open SourceSelf HostedOfflineGPU 8GB+
Easy
0.0 (0)