Tools/Model Training & Fine-Tuning/AWQ (Activation-aware Weight Quantization)

AWQ (Activation-aware Weight Quantization)

Efficient LLM quantization preserving important weight channels.

Open SourceSelf HostedOffline CapableGPU Required (8GB+ VRAM)
0.0 (0)

About

AWQ, Activation-aware Weight Quantization from MIT HAN Lab, compresses large language and multimodal models to 3 or 4-bit weights by protecting the small fraction of salient weight channels identified from activation statistics, preserving more quality than naive quantization at the same bit-width. It pairs with the TinyChat runtime for efficient 4-bit inference, including vision-language models. Released under the MIT license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
MIT
Minimum VRAM
8 GB
Added
Apr 3, 2026

Related Tools

No-code tool by Hugging Face for training ML models automatically.

Open SourceSelf HostedOfflineGPU 8GB+
Beginner
0.0 (0)

Video model fine-tuning toolkit by Hugging Face Diffusers team.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)

Low-code framework for building custom AI models by Predibase.

Open SourceSelf HostedOfflineGPU 8GB+
Easy
0.0 (0)
Featured

Library for training LLMs with reinforcement learning (RLHF, DPO, PPO).

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Featured

All-in-one framework for fine-tuning 100+ LLMs with web UI.

Open SourceSelf HostedOfflineGPU 8GB+
Easy
0.0 (0)
Featured

Efficient fine-tuning method using 4-bit quantized base model with LoRA adapters.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Browse all Model Training & Fine-Tuning tools