casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

PythonStars 2291Forks 294Watchers 2291Open issues 202License MIT License
Details
仓库信息
Ownercasper-hansen
Last pushed2025-05-11
Last updated2025-12-14
Issues fetched at

Stats

Community at a glance

Loading...

Loading

--

Loading

--

Loading

--

Loading

--