lm-sys/FastChat

support for 4bit quantization from transfomer library.

Open

#1,798 创建于 2023年6月27日

在 GitHub 查看
 (7 评论) (2 反应) (0 负责人)Python (38,959 star) (4,736 fork)batch import
enhancementgood first issue

描述

Loading a vicuna13B using 4bit quantization from the transformers library is possible load_in_4bit. How difficult could be for Fastach to support it?

贡献者指南

support for 4bit quantization from transfomer library. · lm-sys/FastChat#1798 | Good First Issue