lm-sys/FastChat
GitHub で見るsupport for 4bit quantization from transfomer library.
Open
#1,798 opened on 2023年6月27日
enhancementgood first issue
説明
Loading a vicuna13B using 4bit quantization from the transformers library is possible load_in_4bit. How difficult could be for Fastach to support it?