lm-sys/FastChat
View on GitHubsupport for 4bit quantization from transfomer library.
Open
#1,798 opened on Jun 27, 2023
enhancementgood first issue
Description
Loading a vicuna13B using 4bit quantization from the transformers library is possible load_in_4bit. How difficult could be for Fastach to support it?