lm-sys/FastChat
Support for 4-bit quantization from the transformers library
Open
#1798 opened on Jun 27, 2023
enhancement, good first issue
Description
Loading Vicuna-13B with 4-bit quantization is already possible in the transformers library through the load_in_4bit option. How difficult would it be for FastChat to support it?
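
For reference, here is a minimal sketch of what the transformers side looks like. It is not FastChat code: the helper names `four_bit_kwargs` and `load_model_4bit` are hypothetical, and `model_path` is whatever local path or hub ID you use for the Vicuna weights. It assumes a transformers release that accepts `load_in_4bit` in `from_pretrained`, with `bitsandbytes` and `accelerate` installed and a CUDA GPU available. Newer releases may prefer passing a `BitsAndBytesConfig` through `quantization_config` instead.

```python
def four_bit_kwargs(device_map="auto"):
    """Keyword arguments for from_pretrained that request 4-bit weights.

    Hypothetical helper: it only collects the options mentioned in this issue.
    """
    return {"load_in_4bit": True, "device_map": device_map}


def load_model_4bit(model_path):
    """Load a causal LM and its tokenizer with 4-bit quantized weights.

    Assumes transformers, accelerate and bitsandbytes are installed.
    """
    # Imported lazily so four_bit_kwargs() can be used without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_path, use_fast=False)
    model = AutoModelForCausalLM.from_pretrained(model_path, **four_bit_kwargs())
    return model, tokenizer
```

If something like this works standalone, supporting it in FastChat would presumably come down to exposing a flag that forwards these keyword arguments to the existing model-loading path.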