lm-sys/FastChat
Voir sur GitHubsupport for 4bit quantization from transfomer library.
Open
#1 798 ouverte le 27 juin 2023
enhancementgood first issue
Description
Loading a vicuna13B using 4bit quantization from the transformers library is possible load_in_4bit. How difficult could be for Fastach to support it?