[Feature request] Support loading GGUF and GGML model format · lm-sys/FastChat#2410

5 comments · 7 reactions · 0 assignees · Python · 4,736 forks

good first issue

This issue does not include a description.

Research direction: Explore how other LLM serving frameworks (e.g., llama.cpp, Ollama) load the GGUF and GGML model formats. Identify the changes FastChat's model loader needs in order to support these formats. Focus on the `fastchat.model.model_adapter` module and related modules.
Tech stack: python
Domain: machine learning, backend
Issue type: Feature
Difficulty: 3
Estimated time: Half day
Activity status: Active
Clarity: Mostly clear
Prerequisites: Python, familiarity with LLM model formats
Newbie friendliness: 50
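
As a starting point for the research direction above, a loader first has to recognize a GGUF file. The sketch below is a minimal, illustrative check and not part of FastChat: it relies only on the fact that a GGUF file begins with the 4-byte magic `GGUF` followed by a little-endian `uint32` version. The helper names `read_gguf_version` and `looks_like_gguf` are hypothetical, and legacy GGML files use different magic values that this sketch does not cover.

```python
import struct
from pathlib import Path

# GGUF files start with this 4-byte magic, followed by a little-endian uint32 version.
GGUF_MAGIC = b"GGUF"


def read_gguf_version(path):
    """Return the GGUF version if the file starts with the GGUF magic, else None.

    Hypothetical helper for illustration; not a FastChat API.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    (version,) = struct.unpack("<I", header[4:8])
    return version


def looks_like_gguf(model_path):
    """Return True if model_path is an existing file with a GGUF header."""
    p = Path(model_path)
    return p.is_file() and read_gguf_version(p) is not None
```

One possible design, stated here as an assumption, is to call a check like `looks_like_gguf` from a new adapter's `match` method in `fastchat.model.model_adapter` and to delegate the actual loading to a GGUF-capable backend such as the llama.cpp Python bindings. Whether that fits FastChat's worker and tokenizer interfaces would need to be verified against the code.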