vllm-project/vllm
[Feature]: Upstream DGX Spark improvements from Avarok-Cybersecurity/dgx-vllm
Open
#37141 opened on March 16, 2026
feature request, help wanted, nvidia, quantization
Description
🚀 The feature, motivation and pitch
As the README in Avarok-Cybersecurity/dgx-vllm shows, vLLM's FP4 performance on DGX Spark has gaps. The fixes do not look too complicated, so we should try to upstream the changes from that repo into vLLM.
Avarok-Cybersecurity/dgx-vllm#7
Alternatives
No response
Additional context
No response
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.