[Misc]: Discussion on accuracy variance · vllm-project/vllm-ascend#6623

(1 留言) (0 反應) (0 負責人)C++ (1,318 fork)github user discovery

help wantedwait-feedback

倉庫指標

Star: (2,180 star)
PR 合併指標: (平均合併 5天 16小時) (30 天內合併 419 個 PR)

描述

Anything you want to discuss about vllm on ascend.

Because of batch variance, We cannot guarantee that the same input will yield the same output in a multi-batch inference case. And this accuracy variance is more explicit in certain datasets.

model	dataset	acc	acc variance
deepseek-v3.1	GPQA	74~82	8
qwen3-235b	GPQA	64~71	7
qwen3-480b	GPQA	60~67	7

We need to test these case on GPU or use other inference engine, such as sglang to check if this is an acc bug.

If GPU also has the same acc variance like us, we believe that this acc variance is reasonable for this dataset. Otherwise, we need to solve this acc bug.

貢獻者指南

研究方向: 在GPU和Ascend上執行相同資料集，比較準確率變異。如果GPU也顯示變異，則為固有效應；否則除錯Ascend推論路徑。
技術棧: python
領域: machine learningaibackend
議題類型: 錯誤
難度: 2
預計時間: 1-2 天
活動狀態: 活躍
清晰度: 大致清晰
前置要求: PythonvLLMAscend
新手友善度: 30

倉庫指標

描述

Anything you want to discuss about vllm on ascend.

貢獻者指南

每天在信箱收到新鮮 Easy issues。