[Misc]: Discussion on accuracy variance · vllm-project/vllm-ascend#6623


Labels: help wanted, wait-feedback

Repository metrics

Stars: (2,180 stars)
PR merge metrics: (Avg merge 4d 5h) (559 merged PRs in 30d)

Description

Anything you want to discuss about vllm on ascend.

Because of batch variance, we cannot guarantee that the same input will yield the same output when it is served in batches of different composition. The resulting accuracy variance is more pronounced on certain datasets.
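One commonly cited source of this kind of batch variance is that floating-point addition is not associative, so a kernel that reduces in a different order for a different batch shape can produce slightly different logits. Whether this is the cause here is an assumption, not something this issue confirms. A minimal pure-Python sketch of the effect (the helper `reduce_in_order` is made up for illustration):

```python
def reduce_in_order(values, order):
    """Add values one by one in the given index order with plain float addition."""
    total = 0.0
    for i in order:
        total += values[i]
    return total


values = [0.1, 0.2, 0.3]

# Same numbers, two different reduction orders.
forward = reduce_in_order(values, [0, 1, 2])
backward = reduce_in_order(values, [2, 1, 0])

print(forward == backward)      # False: the two orders round differently
print(abs(forward - backward))  # tiny, but non-zero
```

A difference this small only matters when two candidate tokens have nearly equal logits, but once one token flips, the rest of the generation can diverge.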

model	dataset	acc (min~max)	acc variance (max - min)
deepseek-v3.1	GPQA	74~82	8
qwen3-235b	GPQA	64~71	7
qwen3-480b	GPQA	60~67	7

We need to test these cases on GPU, or with another inference engine such as sglang, to check whether this is an accuracy bug.
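A rough way to run the same check on any backend is to compare each prompt's output when sent alone against its output when sent inside a batch. The sketch below assumes a hypothetical `generate` callable that wraps whichever engine is under test (ideally with greedy decoding, so sampling randomness is excluded) and maps a list of prompts to a list of output strings. Neither `generate` nor `find_batch_sensitive_prompts` is an existing API.

```python
from typing import Callable, List, Sequence


def find_batch_sensitive_prompts(
    generate: Callable[[List[str]], List[str]],
    prompts: Sequence[str],
    batch_size: int,
) -> List[str]:
    """Return the prompts whose output differs between solo and batched runs."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")

    # Baseline: every prompt is sent on its own (batch of one).
    solo_outputs = [generate([prompt])[0] for prompt in prompts]

    sensitive = []
    for start in range(0, len(prompts), batch_size):
        batch = list(prompts[start:start + batch_size])
        batched_outputs = generate(batch)
        for offset, text in enumerate(batched_outputs):
            if text != solo_outputs[start + offset]:
                sensitive.append(batch[offset])
    return sensitive
```

Running this with the same prompts on Ascend and on GPU gives two lists that can be compared directly: an empty list on GPU and a non-empty one on Ascend would point at the Ascend path.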

If GPU shows a similar accuracy variance, we believe this variance is reasonable for the dataset. Otherwise, it is an accuracy bug on our side that we need to fix.
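The decision rule above could be written down roughly as follows. This is only a sketch: the function names and the 1-point `tolerance` are assumptions chosen for illustration, not an agreed threshold.

```python
def spread(accuracies):
    """Max minus min over the accuracies of repeated runs."""
    return max(accuracies) - min(accuracies)


def classify_variance(ascend_runs, gpu_runs, tolerance=1.0):
    """Compare run-to-run accuracy spread on Ascend against GPU.

    Returns "comparable" when the Ascend spread does not exceed the GPU
    spread by more than `tolerance` points, else "ascend-specific".
    """
    if spread(ascend_runs) <= spread(gpu_runs) + tolerance:
        return "comparable"
    return "ascend-specific"


# The deepseek-v3.1 GPQA range reported in the table above.
ascend_runs = [74, 82]
print(spread(ascend_runs))  # 8

# gpu_runs must come from real GPU measurements, e.g.:
# print(classify_variance(ascend_runs, gpu_runs))
```

With only a handful of runs per backend the max-min spread is a noisy statistic, so the comparison is more convincing when both sides use the same number of runs.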

Contributor guide

Research direction: Compare accuracy variance on GPU vs Ascend by running the same models and datasets. If GPU shows comparable variance, it is likely inherent to the dataset; otherwise, debug the Ascend inference path.
Tech stack: python
Domain: machine learning, ai, backend
Issue type: Bug
Difficulty: 2
Estimated time: 1-2 days
Activity status: Active
Clarity: Mostly clear
Prerequisites: Python, vLLM, Ascend
Newbie friendliness: 30
