[Misc]: Discussion on accuracy variance · vllm-project/vllm-ascend#6623

help wanted, wait-feedback

Repository metrics

Stars: (2,180 stars)
PR merge metrics: (average merge time 5d 16h) (419 PRs merged in 30d)

Description

Anything you want to discuss about vllm on ascend.

Because of batch variance, we cannot guarantee that the same input will yield the same output in a multi-batch inference scenario. This accuracy variance is more pronounced on certain datasets.
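One common source of batch-dependent output, assumed here for illustration and not confirmed as the cause on Ascend, is that floating-point addition is not associative: when a reduction is split into different chunks, the rounding differs. The sketch below uses plain Python only. `chunked_sum` is a hypothetical helper and the input values are synthetic; it is not vllm-ascend kernel code.

```python
import math
import random


def chunked_sum(values, chunk_size):
    """Sum `values` in chunks of `chunk_size`, then add up the partial sums."""
    total = 0.0
    for start in range(0, len(values), chunk_size):
        partial = 0.0
        for v in values[start:start + chunk_size]:
            partial += v
        total += partial
    return total


# Floating-point addition is not associative.
left = (0.1 + 0.2) + 0.3   # 0.6000000000000001
right = 0.1 + (0.2 + 0.3)  # 0.6
print(left == right)       # False

# Synthetic values spanning many magnitudes (illustrative only, not model data).
random.seed(0)
values = [
    random.uniform(-1.0, 1.0) * 10.0 ** random.randint(-8, 8)
    for _ in range(10_000)
]

reference = math.fsum(values)  # correctly rounded sum, used as a baseline
for chunk_size in (1, 7, 64, 1024):
    result = chunked_sum(values, chunk_size)
    print(f"chunk_size={chunk_size:5d} sum={result!r} "
          f"diff_from_fsum={result - reference:.3e}")
```

Small differences like these can change which token wins when two logits are nearly tied, and the generations then diverge from that point on.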

model	dataset	accuracy (min~max)	accuracy spread (max - min)
deepseek-v3.1	GPQA	74~82	8
qwen3-235b	GPQA	64~71	7
qwen3-480b	GPQA	60~67	7
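The last column appears to be the gap between the lowest and highest observed accuracy, not a statistical variance. A minimal sketch that recomputes it from the reported ranges (the `spread` helper is hypothetical, and the rows are copied from the table above):

```python
# Rows copied from the table above: (model, dataset, reported accuracy range).
rows = [
    ("deepseek-v3.1", "GPQA", "74~82"),
    ("qwen3-235b", "GPQA", "64~71"),
    ("qwen3-480b", "GPQA", "60~67"),
]


def spread(acc_range: str) -> int:
    """Return max - min for a range written as 'low~high'."""
    low, high = (int(part) for part in acc_range.split("~"))
    return high - low


spreads = {model: spread(acc_range) for model, _, acc_range in rows}
print(spreads)
```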

We need to run these cases on GPU, or with another inference engine such as sglang, to check whether this is an accuracy bug.

If GPU shows the same accuracy variance as Ascend, we consider this variance reasonable for this dataset. Otherwise, we need to fix it as an accuracy bug.
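The decision rule above could be sketched as follows, assuming per-run accuracies have been collected on both backends with identical model, dataset, and sampling settings. `summarize`, `classify`, and the `tolerance` threshold are hypothetical names chosen for this sketch, and the numbers in the usage lines are placeholders, not measurements.

```python
from statistics import mean


def summarize(accuracies):
    """Summarize repeated accuracy measurements for one backend."""
    return {
        "min": min(accuracies),
        "max": max(accuracies),
        "mean": mean(accuracies),
        "spread": max(accuracies) - min(accuracies),
    }


def classify(ascend_runs, gpu_runs, tolerance=2.0):
    """Compare accuracy spreads; `tolerance` (in points) is an arbitrary choice."""
    ascend = summarize(ascend_runs)
    gpu = summarize(gpu_runs)
    if ascend["spread"] <= gpu["spread"] + tolerance:
        return "comparable"      # variance looks inherent to the dataset
    return "ascend-specific"     # debug the Ascend inference path


# PLACEHOLDER values for illustration only; replace with real run results.
ascend_runs = [74.0, 78.0, 82.0]
gpu_runs = [75.0, 79.0, 82.0]
print(classify(ascend_runs, gpu_runs))
```

A few runs per backend give only a rough picture, so more repetitions make the comparison more trustworthy.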

Contributor guide

Research direction: Compare the accuracy variance on GPU and on Ascend by running the same datasets. If the GPU also shows variance, it is inherent; otherwise, debug the Ascend inference path.
Tech stack: Python
Domain: machine learning, AI, backend
Issue type: Bug
Difficulty: 2
Estimated time: 1-2 days
Activity status: Active
Clarity: Mostly clear
Prerequisites: Python, vLLM, Ascend
Beginner friendliness: 30
