[Misc]: Discussion on accuracy variance · vllm-project/vllm-ascend#6623

(1 comment) (0 reactions) (0 assignees) C++ (1318 forks)

help wanted, wait-feedback

Repository metrics

Stars: 2180
PR merge metrics: (average merge time 5d 16h) (419 PRs merged in 30 days)

Description

Anything you want to discuss about vllm on ascend.

Because of batch variance, we cannot guarantee that the same input will yield the same output when it is served as part of a multi-batch inference run. The resulting accuracy variance is more pronounced on certain datasets.
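One commonly cited source of this kind of batch variance is that floating-point addition is not associative, so a kernel that reduces values in a different order for a different batch shape can return slightly different numbers. The sketch below only illustrates that effect in plain Python. It is an assumption about a possible cause, not a diagnosis of vllm-ascend, and `ordered_sum` is a hypothetical helper.

```python
def ordered_sum(values):
    """Add floats strictly left to right, with no error compensation."""
    total = 0.0
    for v in values:
        total += v
    return total


values = [0.1, 0.2, 0.3]
forward = ordered_sum(values)                   # (0.1 + 0.2) + 0.3
backward = ordered_sum(list(reversed(values)))  # (0.3 + 0.2) + 0.1

print(forward == backward)      # False: same numbers, different order
print(abs(forward - backward))  # difference is on the order of 1e-16
```

A difference this small is usually harmless, but when two candidate tokens have nearly equal scores it can be enough to change which one is picked, and the outputs diverge from that token onward.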

model	dataset	acc (min~max)	acc spread (max - min)
deepseek-v3.1	GPQA	74~82	8
qwen3-235b	GPQA	64~71	7
qwen3-480b	GPQA	60~67	7

We need to run these cases on GPU, or with another inference engine such as sglang, to check whether this is an accuracy bug.
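A backend-agnostic way to run that check is to send each prompt once on its own and once inside a batch, then list the prompts whose output changed. The following is a minimal sketch, assuming greedy decoding and a hypothetical `generate` wrapper that you write around whichever engine is under test (vLLM on Ascend, vLLM on GPU, or sglang). Neither `GenerateFn` nor `batch_sensitive_prompts` comes from any of those projects.

```python
from typing import Callable, List, Sequence

# Hypothetical signature: takes a batch of prompts, returns one completion per prompt.
GenerateFn = Callable[[Sequence[str]], List[str]]


def batch_sensitive_prompts(generate: GenerateFn, prompts: Sequence[str]) -> List[str]:
    """Return the prompts whose completion differs between batched and solo runs."""
    batched_outputs = generate(list(prompts))
    changed = []
    for prompt, batched_out in zip(prompts, batched_outputs):
        solo_out = generate([prompt])[0]
        if solo_out != batched_out:
            changed.append(prompt)
    return changed


# Stand-in backends for demonstration only; they do not call any model.
def batch_dependent_backend(batch):
    # Output deliberately depends on the batch size.
    return [f"{p}|bs={len(batch)}" for p in batch]


def stable_backend(batch):
    # Output depends only on the prompt itself.
    return [p.upper() for p in batch]


print(batch_sensitive_prompts(batch_dependent_backend, ["q1", "q2"]))
print(batch_sensitive_prompts(stable_backend, ["q1", "q2"]))
```

Running the same prompt list through each engine and comparing the returned lists shows whether batch sensitivity is specific to one backend.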

If GPU shows the same accuracy variance as we do, we can consider the variance reasonable for this dataset. Otherwise, we need to fix it as an accuracy bug.
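The decision rule above can be sketched as a comparison of max-minus-min spreads over repeated runs. This is only an illustration: `spread`, `verdict`, and the `tolerance` threshold are hypothetical, and the reference scores in the example are placeholders rather than measurements.

```python
def spread(scores):
    """Max minus min over repeated accuracy measurements."""
    return max(scores) - min(scores)


def verdict(ascend_scores, reference_scores, tolerance=1.0):
    """Classify the Ascend spread relative to a reference backend.

    `tolerance` (in accuracy points) is an arbitrary illustrative threshold.
    """
    gap = spread(ascend_scores) - spread(reference_scores)
    if gap <= tolerance:
        return "comparable: variance looks inherent to the dataset"
    return "ascend spread is larger: investigate as an accuracy bug"


# Ascend endpoints for deepseek-v3.1 on GPQA, as reported in this issue.
ascend = [74, 82]
# PLACEHOLDER values, not measurements: replace with real GPU or sglang runs.
reference_placeholder = [75, 81]

print(spread(ascend))  # 8
print(verdict(ascend, reference_placeholder))
```

With only a handful of runs per backend the spread is a rough statistic, so more repetitions give a more trustworthy comparison.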

Contributor guide

Research direction: Compare accuracy variance on GPU and Ascend by running the same datasets. If the GPU also shows the variance, it is inherent to the dataset; otherwise, debug the Ascend inference path.
Tech stack: python
Domain: machine learning, ai, backend
Issue type: Bug
Difficulty: 2
Estimated time: 1-2 days
Activity status: Active
Clarity: Fairly clear
Prerequisites: Python, vLLM, Ascend
Beginner friendliness: 30
