vllm-project/vllm

[Feature]: Batch Invariant Feature and Performance Optimization

Open

#27433 opened on Oct 23, 2025

Labels: feature request, good first issue

Description

🚀 The feature, motivation and pitch

We now have basic support for batch invariance, based on https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

https://github.com/orgs/vllm-project/projects/29/views/1

There is still some work to be done, so this issue tracks it.

TODOs:

Nice to have:

Model coverage

https://docs.vllm.ai/en/latest/features/batch_invariance/#tested-models

🙋 Help is needed to validate more models.

  1. Test a model using the script in https://github.com/vllm-project/vllm/tree/main/tests/v1/determinism
  2. Submit a PR that adds the model to the tested-models list in the documentation linked above
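
For anyone new to the idea, here is a minimal sketch of what a batch-invariance check looks like in principle. It is not the script from `tests/v1/determinism` and does not use the vLLM API. `GenerateFn`, `is_batch_invariant`, `stable_generate`, and `leaky_generate` are hypothetical names made up for illustration, and the two toy generators only stand in for a real engine.

```python
from typing import Callable, Sequence

# Hypothetical signature (an assumption, not a vLLM type):
# takes a batch of prompts and returns one completion per prompt, in order.
GenerateFn = Callable[[Sequence[str]], list[str]]


def is_batch_invariant(generate: GenerateFn, prompt: str, fillers: Sequence[str]) -> bool:
    """Return True if `prompt` yields the same output alone and inside larger batches."""
    # Reference output: the prompt processed on its own (batch size 1).
    baseline = generate([prompt])[0]
    # Re-run the same prompt with 1, 2, ... filler prompts sharing the batch.
    for size in range(1, len(fillers) + 1):
        batch = [*fillers[:size], prompt]
        if generate(batch)[-1] != baseline:
            return False
    return True


# Toy stand-ins for a real engine, used only to exercise the check.
def stable_generate(prompts: Sequence[str]) -> list[str]:
    # Each output depends only on its own prompt.
    return [p.upper() for p in prompts]


def leaky_generate(prompts: Sequence[str]) -> list[str]:
    # Each output depends on the batch size, which the check should catch.
    return [f"{p}:{len(prompts)}" for p in prompts]


print(is_batch_invariant(stable_generate, "hello", ["a", "b", "c"]))  # True
print(is_batch_invariant(leaky_generate, "hello", ["a", "b", "c"]))   # False
```

With a real model, `generate` would wrap the engine call using deterministic sampling settings, and the comparison would typically be made on token IDs or logprobs rather than on decoded strings.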
