vllm-project/vllm

[Feature]: Batch Invariant Feature and Performance Optimization

Open

#27,433 创建于 2025年10月23日

在 GitHub 查看
 (47 评论) (27 反应) (3 负责人)Python (80,034 star) (16,816 fork)batch import
feature requestgood first issue

描述

🚀 The feature, motivation and pitch

We have basically support Batch Invariant based on https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

https://github.com/orgs/vllm-project/projects/29/views/1

But there are still some work to be done, so here is the issue to track the work

TODOs:

Nice to have:

Model coverage

https://docs.vllm.ai/en/latest/features/batch_invariance/#tested-models

🙋Help needed for validations of more models.

  1. Test a model using the script in https://github.com/vllm-project/vllm/tree/main/tests/v1/determinism
  2. Submit a PR updating the document

贡献者指南