vllm-project/vllm

[Feature]: Improve DCP error messages

Open

#28,407 创建于 2025年11月10日

在 GitHub 查看
 (4 评论) (2 反应) (0 负责人)Python (80,034 star) (16,816 fork)batch import
feature requestgood first issuestale

描述

🚀 The feature, motivation and pitch

Currently if a backend doesn't support DCP we get the following error message

AssertionError: DCP requires attention impls to return the softmax lse for decode, but the impl FlashInferImpl does not return the softmax lse for decode.

It would good to suggest to the user to try an alternative backend using VLLM_ATTENTION_BACKEND

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

贡献者指南