vllm-project/vllm

[Feature]: Improve DCP error messages

Open

#28407 opened on Nov 10, 2025

(4 comments) (2 reactions) (0 assignees) Python (80,034 stars) (16,816 forks)
Labels: feature request, good first issue, stale

Description

🚀 The feature, motivation and pitch

Currently, if an attention backend doesn't support DCP, we get the following error message:

AssertionError: DCP requires attention impls to return the softmax lse for decode, but the impl FlashInferImpl does not return the softmax lse for decode.

It would be good for the error message to suggest that the user try an alternative attention backend by setting VLLM_ATTENTION_BACKEND.
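As a rough illustration of the request, the check could raise an error that keeps the existing wording and adds the hint. This is only a sketch under stated assumptions: `check_dcp_support` and the `returns_decode_lse` attribute are hypothetical names made up for this example and are not taken from the vLLM code base. Only the `VLLM_ATTENTION_BACKEND` environment variable and the original message text come from this issue.

```python
import os


def check_dcp_support(impl: object, dcp_world_size: int) -> None:
    """Raise an actionable error if `impl` cannot be used with DCP.

    Hypothetical helper: `returns_decode_lse` is an assumed attribute name
    standing in for however the impl advertises LSE support.
    """
    if dcp_world_size <= 1:
        return  # DCP is not in use, so there is nothing to check.
    if getattr(impl, "returns_decode_lse", False):
        return  # The impl returns the softmax LSE for decode.

    impl_name = type(impl).__name__
    # Report what the user has currently selected, if anything.
    current = os.environ.get("VLLM_ATTENTION_BACKEND", "auto-selected")
    raise ValueError(
        f"DCP requires attention impls to return the softmax lse for "
        f"decode, but the impl {impl_name} does not return the softmax "
        f"lse for decode. Current backend setting: {current}. Try "
        f"selecting a different attention backend by setting the "
        f"VLLM_ATTENTION_BACKEND environment variable."
    )
```

Raising an exception rather than using `assert` keeps the check active when Python runs with `-O`, which strips assertions. Whether the real check should list specific backends that support DCP is left open here, since that list is not given in this issue.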

Alternatives

No response

Additional context

No response
