verl-project/verl

vllm cannot load model after megatron training

Open

#1,757 opened on 2025年5月29日

GitHub で見る
 (0 comments) (0 reactions) (0 assignees)Python (21,533 stars) (3,940 forks)auto 404
help wanted

説明

After megatron training and convert to hf model, i want to infer using vllm, which meet problem when loading. File "/python3.11/site-packages/vllm/model_executor/models/utils.py", line 250, in _load_module raise ValueError(msg) ValueError: There is no module or parameter named 'decoder' in Qwen3ForCausalLM After model_merger.py process, the name of parameters changed.

コントリビューターガイド