verl-project/verl

vllm cannot load model after megatron training

Open

#1,757 创建于 2025年5月29日

在 GitHub 查看
 (0 评论) (0 反应) (0 负责人)Python (21,533 star) (3,940 fork)auto 404
help wanted

描述

After megatron training and convert to hf model, i want to infer using vllm, which meet problem when loading. File "/python3.11/site-packages/vllm/model_executor/models/utils.py", line 250, in _load_module raise ValueError(msg) ValueError: There is no module or parameter named 'decoder' in Qwen3ForCausalLM After model_merger.py process, the name of parameters changed.

贡献者指南