verl-project/verl

vllm cannot load model after megatron training

Open

#1757 opened on May 29, 2025

help wanted

Description

After Megatron training and converting the checkpoint to an HF model, I want to run inference with vLLM, but loading the model fails:

File "/python3.11/site-packages/vllm/model_executor/models/utils.py", line 250, in _load_module
    raise ValueError(msg)
ValueError: There is no module or parameter named 'decoder' in Qwen3ForCausalLM

After the model_merger.py step, the parameter names have changed.
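To help narrow this down, here is a minimal sketch for checking which top-level parameter prefixes the merged checkpoint contains. It assumes the merged output is a directory of `*.safetensors` shards and that a Hugging Face Qwen3ForCausalLM checkpoint is expected to use only the `model` and `lm_head` prefixes. Both are assumptions on my side, not something confirmed from the verl or vLLM code, and the helper names are made up for illustration.

```python
from collections import Counter
from pathlib import Path

# Assumption: top-level prefixes expected in an HF Qwen3ForCausalLM checkpoint.
EXPECTED_HF_PREFIXES = frozenset({"model", "lm_head"})


def summarize_prefixes(names):
    """Count parameter names by their first dotted component."""
    return Counter(name.split(".", 1)[0] for name in names)


def unexpected_prefixes(names, expected=EXPECTED_HF_PREFIXES):
    """Return the sorted top-level prefixes that are not in `expected`."""
    return sorted(p for p in summarize_prefixes(names) if p not in expected)


def checkpoint_tensor_names(checkpoint_dir):
    """List tensor names from every *.safetensors shard in a directory.

    Hypothetical helper: only reads the key names, not the tensor data.
    """
    from safetensors import safe_open  # imported lazily; needs safetensors + torch

    names = []
    for shard in sorted(Path(checkpoint_dir).glob("*.safetensors")):
        with safe_open(str(shard), framework="pt") as f:
            names.extend(f.keys())
    return names


# Illustrative names only (not copied from a real checkpoint):
example_names = [
    "decoder.layers.0.self_attention.linear_qkv.weight",
    "model.norm.weight",
    "lm_head.weight",
]
leftover = unexpected_prefixes(example_names)
```

On a real checkpoint, something like `unexpected_prefixes(checkpoint_tensor_names("<merged model dir>"))` would show whether names such as `decoder` survived the conversion, which would match the ValueError above.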
