verl-project/verl

[Bug][CI] FSDP2 test in `model_rmpad` job seems unstable

Open

#1388 opened on May 4, 2025

(0 comments) (1 reaction) (0 assignees) Python (21,533 stars) (3,940 forks)
Labels: bug, call for contribution, good first issue

Description

Motivation

https://github.com/volcengine/verl/actions/workflows/model.yml shows that:

  1. the FSDP2 test in the `model_rmpad` job fails intermittently;
  2. the same test also passes on other runs.

Plan

  • Find a setup that reproduces the error reliably (possibly using the test container)
  • Locate the root cause
  • Fix the bug
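
As a starting point for the first step of the plan, here is a minimal sketch of a helper that reruns a test command many times and counts how often it fails. The function name `measure_flakiness` and its parameters are hypothetical, and the actual FSDP2 test command is not given in this issue, so it has to be supplied by whoever runs the script.

```python
import subprocess
import sys


def measure_flakiness(cmd, runs=20, timeout=None):
    """Run `cmd` (a list of arguments) `runs` times.

    Returns a (passes, failures) tuple. A run counts as a failure when the
    process exits with a non-zero code or exceeds `timeout` seconds.
    """
    passes = 0
    failures = 0
    for i in range(runs):
        try:
            result = subprocess.run(
                cmd, capture_output=True, text=True, timeout=timeout
            )
            ok = result.returncode == 0
            detail = f"exit code {result.returncode}"
        except subprocess.TimeoutExpired:
            ok = False
            detail = "timed out"
        if ok:
            passes += 1
        else:
            failures += 1
            # Report each failing run on stderr so it can be inspected later.
            print(f"run {i + 1}/{runs} failed: {detail}", file=sys.stderr)
    return passes, failures


# Example (the command is a placeholder, not the real test invocation):
# passes, failures = measure_flakiness(["<test command>", "<args>"], runs=50)
# print(f"{failures} of {passes + failures} runs failed")
```

A failure rate measured this way inside the test container gives a baseline to compare against once a candidate fix is in place.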

Additional Info.
