verl-project/verl

[Bug] workflow `e2e_prime` is sometimes stuck

Open

#1344 opened on Apr 30, 2025

View on GitHub
 (1 comment) (0 reactions) (0 assignees)Python (21,533 stars) (3,940 forks)auto 404
bugcall for contributiongood first issue

Description

Motivation

The workflow e2e_prime is sometimes stuck and reaches timeout, while sometimes finishes normally. See https://github.com/volcengine/verl/actions/workflows/e2e_prime.yml for details.

Plan

  • Reproduce the issue locally with the container and commands used in the workflow.
  • Check where the root cause is.

Contributor guide