verl-project/verl

[FR] Tests for gradient accumulation

Open

#907 创建于 2025年4月3日

在 GitHub 查看
 (0 评论) (0 反应) (0 负责人)Python (21,533 star) (3,940 fork)auto 404
enhancementgood first issue

描述

Summary

Promise from #906.

Add tests to assert the losses on the same mini-batch are equal

  • between w/o & w/ gradient accumulation
  • using all the different loss aggregation modes

Plan

We might went to test end-to-end to avoid potential implication by any other changes.

Plan Pro
A) Inherit a new Trainer class to test Standalone without affecting the main config
B) Add a config term and modify update_* to test 1) Minimal code modification 2) Compatibility to any Trainer

贡献者指南

[FR] Tests for gradient accumulation · verl-project/verl#907 | Good First Issue