verl-project/verl

[FR] Tests for gradient accumulation

Open

#907 建立於 2025年4月3日

在 GitHub 查看
 (0 留言) (0 反應) (0 負責人)Python (21,533 star) (3,940 fork)auto 404
enhancementgood first issue

描述

Summary

Promise from #906.

Add tests to assert the losses on the same mini-batch are equal

  • between w/o & w/ gradient accumulation
  • using all the different loss aggregation modes

Plan

We might went to test end-to-end to avoid potential implication by any other changes.

Plan Pro
A) Inherit a new Trainer class to test Standalone without affecting the main config
B) Add a config term and modify update_* to test 1) Minimal code modification 2) Compatibility to any Trainer

貢獻者指南