verl-project/verl

Fused CE loss integration

Open

#97 opened on Jan 12, 2025

View on GitHub
 (2 comments) (1 reaction) (0 assignees)Python (21,533 stars) (3,940 forks)auto 404
call for contributionhelp wanted

Description

Integrate it with main stream models: https://github.com/apple/ml-cross-entropy so that model with large vocab size uses much less memory

Contributor guide