unslothai/unsloth

[Bug] After adding import unsloth to the first line of the script, the GRPOTrainer fails to run properly; however, it works normally again once this import is removed. The Sophia optimizer interface being used was generated by an AI.

Open

#3591 opened on November 12, 2025

(11 comments) (0 reactions) (0 assignees) Python (64,271 stars) (5,658 forks) batch import
help wanted

Description

sophia_grpo.py: this script was generated by AI and may have many issues, for example being unable to use multiple GPUs with fp16.

xl.py token_utils.py

