unslothai/unsloth
在 GitHub 查看[Bug] After adding import unsloth to the first line of the script, the GRPOTrainer fails to run properly; however, it works normally again once this import is removed. The Sophia optimizer interface being used was generated by an AI.
Open
#3,591 创建于 2025年11月12日
help wanted
描述
sophia_grpo.py Scripts generated by AI may encounter many issues, such as the inability to utilize multiple GPUs using fp16