Evaluation loss becomes constant · unslothai/unsloth#1067

7 comments · 0 reactions · 0 assignees · Python · 5,658 forks

currently fixing · good first issue

Repository metrics

Stars: 64,271
PR merge metrics: average time to merge 3 days 15 hours; 525 PRs merged in the last 30 days

Description

Hi!

I am testing out unsloth to fine-tune Llama 3.1 8B Instruct, following your notebook here.

The one change I made is adding an eval set. What is really strange is that the eval loss locks onto a specific value after around 300 steps. I mean identical down to the last decimal, not just flattening out. The training loss looks fine and behaves as expected.

I have changed many parameters and tried different things, but it always happens. Any idea what could cause this?
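
To tell a loss that is bit-for-bit frozen apart from one that has merely plateaued, it can help to scan the logged metrics programmatically. The following is a minimal sketch, not part of unsloth: `first_frozen_step` and `min_repeats` are hypothetical names, and it assumes the logs are a list of dicts with `step` and `eval_loss` keys, which is the shape of `trainer.state.log_history` when the run uses a Hugging Face `Trainer`-style object.

```python
from typing import Optional


def first_frozen_step(log_history: list, min_repeats: int = 3) -> Optional[int]:
    """Return the step where eval_loss begins its final run of identical values.

    Returns None if the trailing run is shorter than `min_repeats`.
    `log_history` is assumed to be a list of dicts; only entries that
    contain an "eval_loss" key are treated as evaluation records.
    """
    # Keep (step, eval_loss) pairs; training entries have no "eval_loss" key.
    evals = [(e["step"], e["eval_loss"]) for e in log_history if "eval_loss" in e]
    if len(evals) < min_repeats:
        return None

    # Walk backwards from the last evaluation while the value is exactly equal.
    # Exact float equality is intentional: the symptom is "same to the last decimal".
    last_value = evals[-1][1]
    start = len(evals) - 1
    while start > 0 and evals[start - 1][1] == last_value:
        start -= 1

    run_length = len(evals) - start
    return evals[start][0] if run_length >= min_repeats else None


if __name__ == "__main__":
    # Made-up numbers for illustration only, not taken from the issue.
    history = [
        {"step": 100, "eval_loss": 1.9},
        {"step": 150, "loss": 1.5},
        {"step": 200, "eval_loss": 1.7},
        {"step": 300, "eval_loss": 1.6543},
        {"step": 400, "eval_loss": 1.6543},
        {"step": 500, "eval_loss": 1.6543},
    ]
    print(first_frozen_step(history))
```

If this returns a step number, the eval loss really is identical from that point on, which points toward the evaluation path (stale values, unchanged inputs, or weights not being picked up) rather than ordinary convergence.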

Contributor guide

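Research direction: Inspect the evaluation loop in the unsloth library to understand why the eval loss becomes constant after about 300 steps. Check whether the evaluation dataset is being cached or whether gradients are being disabled incorrectly. Look for problems with data shuffling or with fixed evaluation batches.
Tech stack: Python, PyTorch
Domain: machine learning, AI
Issue type: Bug
Difficulty: 3
Estimated time: half a day
Activity status: Active
Clarity: Mostly clear
Prerequisites: Python, PyTorch
Beginner friendliness: 40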
