Evaluation loss becomes constant · unslothai/unsloth#1067

(7 评论) (0 反应) (0 负责人)Python (5,658 fork)batch import

currently fixinggood first issue

仓库指标

Star: (64,271 star)
PR 合并指标: (平均合并 3天 15小时) (30 天内合并 525 个 PR)

描述

Hi!

I am testing out unsloth to fine tune llama 3.1 8B instruct and following your notebook here.

One exception is that I have added an eval set. What is really strange is that the eval loss locks up to a specific value after around 300 steps. I mean down to the last decimal, not just flattening out. The training loss looks fine and as expected.

I have changed many parameters and tried different things but it always happens. Any idea on what can cause this?

贡献者指南

研究方向: 检查 unsloth 库中的评估循环，理解为什么在约 300 步后评估损失变得恒定。检查评估数据集是否被缓存或梯度是否被错误地禁用。查找数据打乱或固定评估批次的问题。
技术栈: pythonpytorch
领域: machine learningai
议题类型: 缺陷
难度: 3
预计时间: 半天
活动状态: 活跃
清晰度: 基本清晰
前置要求: PythonPyTorch
新手友好度: 40

仓库指标

描述

贡献者指南

每天在邮箱收到新鲜 Easy issues。