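In a large-model fine-tuning task like the one in the screenshot, the loss dropped very low and converged before 10% of the training run was complete. Do I need to keep training, or can I just interrupt it and deploy? #208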
Reference: HswOAuth/llm_course#208
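I think stopping early is a reasonable option here. If you keep training well past the point of convergence, the model may even start to overfit.
Be careful, though: before ending training early, first confirm that the model has been saved at a suitable checkpoint. Then, in your deployment code, load that checkpoint together with the original pretrained model.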
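As a rough way to back up the "it has converged" judgement with numbers, here is a minimal sketch of a plateau check. It assumes you log a loss value at regular intervals (ideally on a held-out validation set, since a flat training loss alone does not rule out overfitting). The function name `has_plateaued`, the `window` and `min_delta` parameters, and the sample values are all made up for illustration and do not come from the thread.

```python
def has_plateaued(losses, window=3, min_delta=0.02):
    """Return True when the last `window` loss values fail to improve on
    the best earlier value by at least `min_delta`.

    `losses` is a list of floats in logging order (hypothetical input).
    """
    if len(losses) <= window:
        # Not enough history to compare recent values against earlier ones.
        return False
    best_before = min(losses[:-window])
    best_recent = min(losses[-window:])
    return best_before - best_recent < min_delta


# Hypothetical loss values, for illustration only.
example_losses = [2.10, 1.40, 0.95, 0.62, 0.61, 0.61, 0.62]
print(has_plateaued(example_losses))
```

If the check reports a plateau on validation loss over several consecutive evaluations, stopping and deploying from a saved checkpoint is usually a defensible choice. The threshold values are a judgement call and depend on the task.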
OK, thank you for your reply. During training, a checkpoint is saved every 50 steps.
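With a checkpoint saved every 50 steps, picking the one to deploy after an interrupted run can be sketched as below. This assumes the checkpoints are written as subdirectories named `checkpoint-<step>` inside one output directory, which is the naming used by the Hugging Face `Trainer`; the thread does not say which framework is in use, so adjust the pattern if yours differs. The helper name `latest_checkpoint` and its `max_step` parameter are hypothetical.

```python
import re
from pathlib import Path


def latest_checkpoint(output_dir, max_step=None):
    """Return the Path of the highest-step `checkpoint-<step>` directory
    in `output_dir`, or None if there is none.

    If `max_step` is given, checkpoints saved after that step are ignored,
    e.g. to pick a checkpoint from before overfitting set in.
    The `checkpoint-<step>` naming is an assumption, not from the thread.
    """
    pattern = re.compile(r"^checkpoint-(\d+)$")
    candidates = []
    for path in Path(output_dir).iterdir():
        match = pattern.match(path.name)
        if path.is_dir() and match:
            step = int(match.group(1))
            if max_step is None or step <= max_step:
                candidates.append((step, path))
    if not candidates:
        return None
    # Compare on the numeric step so checkpoint-100 ranks above checkpoint-50.
    return max(candidates, key=lambda item: item[0])[1]
```

For example, `latest_checkpoint("output", max_step=120)` would return the `checkpoint-100` directory if checkpoints exist at steps 50, 100, and 150. The returned path is what you would then load alongside the original pretrained model.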