[Help] 10/20 Introduction to Large Models: Fine-Tuning with LLaMA-Factory #242
Reference: HswOAuth/llm_course#242
Why do I first get an error right after I start fine-tuning?


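Then, during training, the gradient is always nan. I have tried lowering the learning rate, lowering max_grad_norm, and switching to fp32 precision, but the result is the same. How can I fix this?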
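Did you follow exactly the same steps as shown in class? Could you also tell us which environment you are using, for example a rented AutoDL instance or your own private server?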
It is a rented AutoDL instance, and I followed the teacher's steps one by one.
Please place LLaMA-Factory under the /root/ directory, that is, at /root/LLaMA-Factory, and then run the experiment again.
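
Before rerunning, a quick check like the one below may help confirm the repository is where the reply suggests. This is only a minimal sketch: the path /root/LLaMA-Factory comes from the reply above, while the helper names `check_repo_location` and `describe` are hypothetical and not part of LLaMA-Factory.

```python
from pathlib import Path

# Location suggested in the reply above.
EXPECTED_DIR = Path("/root/LLaMA-Factory")


def check_repo_location(expected: Path = EXPECTED_DIR) -> bool:
    """Return True if `expected` exists and is a directory (hypothetical helper)."""
    return expected.is_dir()


def describe(expected: Path = EXPECTED_DIR) -> str:
    """Build a short human-readable status message for the check."""
    if check_repo_location(expected):
        return f"OK: found {expected}"
    return f"Missing: {expected} is not a directory; move or re-clone the repository there."


if __name__ == "__main__":
    print(describe())
```

If the script reports the directory as missing, moving the existing checkout to /root/ and repeating the course steps from there would match the advice given.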