【求助帖】llama factory微调后,大模型乱回复 #285
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
4 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: HswOAuth/llm_course#285
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
微调时不显示损失曲线。微调前可以正常回复,微调后胡言乱语。
如图
我也是同样的问题,步骤都和文档一样的做,体现为这里的损失曲线没有,且纵坐标特别大,我看老师视频中是1.x,还有就是加载微调后的模型,回复胡言乱语。
请将LLaMA-Factory放置在/root/目录下,也就是:/root/LLaMA-Factory,然后再重复做一次实验
我的LLaMA-Factory就是在root下的,上午有老师评论让我录屏,不知道为什么评论删了,完整的录屏放在下面了(前1:50在克隆容器)
找到问题了,已经更新了课件,请按最新课件操作流程来:https://www.yuque.com/hkutangyu/di80sc/oy84gbs16y1ubzdd?singleDoc# 《基于LLaMA-Factory的模型微调训练》 密码:amos