工作中的大模型微调不准确问题--需要远程指导--林希老师 #629
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: HswOAuth/llm_course#629
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
用了公司的私有数据,是docx文档,里面每个标题下面都有具体说明,总共有13类内容。例如:1.名称:xxx,2.坐标:xxx,3.基本情况:xxx。可以转换为结构化的json格式,尝试之后转为alpaca格式,用deepseek-r1微调,效果很不好,之后又用glm4微调,效果好些,不过回答的也不准确并且回答太少了,之后微调了学习率等参数,回答的又太多了,还是不准确,需要老师远程指导一下。