经历了医疗数据的有监督SFT微调、奖励模型RM微调、PPO训练再到vllm部署,想知道如何证明最终微调结果回答的问题是否符合客观事实?太专业了看不懂医疗信息 #276
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: HswOAuth/llm_course#276
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
是否符合客观事实可能需要专业的人员去判断喔
这种判断医学结果是否符合客观事实的问题其实本质上还是判断大模型输出的结果是否正确的问题,针对这种回答内容是否正确进行评估和微调也是大模型发展中的一个重要问题。没有任何模型能够保证100%的准确性哈,这种持续的评估和改进的过程是必要的。