Distilling a model from DeepSeek using company data #545
Reference: HswOAuth/llm_course#545
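Teacher, I have deployed a DeepSeek R1-32b model through Ollama. I have some internal company documents and would like to use my own data to distill a smaller model from it. I don't really know how to do distillation or fine-tuning. Could you give me some ideas or suggestions?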
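First, Ollama itself does not support fine-tuning; it only serves models for inference. If you need to fine-tune, look into a fine-tuning framework such as LLaMA Factory and train the smaller model with SFT (supervised fine-tuning).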
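One common way to combine the two ideas is to let the model served by Ollama act as a teacher: it answers questions about your documents, and the answers become the SFT data for the small model. This is sequence-level distillation (training the student on teacher outputs), not logit-based distillation. Below is a minimal sketch of the data-collection step only, under some assumptions: Ollama is listening on its default local port, the teacher is tagged `deepseek-r1:32b` (check `ollama list`), the questions come from your own documents, and the function names and output file name are made up for illustration. The records use the Alpaca-style `instruction`/`input`/`output` fields that LLaMA Factory can read.

```python
import json
import re
import urllib.request
from typing import Callable, Dict, List

# Assumptions: default Ollama endpoint and a model tag that may differ on your machine.
OLLAMA_URL = "http://localhost:11434/api/generate"
TEACHER_MODEL = "deepseek-r1:32b"


def ask_ollama(prompt: str) -> str:
    """Send one prompt to the local Ollama server and return the full reply."""
    payload = json.dumps(
        {"model": TEACHER_MODEL, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))["response"]


def strip_think(text: str) -> str:
    """Drop the <think>...</think> reasoning block that R1-style models emit."""
    return re.sub(r"<think>.*?</think>", "", text, flags=re.DOTALL).strip()


def build_sft_records(
    questions: List[str], ask_teacher: Callable[[str], str]
) -> List[Dict[str, str]]:
    """Ask the teacher each question and keep non-empty answers as SFT records."""
    records = []
    for question in questions:
        answer = strip_think(ask_teacher(question))
        if answer:  # skip questions where the teacher produced no final answer
            records.append({"instruction": question, "input": "", "output": answer})
    return records


def save_records(records: List[Dict[str, str]], path: str) -> None:
    """Write the records as a JSON list, keeping non-ASCII text readable."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)


# Example usage (needs a running Ollama server; the question is hypothetical):
# records = build_sft_records(["How do I request annual leave?"], ask_ollama)
# save_records(records, "company_sft.json")
```

The teacher is passed in as a function so that the same code can be tried with a stub before spending GPU time on the 32b model. The resulting JSON file still has to be registered as a dataset in LLaMA Factory before SFT, and the answers are worth spot-checking by hand, since the student will copy the teacher's mistakes too.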