26-25.3.4-大语言模型训练篇:多机多卡微调 - 林希老师-执行报错 #606
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: HswOAuth/llm_course#606
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
执行到6.1.1步骤时,命令报错(见附件),请老师帮忙看下
我看到报错后,曾经尝试创建/userhome/xtuner-workdir1目录,修改命令中的--wordir缺失字符k(从文档直接copy-paste会丢失这个字母)等操作,但报错都仍然存在
试下这里的复制
cd /code/
NPROC_PER_NODE=1 xtuner train qwen1_5_0_5b_chat_full_alpaca_e3_copy.py --work-dir /userhome/xtuner-workdir1 --deepspeed deepspeed_zero3_offload