求助帖:26-25.3.4-大语言模型训练篇:多机多卡微调 - 林希老师——训练过程疑惑 #604

Closed
opened 2025-03-06 22:29:02 +08:00 by wangjo · 0 comments

如图:
1、为何一机一卡和一机四卡训练,steps总数不一样
2、为何一机四卡的训练显示的时间 比 一机一卡还要长,而且长那么多天。

如图: 1、为何一机一卡和一机四卡训练,steps总数不一样 2、为何一机四卡的训练显示的时间 比 一机一卡还要长,而且长那么多天。
wangjo reopened this issue 2025-03-06 23:09:14 +08:00
Sign in to join this conversation.
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: HswOAuth/llm_course#604
No description provided.