【求助】多机多卡模型调试 #398
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: HswOAuth/llm_course#398
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
老师,一直卡在这里不往下运行了,这是什么问题
显示TimeoutError: The client socket has timed out after 900s while trying to connect to (10.244.96.126, 22222).
两台notebook应该设置同一个IP地址,就是ADDR这个变量应该一样。
因为worker需要连接master,所以需要给定master的IP地址进行通信。
可以将任意notebook作为master
两台notebook设置成了同一个IP地址,但问题还是一样 @21970855250cs