11534018664cs
  • Joined on 2024-11-21
11534018664cs closed issue HswOAuth/llm_course#398 2024-11-27 11:51:37 +08:00
【求助】多机多卡模型调试
11534018664cs commented on issue HswOAuth/llm_course#398 2024-11-26 23:31:20 +08:00
【求助】多机多卡模型调试

两台notebook设置成了同一个IP地址,但问题还是一样 @21970855250cs

11534018664cs commented on issue HswOAuth/llm_course#398 2024-11-26 13:16:49 +08:00
【求助】多机多卡模型调试

显示TimeoutError: The client socket has timed out after 900s while trying to connect to (10.244.96.126, 22222).

11534018664cs opened issue HswOAuth/llm_course#398 2024-11-26 12:57:37 +08:00
【求助】多机多卡模型调试
11534018664cs commented on issue HswOAuth/llm_course#396 2024-11-26 12:41:17 +08:00
单机单卡训练问题

可是没有往下运行,没有出现案例附图上这样的训练 可以了,谢谢

11534018664cs commented on issue HswOAuth/llm_course#396 2024-11-25 22:34:44 +08:00
单机单卡训练问题

进行单机多卡训练时也是同样的问题,在命令行运行代码启动正常(单机单卡,单机多卡均正常)

11534018664cs opened issue HswOAuth/llm_course#396 2024-11-25 20:54:13 +08:00
单机单卡训练问题