13-24.9.5-私有化LLM仿OpenAI API接口的高可用工程实践 课后作业 #30

Open
opened 2024-09-06 23:35:11 +08:00 by 12422372205cs · 0 comments

环境:
windows宿主机
docker搭建ubuntu完成实验

Tip:
配置完OneAPI令牌后,NextChat需要重启聊天框才生效。

BUG:
--load_in_8bit 在高并发下会报错( output += torch.matmul(subA, state.subB RuntimeError: mat1 and mat2 shapes cannot be multiplied (1x2 and 1x1024))

作业完成

image
image

环境: windows宿主机 docker搭建ubuntu完成实验 Tip: 配置完OneAPI令牌后,NextChat需要重启聊天框才生效。 BUG: --load_in_8bit 在高并发下会报错( output += torch.matmul(subA, state.subB RuntimeError: mat1 and mat2 shapes cannot be multiplied (1x2 and 1x1024)) ## 作业完成 ![image](/attachments/4ebf1233-c549-4567-b5b4-9c41396eb535) ![image](/attachments/3b2a87e0-e17c-4f4f-85eb-7e6c4924a177)
Sign in to join this conversation.
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: HswOAuth/llm_course#30
No description provided.