[Help] How to speed up calls to the qianwen2.5 model #345
Reference: HswOAuth/llm_course#345
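I recently tried calling the qianwen2.5 70B model through its API. A single request takes 45 seconds to return a response. Is there any way to speed this up?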
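Are you calling Alibaba's API, or a model you deployed yourself?
If it is Alibaba's hosted model, the delay is probably a network issue.
If you deployed the model yourself, please share your hardware and deployment setup so we can take a look.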
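To help tell those two cases apart, here is a rough sketch (not tied to any particular SDK) that times a streamed response: how long until the first chunk arrives, and how long the whole reply takes. The function name `measure_stream_latency` and the fake stream in the demo are made up for illustration. In practice you would pass in whatever iterator of chunks your client returns when streaming is enabled. If most of the 45 seconds passes before the first chunk, the delay is likely network, queuing, or prompt processing. If the first chunk comes quickly and the rest is slow, generation speed is the more likely bottleneck.

```python
import time
from typing import Callable, Iterable, Tuple


def measure_stream_latency(
    stream: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[float, float, int]:
    """Consume a stream of chunks and return
    (seconds until first chunk, total seconds, number of chunks).

    `stream` is assumed to be lazy (e.g. a generator), so the request
    effectively starts when iteration begins.
    """
    start = clock()
    first_chunk_at = None
    chunks = 0
    for _ in stream:
        if first_chunk_at is None:
            # Record the delay before any output arrived.
            first_chunk_at = clock() - start
        chunks += 1
    total = clock() - start
    if first_chunk_at is None:
        # Nothing was streamed; report the full wait as the first-chunk delay.
        first_chunk_at = total
    return first_chunk_at, total, chunks


def fake_stream(initial_delay: float, per_chunk_delay: float, n: int):
    """Hypothetical stand-in for a streaming API response."""
    time.sleep(initial_delay)
    for i in range(n):
        yield f"chunk-{i}"
        time.sleep(per_chunk_delay)


if __name__ == "__main__":
    ttfc, total, n = measure_stream_latency(fake_stream(0.2, 0.05, 5))
    print(f"first chunk after {ttfc:.2f}s, total {total:.2f}s, {n} chunks")
```

Posting these two numbers, along with the prompt length and output length, would make it much easier to see where the time is going.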