05-私有化LLM仿OpenAI API接口的高可用工程实践(跟练) #32
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: HswOAuth/llm_course#32
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
本地服务配置:#24 (comment)
step1: 1.启动gpu服务,端口号19300;2.启动cpu服务,端口号19333








python openai_api_server.py --gpus 0,1 --port 19300 --base_model /mnt/wksp/agi/models/ChineseAlpacaGroup/llama-3-chinese-8b-instruct-v3/
python openai_api_server.py --only_cpu --port 19333 --base_model /mnt/wksp/agi/models/ChineseAlpacaGroup/llama-3-chinese-8b-instruct-v3/
step2: 启动OneAPI
step3: OneAPI添加渠道
step4: OneAPI设置令牌
step5: nextchat配置
step6: 对话测试(gpu,cpu都可用)
gpu、cpu渠道都可用时,由OneAPI负责调度决定具体使用哪一个进行响应。
cpu响应
gpu响应
step7: 对话测试(仅cpu可用)
私有化LLM仿OpenAI API接口的高可用工程实践(跟练)to 05-私有化LLM仿OpenAI API接口的高可用工程实践(跟练)