【求助帖】我目前想要做个UI自动化的工具,需要识别页面上面所有可能按钮,并且获取到这些按钮的坐标,有哪些开源模型比较适合?要怎么微调? #734

Open
opened 2025-05-22 19:00:01 +08:00 by guoyue · 1 comment
No description provided.

OpenCV + Tesseract
可以考虑结合OpenCV和Tesseract进行按钮识别:
利用OpenCV的模板匹配或者轮廓检测等方法,结合Tesseract OCR对按钮文本进行识别。

OpenCV + Tesseract 可以考虑结合OpenCV和Tesseract进行按钮识别: 利用OpenCV的模板匹配或者轮廓检测等方法,结合Tesseract OCR对按钮文本进行识别。
Sign in to join this conversation.
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: HswOAuth/llm_course#734
No description provided.