AGI-2404期-智能审稿机器人--多模态数据处理?-林希老师 #738
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: HswOAuth/llm_course#738
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
如果目标稿件PDF中,含有较多的图片以及类似柱状数据图/数据表格等,这类数据如何处理呢? 是否有较为成熟的方案可供参考
AGI-2404期-智能审稿机器人--课程代码在哪里?-林希老师to AGI-2404期-智能审稿机器人--多模态数据处理?-林希老师如果是高频次处理这类复杂PDF文件的需求,可以考虑使用一些商业API
Docparser:一个基于云的API,能够提取PDF中的数据表格、文本、图表等内容。
Tabula:一个开源工具,专门用于从PDF中提取表格数据。