Course Session 6: Public Opinion Analysis System with Real-Time Web Scraping + RAG (Instructor Lin Hui) #774
Reference: HswOAuth/llm_course#774
Teacher, I found that https://weibo.com/ajax/side/hotSearch can no longer be scraped.
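Weibo has blocked that endpoint. Instead, use XPath to scrape the Weibo hot search list from https://s.weibo.com/top/summary?cate=realtimehot. This GitHub repository has scraping code you can use as a reference: https://github.com/legeling/weibo_hotSearch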
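For anyone who wants a starting point before reading that repository, here is a rough sketch of the XPath approach. It is not the code from the linked repository. It assumes `lxml` is installed and that each hot search entry on the page is a link inside a `td` cell with class `td-02`; that selector is an assumption about the current page layout and may need adjusting. The function names `parse_hot_search` and `fetch_hot_search` are made up for illustration, and the page may also require a valid login `Cookie` header, which you would have to supply yourself.

```python
import urllib.request
from urllib.parse import urljoin

from lxml import html  # third-party: pip install lxml

HOT_URL = "https://s.weibo.com/top/summary?cate=realtimehot"


def parse_hot_search(page_source):
    """Extract hot search titles and links from the page HTML using XPath."""
    tree = html.fromstring(page_source)
    items = []
    # Assumption: each entry is an <a> inside <td class="td-02">.
    for link in tree.xpath('//td[contains(@class, "td-02")]/a'):
        title = link.text_content().strip()
        href = link.get("href", "")
        # Skip empty rows and placeholder links that do not point to a search page.
        if not title or not href or href.startswith("javascript"):
            continue
        items.append({"title": title, "url": urljoin(HOT_URL, href)})
    return items


def fetch_hot_search(cookie=""):
    """Download the hot search page and parse it.

    `cookie` is optional; pass your own Cookie string if the site
    redirects anonymous requests to a login page (an assumption).
    """
    headers = {"User-Agent": "Mozilla/5.0"}
    if cookie:
        headers["Cookie"] = cookie
    request = urllib.request.Request(HOT_URL, headers=headers)
    with urllib.request.urlopen(request, timeout=10) as response:
        page_source = response.read().decode("utf-8", errors="replace")
    return parse_hot_search(page_source)
```

Calling `fetch_hot_search()` returns a list of dictionaries with `title` and `url` keys. If the list comes back empty, the request was probably redirected to a login page or the table markup has changed, so check the raw HTML before adjusting the XPath expression.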