Skip to content

rhzxg/MicroblogCrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MicroblogCrawler

微博热榜爬虫

Attention & Disclaimer: Due to the specific license(GPLv3) this repository is using, the usage of this repository is only accepted in mankind-benefiting and research-related studies. Behaviors while using, including but not limited to, idolizing celebrities, profiting in any kind, violating Microblog terms of use, and so on are strictly prohibited! Users are solely responsible for any legal liabilities incurred. Please be mindful of your behavior while using it to ensure compliance with this term.

注意事项&免责声明:由于GPLv3许可证协议的诸多特殊性,本项目的使用范围仅被限制在造福人类的学术型研究中。包括但不限于追星、盈利、违反微博使用条款等其他行为将被严厉禁止!用户将会承担所有法律后果,使用时务必注意遵守此条款。

Step1:

pip install requirements.txt

Step2:

Make sure Edge and Excel are installed properly on your computer.

Make sure the network connection is fine.

BTW: Uninstall WPS software kit to avoid strange errors.

Step3:

Run main.py.

Step4:

Input the correct Microblog URL and wait.

# correct url example:
https://s.weibo.com/weibo?q=xxx

Crawled:

Crawled example image