v0.1
This commit is contained in:
parent
9009c3b572
commit
4d8b582df2
14
README.md
14
README.md
@ -2,8 +2,14 @@
|
|||||||
批量导出任意公众号历史文章
|
批量导出任意公众号历史文章
|
||||||
|
|
||||||
主要思路参考这几篇文章
|
主要思路参考这几篇文章
|
||||||
[一步步教你打造文章爬虫(1)-综述](https://mp.weixin.qq.com/s/tm4ypSllpb7MhjrlMXmNDA)
|
[一步步教你打造文章爬虫(1)-综述](https://mp.weixin.qq.com/s?__biz=MzAxMDM4MTA2MA==&mid=2455304602&idx=1&sn=4beadc781c44c17cb4451b579d077c45&chksm=8cfd6bf1bb8ae2e7d5a9f1a66696dd12e260ac7919c7bebe317af81e90bd25591ba286da1f0f&token=2137480545&lang=zh_CN#rd)
|
||||||
[一步步教你打造文章爬虫(2)-下载网页](https://mp.weixin.qq.com/s/YoUYJ7iokJcARkL2hYxwXw)
|
[一步步教你打造文章爬虫(2)-下载网页](https://mp.weixin.qq.com/s?__biz=MzAxMDM4MTA2MA==&mid=2455304609&idx=1&sn=b7496563aab42e92060bd68936bc4212&chksm=8cfd6bcabb8ae2dc606b060fecf3f837177e3ef22a05a30ee28ebefd75c6677b29df3e426692&token=2137480545&lang=zh_CN#rd)
|
||||||
特别是要注意第3篇
|
特别要仔细看第3篇
|
||||||
|
[一步步教你打造文章爬虫(3)-批量下载
|
||||||
|
](https://mp.weixin.qq.com/s?__biz=MzAxMDM4MTA2MA==&mid=2455304632&idx=1&sn=d0a1f6ef7e5d4356d17219a2b79f65d4&chksm=8cfd6bd3bb8ae2c532f901e11aa4b080c19f16626f0dceb291fcb8270e2d7689d7b97d232683&token=2137480545&lang=zh_CN#rd)
|
||||||
|
|
||||||
|
QQ交流群 703431832 ,加群暗号"不止技术流"
|
||||||
|
|
||||||
|
本项目仅用于技术学习交流,请勿用于非法用途,由此引起的后果本作者概不负责。
|
||||||
|
|
||||||
|
|
||||||
需要搭配Fiddler使用
|
|
||||||
|
|||||||
@ -128,6 +128,8 @@ def GetArticleList(jsondir):
|
|||||||
ArtList.append(art)
|
ArtList.append(art)
|
||||||
print(len(ArtList),pubdate, idx, title)
|
print(len(ArtList),pubdate, idx, title)
|
||||||
return ArtList
|
return ArtList
|
||||||
|
|
||||||
|
|
||||||
if __name__ == "__main__":
|
if __name__ == "__main__":
|
||||||
dir = "C:/vWeChatFiles/rawlist/Dump-0805-15-00-45" #改成你自己的文件夹地址
|
dir = "C:/vWeChatFiles/rawlist/Dump-0805-15-00-45" #改成你自己的文件夹地址
|
||||||
saveHtmlDir = "c:/vWeChatFiles/html/" #改成你自己的保存目录,如果没有要新建
|
saveHtmlDir = "c:/vWeChatFiles/html/" #改成你自己的保存目录,如果没有要新建
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user