百度文库下载(用别人的爬虫)

Posted 2020-11-20 pandaboy1123

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了百度文库下载(用别人的爬虫)相关的知识，希望对你有一定的参考价值。

参考博客：https://blog.csdn.net/qq_17054989/article/details/80168214

参考网址：http://39.108.149.27:9999

import requests

header = {‘Accept‘: ‘text/plain, */*; q=0.01‘,  # 模拟浏览器头信息
          ‘Accept-Encoding‘: ‘gzip, deflate‘,
          ‘Accept-Language‘: ‘zh-CN,zh;q=0.9‘,
          ‘Connection‘: ‘keep-alive‘,
          ‘Content-Length‘: ‘135‘,
          ‘Content-Type‘: ‘application/x-www-form-urlencoded‘,
          ‘Host‘: ‘39.108.149.27:9999‘,
          ‘Origin‘: ‘http://39.108.149.27:9999‘,
          ‘Referer‘: ‘http://39.108.149.27:9999/‘,
          ‘User-Agent‘: ‘Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (Khtml, like Gecko) Chrome/66.0.3359.139 Safari/537.36‘,
          ‘X-Requested-With‘: ‘XMLHttpRequest‘
          }
wenku = input(‘请输入您要下载文档的地址：‘)
# 真实网址
url = ‘http://39.108.149.27:9999/default.aspx‘
# 提交数据
datas = {
    ‘username‘: ‘‘,
    ‘password‘: ‘‘,
    ‘txtUrl‘: ‘{}‘.format(wenku),
    ‘mail‘: ‘[email protected]‘#在这里输入你自己的邮箱哦亲
}
html = requests.request("POST", url, data=datas, headers=header)
print(html.text)

View Code

以上是关于百度文库下载(用别人的爬虫)的主要内容，如果未能解决你的问题，请参考以下文章

python+selenium爬取百度文库不能下载的word文档

Python3实现QQ机器人自动爬取百度文库的搜索结果并发送给好友（主要是爬虫）

百度文库免下载券下载方法

百度文库等免费下载方法！

拒绝某度文库，跟我一起用Python写个下载器

好用的百度文库下载工具：易读