爬取汽车之家
Posted zhanglin123
tags:
篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了爬取汽车之家相关的知识,希望对你有一定的参考价值。
import requests from bs4 import BeautifulSoup response = requests.get(‘https://www.autohome.com.cn/news/‘) response.encoding = ‘gbk‘ soup = BeautifulSoup(response.text,"html.parser") div =soup.find(name=‘div‘,id=‘auto-channel-lazyload-article‘) li_list = div.find_all(name=‘li‘) for li in li_list: h3 = li.find(name=‘h3‘) a = li.find(name=‘a‘) p =li.find(name=‘p‘) img = li.find(name=‘img‘) if not h3: continue print(h3.text) print(a.attrs[‘href‘]) print(p.text) img_url = ‘https:‘+ img.attrs[‘src‘] img_response = requests.get(img_url) file_name = img_url.rsplit(‘/‘,maxsplit=1)[1] with open(file_name,‘wb‘) as f: f.write(img_url.content) print(‘======================‘)
以上是关于爬取汽车之家的主要内容,如果未能解决你的问题,请参考以下文章