Scraping second-hand housing listings with Python

Posted by 道法自然



import csv
import time

import parsel
import requests

# Send a browser-like User-Agent with every request.
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36'
}

# CSV columns: title, location (stored under '开发商'), house details,
# follow/listing info, total price (in 万, i.e. 10,000 CNY), unit price.
fieldnames = ['标题', '开发商', '房子信息', '发布周期', '售价/万', '单价']

# mode='a' appends, so re-running the script writes another header row
# into an existing file.
with open('二手房信息.csv', mode='a', encoding='utf-8-sig', newline='') as f:
    csv_writer = csv.DictWriter(f, fieldnames=fieldnames)
    csv_writer.writeheader()

    for page in range(1, 101):
        print('=== Downloading page {} ==='.format(page))
        time.sleep(1)  # pause between page requests
        url = 'https://cs.lianjia.com/ershoufang/pg{}/'.format(page)

        response = requests.get(url=url, headers=headers)
        selector = parsel.Selector(response.text)
        lis = selector.css('.sellListContent li')
        for li in lis:
            dit = {}  # one fresh dict per listing
            title = li.css('.title a::text').get()
            dit['标题'] = title
            positionInfo = li.css('.positionInfo a::text').getall()
            info = '-'.join(positionInfo)
            dit['开发商'] = info
            houseInfo = li.css('.houseInfo::text').get()
            dit['房子信息'] = houseInfo
            followInfo = li.css('.followInfo::text').get()
            dit['发布周期'] = followInfo
            Price = li.css('.totalPrice span::text').get()
            dit['售价/万'] = Price
            unitPrice = li.css('.unitPrice span::text').get()
            dit['单价'] = unitPrice
            csv_writer.writerow(dit)
            print(dit)
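
In the script above, `.get()` returns `None` whenever a selector matches nothing, so any `li` that is not a real listing would be written to the CSV as a nearly empty row. Below is a minimal sketch of one way to filter and tidy rows before writing them. `clean_row` is a hypothetical helper that is not part of the original script, and the sample rows are made-up values used only for illustration, not data from the site.

```python
import csv
import io

# Same column names as the script above.
FIELDNAMES = ['标题', '开发商', '房子信息', '发布周期', '售价/万', '单价']


def clean_row(row):
    """Return a cleaned copy of a scraped row, or None if it has no title.

    Hypothetical helper: missing or None fields become empty strings,
    and surrounding whitespace is stripped from string fields.
    """
    title = row.get('标题')
    if not isinstance(title, str) or not title.strip():
        return None  # no title: treat the <li> as a non-listing and skip it
    cleaned = {}
    for name in FIELDNAMES:
        value = row.get(name)
        cleaned[name] = value.strip() if isinstance(value, str) else ''
    return cleaned


# Made-up sample rows: one usable listing and one empty placeholder.
sample_rows = [
    {'标题': ' Example listing ', '开发商': 'A-B', '售价/万': '120', '单价': None},
    {'标题': None, '开发商': '', '售价/万': None, '单价': None},
]

# Write to an in-memory buffer here; the real script would use the CSV file.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=FIELDNAMES)
writer.writeheader()
for row in sample_rows:
    cleaned = clean_row(row)
    if cleaned is not None:
        writer.writerow(cleaned)
print(buffer.getvalue())
```

In the scraping loop, this would mean calling `clean_row(dit)` and writing the result only when it is not `None`.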

 
