爬取去哪网景点数据

Posted persistence-ok

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了爬取去哪网景点数据相关的知识,希望对你有一定的参考价值。

import urllib.parse
import urllib.request
import requests
from bs4 import BeautifulSoup
import csv
import time
import re

sd=[名字,地址,价格,月销量,景点概述]
with open(C:\\Users\\惠普\\Desktop\\ac2.csv,a+,newline=‘‘,encoding=utf-8)as f:
       writers=csv.writer(f)
       writers.writerow(sd)

header=User-Agent:Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (Khtml, like Gecko) Chrome/55.0.2883.87 Safari/537.36,
X-Requested-With:XMLHttpRequest
base=&subject=&sku=
for i in range(1,80):
   url=http://piao.qunar.com/ticket/list.htm?keyword=%E7%83%AD%E9%97%A8%E6%99%AF%E7%82%B9&region=&from=mpl_search_suggest&page=.format(i)
   url=url+base
   response=requests.get(url,headers=header)
   soup=BeautifulSoup(response.text)
   name=soup.find_all(h3,class:sight_item_caption)
   address=soup.find_all(p,class:address color999)
   price=soup.find_all(span,class:sight_item_price)
   xiaoliang=soup.find_all(span,class:hot_num)
   note=soup.find_all(div,class:intro color999)
   with open(C:\\Users\\惠普\\Desktop\\ac2.csv,a+,newline=‘‘,encoding=utf-8)as f:
       writers=csv.writer(f)
       for i in range(len(name)):
           listw=[]
           c=price[i].text.replace(¥,‘‘).replace(,‘‘)
           print(c)
           if c==免费:
               listw=[name[i].text,address[i].text.replace(地址:,‘‘),0,0,note[i].text]
               writers.writerow(listw)
               
           else:
               listw=[name[i].text,address[i].text.replace(地址:,‘‘),c,xiaoliang[i].text,note[i].text]
               writers.writerow(listw)

 

以上是关于爬取去哪网景点数据的主要内容,如果未能解决你的问题,请参考以下文章

2021年国庆你的朋友去哪浪了?让Python告诉你!

Python爬虫开源项目代码,爬取微信淘宝豆瓣知乎新浪微博QQ去哪网等 代码整理

用Python爬取了全国近5000家旅游景点,分析元旦去哪玩

去哪网实习总结:如何配置数据库连接(JavaWeb)

项目- Vue全家桶实战去哪网App

去哪网实习总结:开发定时任务(JavaWeb)