When trying to parse Amazon products, my requests are sometimes redirected to another page

Question

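I am trying to parse Amazon product pages. About half the time my code runs fine and returns the information; the other half, my request is redirected to an Amazon page that appears designed to fend off malicious requests. When I try to return the URL of the page, I get my original input URL rather than one of Amazon's pages. From what I have read, sending headers should fix this, but again it only works for about half of the requests, which is really strange. Is there any way to make sure I always get the real response?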

Here is the code:

    import requests
    from bs4 import BeautifulSoup as soup

    # constants
    url = "https://www.amazon.com/Zephyrus-GeForce-i7-9750H-Windows-GX531GW-AB76/dp/B07QN3683G/ref=sr_1_12?dchild=1&keywords=zephyrus+g15&qid=1586732721&sr=8-12"

    # Amazon data class
    class items:

        def __init__(self, url):
            self.url = url

        # parses page and returns info
        def data(self):
            headers = {"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.163 Safari/537.36"}
            # default to None so the return below still works if the request fails
            price = None
            name = None
            # get html response
            try:
                html = requests.get(self.url, headers=headers).content
            except Exception:
                print("Could not retrieve page")
            else:
                # parse the page
                pagesoup = soup(html, "html5lib")
                # get price, name
                try:
                    price = pagesoup.find("span", id="priceblock_ourprice").get_text().strip()
                except Exception:
                    print("Price could not be extracted")
                try:
                    name = pagesoup.find("span", id="productTitle").get_text().strip()
                except Exception:
                    print("Product name could not be extracted")
            return price, name

    # test
    item_1 = items(url)
    print(item_1.data())

Answer 1
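
Because the response URL is the same as the requested one, the block page seems to be served in place of the product page rather than through an HTTP redirect, so checking the URL will not reveal it. One possible approach, sketched below under assumptions, is to inspect the HTML itself and retry after a pause when it looks like the block page. The marker phrases in `BLOCK_MARKERS`, the helper names `looks_blocked`, `fetch_with_retry` and `fetch_with_requests`, and the extra `Accept-Language` header are all illustrative choices that do not come from the question. Nothing here guarantees a real response; it only makes the failure detectable and retryable.

```python
import time
from typing import Callable, Optional

# Assumed markers: phrases that may appear on the block page.
# They are illustrative only; adjust them to what the blocked HTML contains.
BLOCK_MARKERS = (
    "api-services-support@amazon.com",
    "enter the characters you see below",
)


def looks_blocked(html: str) -> bool:
    """Return True if the HTML contains any of the assumed block-page markers."""
    lowered = html.lower()
    return any(marker in lowered for marker in BLOCK_MARKERS)


def fetch_with_retry(
    url: str,
    fetch: Callable[[str], str],
    max_attempts: int = 3,
    delay_seconds: float = 2.0,
) -> Optional[str]:
    """Call fetch(url) until the result does not look like a block page.

    Returns the HTML on success, or None if every attempt looked blocked.
    """
    for attempt in range(1, max_attempts + 1):
        html = fetch(url)
        if not looks_blocked(html):
            return html
        if attempt < max_attempts:
            # Wait a little longer after each failed attempt.
            time.sleep(delay_seconds * attempt)
    return None


def fetch_with_requests(url: str) -> str:
    """Fetch a page with requests, sending browser-like headers."""
    import requests  # imported here so the helpers above work without it

    headers = {
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
                      "AppleWebKit/537.36 (KHTML, like Gecko) "
                      "Chrome/80.0.3987.163 Safari/537.36",
        # Assumption: an extra browser-like header; not from the question.
        "Accept-Language": "en-US,en;q=0.9",
    }
    response = requests.get(url, headers=headers, timeout=10)
    return response.text
```

In the question's `data` method, the `requests.get(...)` call could be replaced with `html = fetch_with_retry(self.url, fetch_with_requests)`, treating a `None` result the same way as "Could not retrieve page". Passing the fetch function in as an argument keeps the detection and retry logic testable without touching the network.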

Another answer