如何用python和漂亮的汤从html代码中提取一个小时

Posted 2023-05-07

技术标签:

【中文标题】如何用python和漂亮的汤从html代码中提取一个小时【英文标题】：How to extract an hour from html code with python and beautiful soup 【发布时间】：2020-04-05 23:10:36 【问题描述】：

我对 Python 和美丽的汤有点陌生。任何人都可以帮助并回答我如何从这个 html 代码中提取一个小时？

<a class="hour-link fancybox-reservation" href="/47,Lodz/Seans/info/seans/CC527207-4B9C-45CD-812F-3501A647E1B3/dzien/146231/film/16892">12:20</a>

输出应该是：12:20

提前感谢您的所有回答！

【问题讨论】：

这能回答你的问题吗？ BeautifulSoup innerhtml? 【参考方案1】：

你可以试试：

>>> from bs4 import BeautifulSoup as bs

>>> data = """<a class="hour-link fancybox-reservation" href="/47,Lodz/Seans/info/seans/CC527207-4B9C-45CD-812F-3501A647E1B3/dzien/146231/film/16892">12:20</a>"""

>>> soup = bs(StringIO(data))
>>> a_tag = soup.find_all('a')

>>> a_tag[0]
<a class="hour-link fancybox-reservation" href="/47,Lodz/Seans/info/seans/CC527207-4B9C-45CD-812F-3501A647E1B3/dzien/146231/film/16892">12:20</a>

>>> a_tag[0].text
'12:20'

【讨论】：

【参考方案2】：

查看Soup documentation 并尝试先自己制定答案。我建议您查看 find_all('a') 和 .text 功能作为您的示例。

【讨论】：

以上是关于如何用python和漂亮的汤从html代码中提取一个小时的主要内容，如果未能解决你的问题，请参考以下文章