替换Python字符串中的自定义“HTML”标记

Question

我希望能够在字符串中包含自定义“HTML”标记，例如："This is a <photo id="4" /> string"。

在这种情况下，自定义标记是<photo id="4" />。如果它更容易，即[photo id:4]或其他什么，我也可以改变这个自定义标签以改写。

我希望能够将此字符串传递给将提取标签<photo id="4" />的函数，并允许我将其转换为更复杂的模板，如<div class="photo"><img src="...." alt="..."></div>，然后我可以使用它来替换原始字符串中的标记。

我正在成像这样的工作：

>>> content = "This is a <photo id="4" /> string"
# Pass the string to a function that returns all the tags with the given name.
>>> tags = parse_tags('photo', string)
>>> print(tags)
[{'tag': 'photo', 'id': 4, 'raw': '<photo id="4" />'}]
# Now that I know I need to render a photo with ID 4, so I can pass that to some sort of template thing
>>> rendered = render_photo(id=tags[0]['id'])
>>> print(rendered)
<div class="photo"><img src="...." alt="..."></div>
>>> content = content.replace(tags[0]['raw'], rendered)
>>> print(content)
This is a <div class="photo"><img src="...." alt="..."></div> string

我认为这是一个相当常见的模式，比如把照片放在博客文章中，所以我想知道是否有一个库可以做类似于上面的示例parse_tags函数。或者我需要写它吗？

这个照片标签的例子只是一个例子。我想要有不同名称的标签。作为一个不同的例子，也许我有一个人的数据库，我想要一个像<person name="John Doe" />的标签。在那种情况下，我想要的输出就像{'tag': 'person', 'name': 'John Doe', 'raw': '<person name="John Doe" />'}。然后我可以使用该名称查看该人并返回该人的vcard或其他东西的渲染模板。

Answer 1

另一答案