libxml的使用--xpath搜索节点树

Posted 2020-09-16 fire909090

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了libxml的使用--xpath搜索节点树相关的知识，希望对你有一定的参考价值。

在libxml的tutorial中介绍了一种用关键字查找节点的方法，这种方法将使用打xpath系列API。由于我才刚刚接触libxml，所以我对xpath的认识也仅仅是在tutorial提供的功能之内了。废话少说，直接进入整体。

我们在操作xml文件是经常需要根据特定的条件查找一系列的节点，为了实现这样的功能，我们需要一个xmlXPathContextPtr和一个expression。我们调用xmlXPathEvalExpression函数来得到一个xmlXPathObjectPtr指针，这个指针包含了一个xmlNodeSetPtr，其中有一个变量nodeTab是我们所需要的节点数组。

[cpp] view plain copy print ?

xmlXPathObjectPtr ret = NULL;
xmlXPathContextPtr con = NULL;
con = xmlXPathNewContext(doc);
ret = xmlXPathEvalExpression((xmlChar*)expr, con);
xmlXPathFreeContext(con);

这样我们就得到了查询的结果了。expr是查询的条件，tutorial给的例子里，这个条件是“//keyword”，表示找出所有名称为keyword的节点。至于其他的条件，我现在还不知道。

得到了查询的结果，我们就要对结果进行处理。

[cpp] view plain copy print ?

if(NULL == ret) {
fprintf(stderr, "eval func error\n");
exit(1);
}
if(xmlXPathNodeSetIsEmpty(ret->nodesetval)){
fprintf(stderr, "node set empty\n");
xmlXPathFreeObject(ret);
exit(1);
}
xmlNodeSetPtr nodeset = ret->nodesetval;
int i;
for(i = 0; i < nodeset->nodeNr; i ++) {
//handle the node
}
xmlXPathFreeObject(ret);

下面是一个程序的实例。用于提取出网页中的链接：

[html] view plain copy print ?

<html>
<head>
<title>web</title>
</head>
<body>
<a href="www.baidu.com">baidu</a>
<a href="www.google.com">Google</a>
</body>
</html>

link.c

[cpp] view plain copy print ?

输出结果为：

[html] view plain copy print ?

link address:www.baidu.com
link address:www.google.com

以上是关于libxml的使用--xpath搜索节点树的主要内容，如果未能解决你的问题，请参考以下文章

[libxml2]_[XML处理]_[使用libxml2的xpath特性修改xml文件内容]

名称空间和 xpath 的 libxml2 错误

XPath 搜索所有文本节点，而不是任何其他子节点的内部文本

页面元素定位及操作--xpath

元素定位XPath 简单操作分享