How do I extract the offset of a WordNet synset given a synset in Python NLTK?

Posted 2023-02-24

【Title】: How do I extract the offset of a WordNet synset given a synset in Python NLTK? 【Posted】: 2015-09-22 06:08:46 【Question】:

A sense offset in WordNet is an 8-digit number followed by a POS tag. For example, the offset for the synset 'dog.n.01' is '02084071-n'. I tried the following code:

    from nltk.corpus import wordnet as wn

    ss = wn.synset('dog.n.01')
    offset = str(ss.offset)
    print (offset)

However, I get this output:

    <bound method Synset.offset of Synset('dog.n.01')>

How do I get the actual offset in this format: '02084071-n'?

【Comments】:

You can work out the answer from here and here with a bit of experimentation. 【Answer 1】:

>>> from nltk.corpus import wordnet as wn
>>> ss = wn.synset('dog.n.01')
>>> offset = str(ss.offset()).zfill(8) + '-' + ss.pos()
>>> offset
u'02084071-n'
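
The attempt in the question printed a bound method because `offset` is a method on the synset in this NLTK version and has to be called. The same formatting can be wrapped in a small helper, sketched below. The function name `synset_offset_id` is hypothetical (it is not part of NLTK), and the sketch assumes the synset object exposes `offset()` and `pos()` as methods, as in the session above.

```python
def synset_offset_id(synset):
    """Build an ID such as '02084071-n' from a WordNet synset.

    Assumption: `synset` has offset() and pos() methods, as NLTK's
    Synset does in the session shown above. This helper is illustrative
    and not part of NLTK itself.
    """
    # offset() returns an integer; pad it with leading zeros to 8 digits.
    padded = str(synset.offset()).zfill(8)
    # Append the part-of-speech tag after a hyphen.
    return padded + '-' + synset.pos()


# Usage with NLTK (requires the WordNet corpus to be installed):
#   from nltk.corpus import wordnet as wn
#   synset_offset_id(wn.synset('dog.n.01'))
```

Keeping the zero-padding in one place avoids forgetting it for synsets whose offsets have fewer than 8 digits.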

【Comments】:

Thanks! Please take a look at my other question: ***.com/questions/31234168/… @alvas
