Python Natural Language Processing: Character Frequency Counting

Posted 2023-01-20

Foreword: This article was compiled by the editors of 小常识网 (cha138.com). It shows how to count character frequencies in a Chinese text with Python.

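import collections

import jieba  # imported in the original; not needed for the character-level count below

# Read the whole file into one string (the file is GB18030-encoded).
with open("lg.txt", "r", encoding="gb18030") as f:
    txt = f.read()

txt1 = txt
txt1 = txt1.replace("\n", "")  # remove newlines
txt1 = txt1.replace("，", "")  # remove commas
txt1 = txt1.replace("。", "")  # remove full stops
mylist = list(txt1)  # split the string into single characters
mycount = collections.Counter(mylist)  # count occurrences of each character
for key, val in mycount.most_common(10):  # sorted by count; returns the top 10
    print(key, val)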
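
The script above removes only three characters (the newline, the comma and the full stop), so other marks such as ！ or ？ would still be counted. The following is a minimal sketch of a more general filter, assuming the goal is to drop all Unicode whitespace and punctuation before counting. The function name top_characters and the sample string are illustrative and do not come from the original post.

```python
import collections
import unicodedata


def top_characters(text, n=10):
    """Return the n most common characters, skipping whitespace and punctuation."""
    kept = [
        ch
        for ch in text
        # Unicode general categories starting with "P" are punctuation.
        if not ch.isspace() and not unicodedata.category(ch).startswith("P")
    ]
    return collections.Counter(kept).most_common(n)


# Hypothetical sample text, used here in place of lg.txt.
sample = "好好学习，天天向上。\n好好工作！"
for ch, count in top_characters(sample, 2):
    print(ch, count)
```

To run it on the file from the script above, pass the string read from lg.txt instead of sample.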
