A Complete English Word-Frequency Counting Example Using a File
Posted 109-梁浩辉
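1. Read in the string to be analyzed
2. Split the text and extract the words
3. Count the words with a dictionary
4. Exclude grammatical words (stop words)
5. Sort by count
6. Output the TOP 20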
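```python
# Write the sample lyrics to a file so they can be read back for analysis.
fo = open('dead romance.txt', 'w')
fo.write('''in a rainy night can you hear me
in a rainy night can you help me
man,what are you thinking of
man,what do you need
man,nobody tell you what to do
man ,you need somebody to hurt
in a rainy night can you hear me
in a rainy night can you help me
man,you feel so lonely
man,can you hear the message come from the sky
man,you are driving into the rain
man,you know it's time to find the prey
in a rainy night can you hear me
in a rainy night can you help me''')
fo.close()

# 1. Read in the string to be analyzed.
fo = open('dead romance.txt', 'r')
A = fo.read()
fo.close()

# Grammatical words to exclude; '' drops the empty strings left by split(" ").
exc = {'the', 'and', 'to', 'of', 'in', 'a', 'for', 'with', ''}

# 2. Replace punctuation and newlines with spaces, lowercase, and split into words.
for i in ',.?!\n"':
    A = A.replace(i, ' ')
A = A.lower()
A = A.split(" ")

# 3 and 4. Build the counting dictionary, skipping the excluded words.
dic = {}
keys = set(A)  # the set of words that occur; these become the dictionary keys
keys = keys - exc
for i in keys:
    dic[i] = A.count(i)

# 5. Sort by count, in descending order.
w = list(dic.items())
w.sort(key=lambda x: x[1], reverse=True)

# 6. Print the top 20; slicing avoids an IndexError when there are fewer than 20 words.
for item in w[:20]:
    print(item)
```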
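The same six steps can also be sketched with `collections.Counter` and `re` from the standard library. This is an alternative, not the method used in the listing above. The function name `top_words` and the constant `STOP_WORDS` are hypothetical names introduced for illustration, and the regular expression tokenizes slightly differently from the original: it keeps only runs of letters and apostrophes, so digits and any other punctuation are dropped.

```python
import re
from collections import Counter

# Assumed stop-word set, copied from the listing above (hypothetical name).
STOP_WORDS = {'the', 'and', 'to', 'of', 'in', 'a', 'for', 'with'}


def top_words(text, n=20, stop_words=STOP_WORDS):
    """Return up to n (word, count) pairs, most frequent first."""
    # Lowercase the text, then extract runs of letters and apostrophes as words.
    words = re.findall(r"[a-z']+", text.lower())
    # Count every word that is not a stop word.
    counts = Counter(word for word in words if word not in stop_words)
    # most_common returns pairs sorted by count in descending order.
    return counts.most_common(n)


sample = "in a rainy night can you hear me\nin a rainy night can you help me"
for word, count in top_words(sample):
    print(word, count)
```

To analyze a file instead of a literal string, read its contents first (for example with `open('dead romance.txt').read()`) and pass the result to `top_words`. `Counter` counts in a single pass over the words, whereas calling `A.count(i)` for each key rescans the whole list every time.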