python 统计单词个数---从文件读取版本---不去重

Posted 2020-07-27

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了python 统计单词个数---从文件读取版本---不去重相关的知识，希望对你有一定的参考价值。

#/usr//bin/env  pytnon
#coding:utf-8

import re

‘‘‘
words = ‘hello word  [email protected]#  ####   ##$dfabpple# apple##%#$  3423421apple        apple323423 432422324##[email protected]@#  @@@@@  $$!!#!#‘
‘‘‘

#编译生成正则表达式对象
regexp = re.compile(r‘[a-zA-Z]{2,}‘)

#with open(‘./aa.txt‘)  as f:
f = open(‘./aa.txt‘)
line = f.readlines()
print len(line)

j = 0
sum = 0
for i in range(len(line)):
     j = len(regexp.findall(line[i]))
     sum = sum + j
print sum

本文出自 “linux与网络那些事” 博客，请务必保留此出处http://khaozi.blog.51cto.com/952782/1790792

以上是关于python 统计单词个数---从文件读取版本---不去重的主要内容，如果未能解决你的问题，请参考以下文章