import glob
import re

import nltk  # nltk.word_tokenize needs the 'punkt' tokenizer data: nltk.download('punkt')

def initial_clean(text):
    # drop URLs (http/https/www) and @mentions, then keep letters and spaces only
    text = re.sub(r"((\S+)?(http(s)?)(\S+))|((\S+)?(www)(\S+))|((\S+)?(@)(\S+)?)", " ", text)
    text = re.sub("[^a-zA-Z ]", "", text)
    text = text.lower()  # lower-case the text
    return nltk.word_tokenize(text)  # split into a list of word tokens
for file in glob.glob('*.csv'):
    # a.csv => a-out.csv, b.csv => b-out.csv, etc.
    with open(file.replace('.csv', '-out.csv'), 'w') as outfile:
        # read a line at a time from the input file and
        # write the cleaned tokens, one line at a time, to the output file
        with open(file, errors='ignore') as infile:
            for line in infile:
                print(' '.join(initial_clean(line)), file=outfile)
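
# Quick sanity check of the cleaner (a minimal sketch: the sample sentence is
# made up, and it assumes the NLTK 'punkt' data has been downloaded):
# >>> initial_clean("Check https://example.com or mail me @user NOW!!!")
# ['check', 'or', 'mail', 'me', 'now']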