How to find the most commonly used words in data using Python? [duplicate]
This question already has an answer here:
I am working on a sentiment analysis project in Python (using natural language processing). I collected data from Twitter and saved it as a CSV file. The file contains tweets, mostly about cryptocurrency. I cleaned the data and applied sentiment analysis using classification algorithms.
Now that the data is clean, I want to find the most frequently used words. This is the code I used to import the libraries and the CSV file:
# importing Libraries
from pandas import DataFrame, read_csv
import chardet
import matplotlib.pyplot as plt; plt.rcdefaults()
from matplotlib import rc
%matplotlib inline
import pandas as pd
plt.style.use('ggplot')
import numpy as np
import re
import warnings
#Visualisation
import matplotlib.pyplot as plt
import matplotlib
import seaborn as sns
from IPython.display import display
from mpl_toolkits.basemap import Basemap
from wordcloud import WordCloud, STOPWORDS
#nltk
from nltk.stem import WordNetLemmatizer
from nltk.sentiment.vader import SentimentIntensityAnalyzer
from nltk.sentiment.util import *
from nltk import tokenize
from sklearn.feature_extraction.text import TfidfVectorizer
from nltk.stem.snowball import SnowballStemmer
matplotlib.style.use('ggplot')
pd.options.mode.chained_assignment = None
warnings.filterwarnings("ignore")
## Reading the CSV file into a DataFrame called btweet
# a raw string keeps the backslashes in the Windows path from being treated as escapes
btweet = pd.read_csv(r"C:\Users\name\Documents\python assignment\bitcoin1.csv",
                     index_col=None, skipinitialspace=True)
print(btweet)
I don't think it is necessary to post the rest of the code, since it is quite long. For data cleaning, I removed hyperlinks, RTs (retweets), URLs, and punctuation, converted the text to lowercase, and so on.
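The cleaning steps described above were not posted, but they can be sketched roughly like this (a hypothetical minimal version using `re`; the asker's actual code may differ):

```python
import re

def clean_tweet(text):
    """Minimal cleaning sketch: strip URLs, RT markers, @mentions and punctuation, then lowercase."""
    text = re.sub(r'http\S+|www\.\S+', '', text)  # remove hyperlinks/URLs
    text = re.sub(r'\bRT\b', '', text)            # remove the retweet marker
    text = re.sub(r'@\w+', '', text)              # remove @mentions
    text = re.sub(r'[^\w\s]', '', text)           # remove punctuation
    text = re.sub(r'\s+', ' ', text)              # collapse leftover whitespace
    return text.lower().strip()

print(clean_tweet("RT @user: Check https://example.com Bitcoin!"))  # → "check bitcoin"
```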
For example, this is the output of the list of positive tweets:
In [35]: btweet[btweet.sentiment_type == 'POSITIVE'].Tweets.reset_index(drop = True)[0:5]
Out[35]:
0 anizameddine more than just bitcoin blockchain...
1 bitcoinmagazine icymi wyoming house unanimousl...
2 bitracetoken bitrace published the smart contr...
3 unusual and quite promising ico banca banca_of...
4 airdrop coinstocks link it is a exchange so ge...
Name: Tweets, dtype: object
Is there a way to find the most frequently used words in the data? Could anyone help me with the code?
Answer
Demo:
import pandas as pd
import matplotlib.pyplot as plt
from nltk import sent_tokenize, word_tokenize, regexp_tokenize, FreqDist
from nltk.corpus import stopwords
from sklearn.datasets import fetch_20newsgroups
from wordcloud import WordCloud, STOPWORDS

def tokenize(text, pat=r'(?u)\w\w+', stop_words='english', min_len=2):
    # lowercase the text, tokenize with a regex, then drop stop words and very short tokens
    stop = set(stopwords.words(stop_words)) if stop_words else set()
    return [w
            for w in regexp_tokenize(text.casefold(), pat)
            if w not in stop and len(w) >= min_len]

def get_data():
    categories = ['alt.atheism', 'soc.religion.christian',
                  'comp.graphics', 'sci.med']
    twenty_train = fetch_20newsgroups(subset='train',
                                      categories=categories, shuffle=True)
    twenty_test = fetch_20newsgroups(subset='test',
                                     categories=categories, shuffle=True)
    X_train = pd.DataFrame(twenty_train.data, columns=['text'])
    X_test = pd.DataFrame(twenty_test.data, columns=['text'])
    return X_train, X_test, twenty_train.target, twenty_test.target

X_train, X_test, y_train, y_test = get_data()

words = tokenize(X_train.text.str.cat(sep=' '), min_len=4)
fdist = FreqDist(words)

wc = WordCloud(width=800, height=400, max_words=100).generate_from_frequencies(fdist)
plt.figure(figsize=(12, 10))
plt.imshow(wc, interpolation="bilinear")
plt.axis("off")
plt.savefig('d:/temp/result.png')
Result: (word-cloud image)
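The demo above renders a word cloud, but `FreqDist` (a subclass of `collections.Counter`) can also list the top words directly via `most_common` — a minimal standalone sketch with made-up tokens:

```python
from nltk import FreqDist

# FreqDist counts hashable items; this toy token list stands in for the `words` list above
words = ['bitcoin', 'price', 'bitcoin', 'crypto', 'bitcoin', 'price']
fdist = FreqDist(words)
print(fdist.most_common(2))  # → [('bitcoin', 3), ('price', 2)]
```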
Another answer
Suppose we have the string a:
from collections import Counter

a = "Hello world and say hello again"
sp = a.split()
counts = Counter(sp)
most_occur = counts.most_common(4)
print(most_occur)
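Applied to the asker's data, the same Counter approach would look something like this (the `btweet` frame and its `Tweets` column are assumed from the question; the frame below is a stand-in):

```python
from collections import Counter

import pandas as pd

# Stand-in for the asker's cleaned `btweet` DataFrame from the question.
btweet = pd.DataFrame({'Tweets': ['bitcoin is rising',
                                  'bitcoin blockchain news',
                                  'buy bitcoin now']})

# Join every tweet into one string, split on whitespace, and count the tokens.
word_counts = Counter(btweet['Tweets'].str.cat(sep=' ').split())
print(word_counts.most_common(3))  # → [('bitcoin', 3), ('is', 1), ('rising', 1)]
```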