如何在R中反转unnest_tokens [重复]

Posted 2023-04-12

技术标签:

【中文标题】如何在R中反转unnest_tokens [重复]【英文标题】：how to reverse unnest_token in R [duplicate] 【发布时间】：2020-11-16 04:50:01 【问题描述】：

我想知道如何反转 unnest_token 函数以导出推文并在 python 中工作。

这是我的数据集

ID DATE          TWEETS
1  2020-02-29    people tends stay home reach customer directly bulk email

这是我申请的

  flood_tweet_messages <- df %>%
  dplyr::select(ID, cleaned_tweets) %>%
  unnest_tokens(word, cleaned_tweets) %>%

我得到了这个

ID word

1  people
1  tends
1  stay 
1  home

如何反转该过程并返回到数据集的原始形式？是否可以将日期保留在未嵌套的数据集中？怎么样？

感谢大家的帮助！

【问题讨论】：

试试df %>% group_by(ID) %>% summarize(cleaned_tweets = str_flatten(word, " ")) 如果我在 df 数据集中有 ID 列，它会起作用，但我不知道如何添加它。那你怎么知道哪些词属于哪些文本呢？好的，我做到了。修复是愚蠢的。您的代码效果很好！非常感谢！ 【参考方案1】：

这解决了问题！

df %>% group_by(ID) %>% summarize(cleaned_tweets = str_flatten(word, " "))

感谢用户@Phil

【讨论】：

以上是关于如何在R中反转unnest_tokens [重复]的主要内容，如果未能解决你的问题，请参考以下文章

unnest_tokens 及其错误（“”）

使用txt文件作为源时的Tidytext unnest_tokens错误

如何在 Twitter 文本数据上使用 unnest_token？

如何在Python中反转单词[重复]

如何反转多维数组中的重复值

如何在Matlab中反转矩阵矩阵？