获取一篇文章中的单词频次(出现次数)

Posted tiancaige

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了获取一篇文章中的单词频次(出现次数)相关的知识,希望对你有一定的参考价值。

  string text = @"var query = from info in infoList 
    where info.AuditFlag == null || info.AuditFlag == false 
    join emp in empList 
       on info.SaleMan equals emp.EmployeeCode 
    join house in houseList 
       on info.WareHouse equals house.WareHouseCode 
    join client in clientList 
       on info.ClientCode equals client.ClientCode 
    join dictPayMode in dictList 
       on info.PayMode equals dictPayMode.ValueCode 
    where dictPayMode.TypeCode == ‘PayMode‘ 
    join dictInvoiceType in dictList 
       on info.InvoiceType equals dictInvoiceType.ValueCode 
    where dictInvoiceType.TypeCode == ‘InvoiceType‘
    select new 
    { 
       id = info.ID,
       SaleBillCode = info.SaleBillCode,
       SaleMan = emp.Name,
       SaleDate = info.SaleDate,
       Provider = client.ShortName,
       WareHouse = house.ShortName,
       PayMode = dictPayMode.ValueName,
       InvoiceType = dictInvoiceType.ValueName,
       InvoiceCode = info.InvoiceCode,
       AuditFlag = info.AuditFlag 
    };";
            //按单词转换为数组
            string[] allWords = text.Split(new char[] { ., ?, !,  , ;, :, , }, StringSplitOptions.RemoveEmptyEntries);
            string[] distinctWords = allWords.Distinct().ToArray<string>();//去掉单词数组中重复的单词
            int[] counts = new int[distinctWords.Length];//创建一个存放词频统计信息的数组
            for (int i = 0; i < distinctWords.Length; i++)//遍历每个单词
            {
                string tempWord = distinctWords[i];
                //计算每个单词出现的次数
                var query = from item in allWords
                            where item.ToLower() == tempWord.ToLower()
                            select item;
                counts[i] = query.Count();
            }
            //输出词频统计结果
            for (int i = 0; i < counts.Count(); i++)
            {
                label1.Text+=distinctWords[i] + "出现 " + counts[i].ToString() + "";
            }

 

以上是关于获取一篇文章中的单词频次(出现次数)的主要内容,如果未能解决你的问题,请参考以下文章

如何用python统计单词的频率

齐普夫定律的起源

N个任务掌握java系列之统计一篇文章中单词出现的次数

统计单词数

2011统计单词数

统计一篇英文文章内每个单词出现频率,并返回出现频率最高的前10个单词及其出现次数