[DataMining] WEEK1 - Text Retrieval and Search Engines

Posted oDoraemon

  • What does a computer have to do in order to understand a natural language sentence?
  • What is ambiguity?
  • Why is natural language processing (NLP) difficult for computers?
  • What is bag-of-words representation? Why do modern search engines use this simple representation of text?
  • What are the two modes of text information access? Which mode does a web search engine such as Google support?
  • When is browsing more useful than querying to help a user find relevant information?
  • Why is a text retrieval task defined as a ranking task?
  • What is a retrieval model?
  • What are the two assumptions made by the Probability Ranking Principle?
  • What is the Vector Space Retrieval Model? How does it work?
  • How do we define the dimensions of the Vector Space Model? What does “bag of words” representation mean?
  • What does the retrieval function intuitively capture when we instantiate a vector space model with bag-of-words representation and bit-vector representation for documents and queries?
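
As a study aid for the last three questions, here is a minimal sketch of a vector space model using a bag-of-words vocabulary and 0/1 bit vectors. The tokenizer, the function names (`tokenize`, `bit_vector`, `score`) and the sample documents are all assumptions made up for illustration. With bit vectors, the dot product simply counts how many distinct query terms appear in a document.

```python
def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def bit_vector(text, vocabulary):
    """Return 1 for each vocabulary word present in the text, else 0.

    Word order and repeat counts are discarded, which is the
    bag-of-words assumption combined with a presence/absence weight.
    """
    terms = set(tokenize(text))
    return [1 if word in terms else 0 for word in vocabulary]


def score(query, doc, vocabulary):
    """Dot product of the query and document bit vectors."""
    q = bit_vector(query, vocabulary)
    d = bit_vector(doc, vocabulary)
    return sum(qi * di for qi, di in zip(q, d))


# Hypothetical toy collection, invented for this example.
docs = {
    "d1": "news about organic food campaign",
    "d2": "news of presidential campaign",
    "d3": "news about presidential campaign presidential candidate",
    "d4": "sports news",
}
query = "news about presidential campaign"

# Each distinct word in the query and the collection is one dimension.
vocabulary = sorted(
    {word for text in [query, *docs.values()] for word in tokenize(text)}
)

scores = {name: score(query, text, vocabulary) for name, text in docs.items()}
ranking = sorted(docs, key=lambda name: scores[name], reverse=True)

for name in ranking:
    print(name, scores[name])
```

Note that "presidential" occurs twice in d3 but contributes only once to its score. This instantiation cannot reward term frequency or tell rare words from common ones, so d1 and d2 tie even though d2 is arguably the better match for the query.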
