R爬取网页信息

Posted 2020-08-16 马家寨香椿

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了R爬取网页信息相关的知识，希望对你有一定的参考价值。

#爬取电影票房信息
library(stringr)
library(XML)
library(maps)
#htmlParse()用来interpreting HTML
#创建一个object
movie_parsed<-htmlParse("http://58921.com/boxoffice/wangpiao/20161004",
                        encoding = "UTF-8")
#the next step:extract tables/data
#readHTMLTable() for identifying and reading out those tables
tables<-readHTMLTable(movie_parsed,stringsAsFactors=FALSE)
is.matrix(tables)
is.character(tables)
is.data.frame(tables)
is.list(tables)
#so we got an "list" format#

以上是关于R爬取网页信息的主要内容，如果未能解决你的问题，请参考以下文章