r NHL 2011 Defensive Stats.R

# set working directory
setwd("/My Dropbox/Projects/NHL Defensemen 2011 performance")

# load the libraries I commonly use
library(XML)
library(plyr)
library(lubridate)
library(ggplot2)

# grab the data
URL <- "http://www.hockey-reference.com/leagues/NHL_2011_skaters.html"
tables <- readHTMLTable(URL)$stats
head(tables)

# filter on D
ds <- tables[tables$Pos == 'D', ]
nrow(ds)  # number of records

# change data types -- probably an easier way, but this helped me learn R
str(ds)
for (i in c(1,3,6:19)) {
  ds[,i] <- as.numeric(as.character(ds[,i])) # important! -- convert factor to string first
}
for (i in c(2, 4:5, 20)) {
  ds[,i] <- as.character(ds[,i])
}
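
# A possible shortcut for the conversion above (a sketch, not the original
# author's method): convert several columns at once with lapply() instead of
# looping over indices. `demo` and `num_cols` are hypothetical names, and the
# values are made up for illustration only.
demo <- data.frame(GP = factor(c("82", "41", "7")),
                   Player = factor(c("A", "B", "C")))
num_cols <- "GP"
# as.numeric() on a factor returns its internal level codes, so go through
# as.character() first, exactly as the loop above does
demo[num_cols] <- lapply(demo[num_cols], function(x) as.numeric(as.character(x)))
demo$Player <- as.character(demo$Player)
str(demo)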

# let's keep only the "core" set of players: the upper 50% by games played
summary(ds$GP)
hist(ds$GP, xlab="Games played", main="Distribution of games played")
ds <- ds[ds$GP >= median(ds$GP),]

# let's look at the distribution of +/-
names(ds)[11] <- "plusmin"
summary(ds$plusmin)
hist(ds$plusmin)

# sort the dataframe
sorted.pm <- ds[order(ds$plusmin, decreasing=TRUE), ]

# top 25
head(sorted.pm, n=25)

# plot +/- against time on ice (TOI on the x axis, +/- on the y axis)
plot(ds$TOI, ds$plusmin, xlab="Time on ice", ylab="+/-", pch=20, cex=.8)

# sort the dataframe on TOI
sorted.toi <- ds[order(ds$TOI, decreasing=TRUE), ]

# top 25
head(sorted.toi, n=25)

# Zee was top on +/- and top 3 in TOI. +/- is not the best stat, but coupled with TOI, it's a start IMO
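
# One way to couple +/- with TOI (a sketch, not part of the original
# analysis): express +/- per 60 minutes of ice time. This assumes TOI holds
# total minutes played; `toy` and `pm_per60` are hypothetical names, and the
# values are made up for illustration only.
toy <- data.frame(plusmin = c(30, 10, -5), TOI = c(2000, 1000, 1500))
toy$pm_per60 <- toy$plusmin / toy$TOI * 60
# rank players by the rate instead of the raw total
toy[order(toy$pm_per60, decreasing = TRUE), ]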
