特征选择Boruta

Posted

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了特征选择Boruta相关的知识,希望对你有一定的参考价值。

使用Boruta前 ,需要对缺失值进行填充。 

https://www.analyticsvidhya.com/blog/2016/03/select-important-variables-boruta-package/

Variable selection is an important aspect of model building which every analyst must learn. After all, it helps in building predictive models free from correlated variables, biases and unwanted noise.

A lot of novice analysts assume that keeping all (or more) variables will result in the best model as you are not losing any information. Sadly, that is not true!

How many times has it happened that removing a variable from model has increased your model accuracy ?

At least, it has happened to me. Such variables are often found to be correlated and hinder achieving higher model accuracy. Today, we’ll learn one of the ways of how to get rid of such variables in R. I must say, R has an incredible CRAN repository. Out of all packages, one such available package for variable selection is Boruta Package.

以上是关于特征选择Boruta的主要内容,如果未能解决你的问题,请参考以下文章

R语言基于Boruta进行机器学习特征筛选(Feature Selection)

特征选择特征选择方法

《机学三》特征工程3 —— 特征选择:特征选择数据降维

特征选择方法总结

特征选择与特征子集(二)

特征选择嵌入式特征选择法