EnhancementModeratehelp wantedmodule:feature_selection
描述
Hi,
Is there any way to specify a tolerance when determining the number of optimal features when using RFECV (like n the CARET package)? Currently when using RFECV with many of the tree-based classifiers, the removal of the least important n features often results in a very slight reduction in model accuracy. This means that all of the features end up being selected as 'optimal', even though the reduction in model accuracy is very slight if we used a much smaller set of features. A tolerance setting, like 1% would allow only the features to be selected that would otherwise cause a large drop in model accuracy.
Steve