scikit-learn/scikit-learn

Tolerance on RFECV

Open

#7,559 建立於 2016年10月3日

在 GitHub 查看
 (14 留言) (0 反應) (0 負責人)Python (66,084 star) (27,020 fork)batch import
EnhancementModeratehelp wantedmodule:feature_selection

描述

Hi,

Is there any way to specify a tolerance when determining the number of optimal features when using RFECV (like n the CARET package)? Currently when using RFECV with many of the tree-based classifiers, the removal of the least important n features often results in a very slight reduction in model accuracy. This means that all of the features end up being selected as 'optimal', even though the reduction in model accuracy is very slight if we used a much smaller set of features. A tolerance setting, like 1% would allow only the features to be selected that would otherwise cause a large drop in model accuracy.

Steve

貢獻者指南