Tolerance on RFECV · scikit-learn/scikit-learn#7559

(14 comments) (0 reactions) (0 assignees) Python (27,020 forks) batch import

Enhancement · Moderate · help wanted · module:feature_selection

Repository metrics

Stars: 66,084
PR merge metrics: (average time to merge: 10 days) (90 PRs merged in the last 30 days)

Description

Hi,

Is there any way to specify a tolerance when determining the optimal number of features with RFECV (like in the CARET package)? Currently, when using RFECV with many of the tree-based classifiers, removing the least important n features often results in only a very slight reduction in model accuracy. This means that all of the features end up being selected as 'optimal', even though a much smaller set of features would cost very little accuracy. A tolerance setting, such as 1%, would select only those features whose removal would otherwise cause a large drop in model accuracy.

Steve

Contributor guide

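Research direction: Study the RFECV implementation in scikit-learn and propose adding a tolerance parameter that stops feature removal when the score drop is within the tolerance. Refer to similar implementations in other libraries (such as CARET in R).
Tech stack: Python, scikit-learn
Domain: machine learning, AI
Issue type: Feature
Difficulty: 3
Estimated time: 1-2 days
Activity status: Active
Clarity: Clear
Prerequisites: Python, scikit-learn, feature selection
Beginner friendliness: 65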
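
The tolerance idea from the issue description can be sketched outside of scikit-learn as a small post-processing step. This is only an illustration, not an existing RFECV option: the helper name `smallest_within_tolerance` and its `tol` argument are hypothetical, and the tolerance is assumed to be relative to the best cross-validated score (so `tol=0.01` means "within 1% of the best").

```python
def smallest_within_tolerance(n_features, mean_scores, tol=0.01):
    """Pick the smallest feature count whose score is within `tol` of the best.

    Hypothetical helper (not part of scikit-learn).
    n_features  : feature counts evaluated during elimination
    mean_scores : mean cross-validated score for each count (higher is better)
    tol         : relative tolerance, e.g. 0.01 for 1% of the best score
    """
    if len(n_features) != len(mean_scores) or len(mean_scores) == 0:
        raise ValueError("n_features and mean_scores must be non-empty and equal in length")
    if tol < 0:
        raise ValueError("tol must be non-negative")

    best = max(mean_scores)
    # Any score at or above this threshold counts as "close enough" to the best.
    threshold = best - tol * abs(best)
    candidates = [n for n, s in zip(n_features, mean_scores) if s >= threshold]
    return min(candidates)


# Made-up scores: accuracy barely changes between 2 and 5 features.
counts = [1, 2, 3, 4, 5]
scores = [0.80, 0.895, 0.897, 0.899, 0.900]
print(smallest_within_tolerance(counts, scores, tol=0.01))
print(smallest_within_tolerance(counts, scores, tol=0.0))
```

With a fitted RFECV, the scores would presumably come from its cross-validation results (recent releases expose `cv_results_["mean_test_score"]`), while the matching feature counts depend on `step` and `min_features_to_select`. The chosen count could then be passed to `RFE` via `n_features_to_select`. A built-in parameter, as the issue requests, would make this step unnecessary.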

