scikit-learn-contrib/category_encoders

Handle missing in one hot encoder

Open

#400 创建于 2023年3月12日

在 GitHub 查看
 (3 评论) (0 反应) (0 负责人)Python (2,322 star) (397 fork)batch import
buggood first issue

描述

Expected Behavior

Currently, handle_missing=value adds a new column although the documentation says 'value' will encode a new value as 0 in every dummy column. Furthermore, we need a test for this

Actual Behavior

adds a column instead of using all 0

Steps to Reproduce the Problem

from category_encoders import OneHotEncoder
import pandas as pd

he = OneHotEncoder(handle_missing="value")

data = [("foo", 1), ("bar", 2), (None, 6)]
data = pd.DataFrame(data, columns=["c1", "c2"])
print(he.fit_transform(data))

Specifications

  • Version: 2.6
  • Platform: linux

贡献者指南