[ASK] LibffmConverter - are the fit and transform function similar to Sklearn function such as OrdinalEncoder? · recommenders-team/recommenders#1818

Repository metrics

Stars: 17,706
PR merge metrics: average time to merge 6 days 16 hours; 10 PRs merged in the last 30 days

Description

I am splitting my dataset into train, validation, and test sets.

With sklearn's OrdinalEncoder, I call fit only on the train set. I can then use the same fitted OrdinalEncoder object to transform unseen data (here, the validation and test sets) without fitting again. This preserves the same encoding, and unseen categories are encoded as -1 (when the encoder is created with handle_unknown='use_encoded_value' and unknown_value=-1; the default is to raise an error).
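The fit-on-train, transform-everywhere pattern described above can be sketched in plain Python. This is only an illustration of the behaviour being asked about: `SimpleOrdinalEncoder` is a hypothetical class written for this example, not scikit-learn's implementation and not a description of what LibffmConverter does.

```python
class SimpleOrdinalEncoder:
    """Hypothetical stand-in that mimics fit-once, transform-many encoding."""

    def __init__(self, unknown_value=-1):
        self.unknown_value = unknown_value
        self.mapping_ = {}

    def fit(self, values):
        # Learn one integer code per distinct category, in sorted order.
        self.mapping_ = {cat: idx for idx, cat in enumerate(sorted(set(values)))}
        return self

    def transform(self, values):
        # Reuse the fitted mapping; categories not seen in fit get unknown_value.
        return [self.mapping_.get(v, self.unknown_value) for v in values]


train = ["action", "comedy", "drama", "comedy"]
valid = ["drama", "horror"]  # "horror" never appears in train

encoder = SimpleOrdinalEncoder().fit(train)  # fitted on train only
train_codes = encoder.transform(train)
valid_codes = encoder.transform(valid)
print(train_codes)  # [0, 1, 2, 1]
print(valid_codes)  # [2, -1]
```

The mapping is built once in `fit` and never changed by `transform`, which is why "drama" gets the same code in both sets while the unseen "horror" falls back to -1.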

Does LibffmConverter behave the same way? That is, can I fit it only on the train set and then use the fitted object to transform the unseen validation and test sets?

Example:

    converter = LibffmConverter().fit(train_df, col_rating='rating')
    train_df_new = converter.transform(train_df)
    valid_df_new = converter.transform(valid_df)  # preserves the mapping fitted on train and handles unseen data?
    test_df_new = converter.transform(test_df)  # preserves the mapping fitted on train and handles unseen data?

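Contributor guide

Research direction: Examine the source code of LibffmConverter in the recommenders repository, particularly the fit and transform methods. Compare its implementation with sklearn's OrdinalEncoder, focusing on how unseen categories are handled during transform. Consult the existing documentation or tests to confirm this behavior. The issue is open and has two comments; read them to see whether a maintainer has already answered.
Tech stack: Python
Domain: machine learning, data
Issue type: investigation
Difficulty: 1
Estimated time: under 1 hour
Activity status: active
Clarity: clear
Prerequisites: Python
Beginner friendliness: 80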
