[ASK] LibffmConverter - are the fit and transform function similar to Sklearn function such as OrdinalEncoder? · recommenders-team/recommenders#1818

Repository metrics

Stars: (17,706 stars)
PR merge metrics: (Avg merge 6d 16h) (10 merged PRs in 30d)

Description

I am splitting my dataset into Train, Validation, Test dataset.

For sklearn, when using OrdinalEncoder, the fit function will only be performed on Train dataset, I can use the same OrdinalEncoder fitted object to transform on unseen data(in this case Validation and Test dataset) without fitting again, this will preserve the same encoding and unseen data will be encoded as -1.

Does LibffmConverter perform the same way by simply fitting only the Train dataset and I can use the fitted object to transform other unseen(Validation and Test) dataset?

Example: converter = LibffmConverter().fit(train_df, col_rating='rating') train_df_new = converter.transform(train_df)

valid_df_new = converter.transform(valid_df). # preserving train fitted dictionary mapping and handle unseen data?  test_df_new = converter.transform(test_df). # preserving train fitted dictionary mapping and handle unseen data?

Contributor guide

Research direction: Check the LibffmConverter documentation or source code to see if the fit method stores a mapping that is reused in transform and handles unseen data.
Tech stack: python
Domain: machine learningdata
Issue type: Research
Difficulty: 1
Estimated time: Under 1 hour
Activity status: Active
Clarity: Clear
Prerequisites: Python
Newbie friendliness: 80

Repository metrics

Description

Description

Contributor guide

Get fresh easy issues in your inbox.