[ASK] LibffmConverter - are the fit and transform function similar to Sklearn function such as OrdinalEncoder? · recommenders-team/recommenders#1818

Métriques du dépôt

Stars: (17 706 stars)
Métriques de merge PR: (Merge moyen 6j 16h) (10 PRs mergées en 30 j)

Description

I am splitting my dataset into Train, Validation, Test dataset.

For sklearn, when using OrdinalEncoder, the fit function will only be performed on Train dataset, I can use the same OrdinalEncoder fitted object to transform on unseen data(in this case Validation and Test dataset) without fitting again, this will preserve the same encoding and unseen data will be encoded as -1.

Does LibffmConverter perform the same way by simply fitting only the Train dataset and I can use the fitted object to transform other unseen(Validation and Test) dataset?

Example: converter = LibffmConverter().fit(train_df, col_rating='rating') train_df_new = converter.transform(train_df)

valid_df_new = converter.transform(valid_df). # preserving train fitted dictionary mapping and handle unseen data?  test_df_new = converter.transform(test_df). # preserving train fitted dictionary mapping and handle unseen data?

Guide contributeur

Direction de recherche: Examinez le code source de LibffmConverter dans le dépôt recommenders, en particulier les méthodes fit et transform. Comparez l'implémentation avec OrdinalEncoder de sklearn, en vous concentrant sur la façon dont il gère les catégories non vues pendant transform. Vérifiez la documentation ou les tests existants qui clarifient ce comportement. L'issue est ouverte et comporte deux commentaires ; lisez les commentaires pour voir si un mainteneur a fourni une réponse.
Stack technique: python
Domaine: machine learningdata
Type d'issue: Recherche
Difficulté: 1
Temps estimé: Moins d'une heure
Statut d'activité: Active
Clarté: Claire
Prérequis: Python
Accessibilité débutant: 80

Métriques du dépôt

Description

Description

Guide contributeur

Recevez de nouvelles issues Easy par e-mail.