feast-dev/feast

Latest Only option for Historical Retrieval

Open

#1687, created on July 5, 2021

(12 comments) (1 reaction) (0 assignees) Python (5,029 stars) (896 forks) batch import
Community Contribution Needed, good first issue, keep-open, kind/feature, priority/p1

Description

Is your feature request related to a problem? Please describe.

In many batch workflows, it is worthwhile to retrieve only the latest features for each entity. This is useful for both production and backtesting purposes.

E.g. if I have an hourly/daily batch job that goes through our whole customer base to find fraudulent customers, we wouldn't really use the online store for this.

Describe the solution you'd like

Allow users to specify that an entity set extracted from a feature view should be deduplicated by latest, i.e. only the most recent row per entity is kept. Depends on #1611.

my_daily_batch_scoring_df = store.get_latest_features(
    entity_df="my_df",
    feature_refs=[...],
)
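
To make "deduplicated by latest" concrete, here is a minimal pandas sketch of the intended semantics. It is only an illustration, not how Feast implements or would implement this: the helper `latest_per_entity`, the column names `customer_id`, `event_timestamp` and `txn_count_7d`, and the sample rows are all hypothetical.

```python
import pandas as pd


def latest_per_entity(
    features: pd.DataFrame, entity_col: str, timestamp_col: str
) -> pd.DataFrame:
    """Hypothetical helper: keep only the most recent row for each entity key."""
    return (
        features.sort_values(timestamp_col)  # oldest rows first
        .drop_duplicates(subset=entity_col, keep="last")  # keep newest per entity
        .reset_index(drop=True)
    )


# Made-up feature rows: customers 1 and 2 each have two snapshots.
feature_rows = pd.DataFrame(
    {
        "customer_id": [1, 1, 2, 2, 3],
        "event_timestamp": pd.to_datetime(
            ["2021-07-01", "2021-07-04", "2021-07-02", "2021-07-03", "2021-07-05"]
        ),
        "txn_count_7d": [3, 8, 1, 2, 5],
    }
)

latest = latest_per_entity(feature_rows, "customer_id", "event_timestamp")
print(latest)
```

Each customer ends up with exactly one row, the one with the newest `event_timestamp`, which is the shape a daily batch scoring job would consume.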

Additional context

Linked issue: #1611
