apache/hudi

Limit the amount of partitions considered for GlobalBloomIndex

Open

#14484 opened on Nov 30, 2025

0 comments · 0 reactions · 0 assignees · Java · 4,823 stars · 2,431 forks
batch import, area:index, from-jira, good first issue, priority:high, type:improvement

Description

Currently, the global bloom index checks incoming records against files in all partitions. In many cases, the user clearly knows the range of partitions actually affected by updates (e.g., the upstream system drops updates older than a year). In such a scenario, it may make sense to support an option for the global bloom index that controls how many partitions it matches against, to improve performance.
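As a rough illustration of the idea only, the sketch below keeps the N most recent partitions before any bloom lookup would run. It is not Hudi code: the class `PartitionLimitSketch`, the method `limitPartitions`, and the `maxPartitions` parameter are hypothetical names, and it assumes date-style partition paths (such as `2025/11/30`) whose lexicographic order matches recency.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch, not part of the Hudi codebase.
public class PartitionLimitSketch {

    /**
     * Returns at most maxPartitions partition paths, preferring the most recent.
     * Assumption: paths are date-formatted so that reverse lexicographic order
     * is newest first. A non-positive limit means "no limit" (current behavior).
     */
    public static List<String> limitPartitions(List<String> allPartitions, int maxPartitions) {
        if (maxPartitions <= 0 || maxPartitions >= allPartitions.size()) {
            // No pruning: every partition remains a candidate.
            return new ArrayList<>(allPartitions);
        }
        List<String> sorted = new ArrayList<>(allPartitions);
        sorted.sort(Comparator.reverseOrder()); // newest first under the assumption above
        return new ArrayList<>(sorted.subList(0, maxPartitions));
    }

    public static void main(String[] args) {
        List<String> partitions =
            List.of("2023/11/30", "2025/11/29", "2024/06/01", "2025/11/30");
        // Only the two newest partitions would be checked by the index.
        System.out.println(limitPartitions(partitions, 2));
    }
}
```

Records whose keys live in a pruned partition would no longer be matched as updates, so an option like this only suits workloads where older partitions are known not to receive updates.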

JIRA info
