apache/hudi

Limit the amount of partitions considered for GlobalBloomIndex

Open

#14484 opened on Nov 30, 2025

(0 comments) (0 reactions) (0 assignees) Java (4,823 stars) (2,431 forks) batch import
area:index, from-jira, good first issue, priority:high, type:improvement

Description

Currently, the global bloom index checks incoming records against files in all partitions. In many cases, the user clearly knows the range of partitions actually affected by updates (e.g., the upstream system drops updates older than a year). In such a scenario, it may make sense to support an option for the global bloom index that controls how many partitions are matched against, to improve performance.
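
To make the request concrete, here is a minimal sketch of the partition-pruning step such an option might drive. It is not Hudi code: the class `PartitionLimiter`, the method `latestPartitions`, and the `maxPartitions` parameter are hypothetical names chosen for illustration, and the sketch assumes date-style partition paths (e.g. `2025/11/30`) whose lexicographic order matches chronological order.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper, NOT part of Hudi: picks the partitions a global
// index lookup would be restricted to.
class PartitionLimiter {

    // Returns at most maxPartitions partition paths, newest first.
    // Assumes paths sort chronologically when compared as strings.
    // A non-positive limit means "no limit" (all partitions are kept,
    // which corresponds to the current behavior described above).
    static List<String> latestPartitions(List<String> allPartitions, int maxPartitions) {
        List<String> sorted = new ArrayList<>(allPartitions);
        sorted.sort(Comparator.reverseOrder());
        if (maxPartitions <= 0 || maxPartitions >= sorted.size()) {
            return sorted;
        }
        return new ArrayList<>(sorted.subList(0, maxPartitions));
    }

    public static void main(String[] args) {
        List<String> all = Arrays.asList(
                "2025/11/28", "2025/11/30", "2024/01/01", "2025/11/29");
        // Only the two most recent partitions would be checked.
        System.out.println(latestPartitions(all, 2)); // [2025/11/30, 2025/11/29]
    }
}
```

One trade-off worth noting for any such option: an incoming record whose key already exists in an excluded partition would presumably no longer be matched and would be treated as a new insert, so the limit is only safe when the user's assumption about the update range actually holds.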

JIRA info
