apache/hudi

Limit the amount of partitions considered for GlobalBloomIndex

Open

#14484, opened on November 30, 2025

0 comments, 0 reactions, 0 assignees. Java, 4,823 stars, 2,431 forks. batch import
Labels: area:index, from-jira, good first issue, priority:high, type:improvement

Description

Currently, the global bloom index checks incoming records against files in all partitions. In many cases, the user knows which range of partitions is actually affected by updates (e.g., the upstream system drops updates older than a year). In such a scenario, it may make sense to support an option for the global bloom index that controls how many partitions to match against, to improve performance.
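As a rough illustration of the idea, the sketch below trims a list of partition paths down to the most recent N before any index lookup would run. It is not Hudi code: `PartitionLimiter`, `latestPartitions`, and the `maxPartitions` parameter are hypothetical names, and it assumes date-style partition paths (such as `2025/11/30`) whose lexicographic order matches chronological order.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper for illustration only; not part of the Hudi API.
final class PartitionLimiter {
    private PartitionLimiter() {}

    /**
     * Returns at most maxPartitions partition paths, newest first.
     * Assumes paths sort lexicographically in chronological order
     * (e.g. "yyyy/MM/dd"). A non-positive maxPartitions means "no limit",
     * which mirrors the current behavior of checking every partition.
     */
    static List<String> latestPartitions(List<String> allPartitions, int maxPartitions) {
        if (maxPartitions <= 0 || maxPartitions >= allPartitions.size()) {
            // No limit requested, or the limit already covers everything.
            return new ArrayList<>(allPartitions);
        }
        List<String> sorted = new ArrayList<>(allPartitions);
        sorted.sort(Comparator.reverseOrder()); // newest partition first
        return new ArrayList<>(sorted.subList(0, maxPartitions));
    }
}
```

For example, calling `PartitionLimiter.latestPartitions(partitions, 2)` on `2023/01/01`, `2025/11/30`, `2024/06/15`, `2025/11/29` keeps only `2025/11/30` and `2025/11/29`. One trade-off to keep in mind with any such option: a record whose key lives in an excluded partition would no longer be matched, so it would likely be treated as a new insert rather than an update.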

JIRA info

