apache/hudi

Incremental Cleaning should not revert to brute-force all-partition scanning in any case

Open

#14569 opened on Nov 30, 2025

(0 comments) (0 reactions) (0 assignees) Java (4,823 stars) (2,431 forks) batch import
Labels: component:cleaning, from-jira, good first issue, help-wanted, priority:high, type:feature

Description

After [https://github.com/apache/incubator-hudi/pull/1576], Incremental Cleaning still resorts to a full partition scan when no previous clean operation has been done on the dataset. This ticket is to design and implement a safe solution that avoids full scanning in all cases.
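To make the gap concrete, here is a minimal sketch of one possible direction, not the design this ticket settles on and not Hudi's actual API. It assumes a hypothetical `Commit` type that records which partitions each commit wrote to, and it assumes instant times are strings that sort chronologically. When a previous clean exists, only commits at or after its retained instant are consulted. When none exists, the sketch derives the partition set from commit metadata rather than listing every partition on storage. That is only safe if the timeline still covers all writes (nothing archived), which is an assumption of this example.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Optional;
import java.util.Set;

// Hypothetical stand-ins for illustration only; these are NOT Hudi classes.
final class CleanPlannerSketch {

    /** A completed commit: its instant time and the partitions it wrote to. */
    static final class Commit {
        final String instantTime;
        final List<String> writtenPartitions;

        Commit(String instantTime, List<String> writtenPartitions) {
            this.instantTime = instantTime;
            this.writtenPartitions = writtenPartitions;
        }
    }

    /**
     * Collects the partitions to inspect for cleaning from commit metadata,
     * so no file-system listing of all partitions is needed.
     *
     * @param timeline          completed commits, oldest first
     * @param lastCleanRetained earliest instant retained by the previous clean,
     *                          or empty if the dataset was never cleaned
     */
    static Set<String> partitionsToClean(List<Commit> timeline,
                                         Optional<String> lastCleanRetained) {
        Set<String> partitions = new LinkedHashSet<>();
        for (Commit commit : timeline) {
            // With a previous clean: only commits at or after its retained instant.
            // Without one: every commit on the timeline (assumes nothing is archived).
            boolean relevant = lastCleanRetained
                    .map(retained -> commit.instantTime.compareTo(retained) >= 0)
                    .orElse(true);
            if (relevant) {
                partitions.addAll(commit.writtenPartitions);
            }
        }
        return partitions;
    }
}
```

In this sketch the no-previous-clean case costs one pass over commit metadata instead of a scan of every partition. A real solution would also have to handle archived commits, which this example deliberately leaves out.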

JIRA info


Comments

09/Jul/20 13:10;309637554;I am happy to take this issue.;;;
