apache/hudi

Make sure that Compression Codec configuration is respected across the board

Open

#14963 opened on Nov 30, 2025

View on GitHub
 (0 comments) (0 reactions) (1 assignee)Java (4,823 stars) (2,431 forks)batch import
area:storagefrom-jiragood first issuepriority:hightype:bug

Description

Currently there are quite a few places where we assume GZip as the compression codec which is incorrect, given that this is configurable and users might actually prefer to use different compression codec.

Examples:

[HoodieParquetDataBlock|https://github.com/apache/hudi/pull/4333/files#diff-798a773c6eef4011aef2da2b2fb71c25f753500548167b610021336ef6f14807]

JIRA info

Contributor guide