apache/pinot

Refactor to eliminate duplicate code in Spark 2.x and 3.x Batch Ingestion

Open

#14,271 建立於 2024年10月22日

在 GitHub 查看
 (1 留言) (0 反應) (1 負責人)Java (4,937 star) (1,234 fork)batch import
cleanupgood first issueingestion

描述

Currently between spark 2.x and spark 3.x batch ingestion lot of code is duplicated other than SparkSegmentMetadataPushJobRunner class. This ticket is to refactor the code to eliminate the duplicate code without breaking the compatibility. Code changes required are to :

  • Abstract common functionality into shared methods/classes.
  • Separate version-specific details.
  • Ensure the resulting codebase is cleaner and easier to maintain.

貢獻者指南