apache/pinot
Voir sur GitHubRefactor to eliminate duplicate code in Spark 2.x and 3.x Batch Ingestion
Open
#14 271 ouverte le 22 oct. 2024
cleanupgood first issueingestion
Description
Currently between spark 2.x and spark 3.x batch ingestion lot of code is duplicated other than SparkSegmentMetadataPushJobRunner class. This ticket is to refactor the code to eliminate the duplicate code without breaking the compatibility. Code changes required are to :
- Abstract common functionality into shared methods/classes.
- Separate version-specific details.
- Ensure the resulting codebase is cleaner and easier to maintain.