rapidsai/cudf

[FEA] Expand ORC and Parquet benchmarks to cover different stripe/rowgroup sizes

Open

#10,100 created on January 21, 2022

(5 comments) (0 reactions) (0 assignees) C++ (6,000 stars) (735 forks) batch import
Labels: Performance, cuIO, feature request, good first issue, libcudf, tests

Description

Add a set of benchmarks with varying stripe/rowgroup sizes to each affected component:

  • ORC reader
  • ORC writer
  • Parquet reader
  • Parquet writer

Use the new benchmarks to evaluate the effects of these options and potentially determine the optimal settings.
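
As a rough illustration of the request above, the sketch below enumerates one benchmark case per combination of component and stripe/rowgroup size. It is not taken from the libcudf benchmark suite: `BenchCase`, `make_cases`, and the `size_rows` field are hypothetical names, and the candidate sizes a caller passes in are placeholders. A real benchmark would forward each size to the corresponding reader or writer option instead of just recording it.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// One benchmark case: the component under test and the stripe/rowgroup
// size to use. Both the struct and the choice of rows as the unit are
// assumptions made for this sketch.
struct BenchCase {
  std::string component;
  std::size_t size_rows;
};

// Build the cross product of components and candidate sizes, so that every
// component is measured at every size.
std::vector<BenchCase> make_cases(const std::vector<std::string>& components,
                                  const std::vector<std::size_t>& sizes)
{
  std::vector<BenchCase> cases;
  cases.reserve(components.size() * sizes.size());
  for (const auto& component : components) {
    for (const auto size : sizes) {
      assert(size > 0 && "stripe/rowgroup size must be positive");
      cases.push_back({component, size});
    }
  }
  return cases;
}
```

Calling `make_cases` with the four components listed above and, say, three candidate sizes yields twelve cases, ordered by component and then by size. Keeping the grid in one place makes it easier to compare results across sizes when looking for a good default.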
