rapidsai/cudf

Use grid stride in CSV reader kernels

Open

#14,066 created on September 8, 2023

(1 comment) (0 reactions) (1 assignee) C++ (6,000 stars) (735 forks)
Labels: Performance, cuIO, good first issue

Description

Currently, the CSV reader parses data using one thread per row, regardless of the file size, so the number of blocks launched grows with the number of rows. Using a grid-stride loop would allow the kernels to launch with a preset number of blocks even for large inputs.

This applies both to the parser and the data inference kernels.
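To illustrate the indexing pattern being proposed, here is a minimal host-side C++ sketch that emulates a grid-stride loop over rows. It is not cuDF code: `process_rows_grid_stride`, `emulate_launch`, and `every_row_visited_once` are hypothetical names, and the "parse" step is replaced by a visit counter. In an actual CUDA kernel, the four index arguments would come from the built-ins `blockIdx.x`, `blockDim.x`, `threadIdx.x`, and `gridDim.x`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the work one GPU thread would do in a row kernel.
// In CUDA, block_idx/block_dim/thread_idx/grid_dim correspond to
// blockIdx.x, blockDim.x, threadIdx.x and gridDim.x.
void process_rows_grid_stride(std::size_t block_idx, std::size_t block_dim,
                              std::size_t thread_idx, std::size_t grid_dim,
                              std::size_t num_rows, std::vector<int>& visits)
{
  // Total number of threads in the grid; each thread advances by this amount.
  std::size_t const stride = grid_dim * block_dim;

  // Start at this thread's global index, then hop by the grid size until
  // the rows are exhausted. One thread may therefore handle several rows.
  for (std::size_t row = block_idx * block_dim + thread_idx; row < num_rows; row += stride) {
    ++visits[row];  // placeholder for "parse row" or "infer type of row"
  }
}

// Sequentially emulates launching a fixed-size grid on the host
// (assumption: a real launch would run these threads in parallel on the GPU).
std::vector<int> emulate_launch(std::size_t grid_dim, std::size_t block_dim,
                                std::size_t num_rows)
{
  assert(grid_dim > 0 && block_dim > 0);
  std::vector<int> visits(num_rows, 0);
  for (std::size_t b = 0; b < grid_dim; ++b) {
    for (std::size_t t = 0; t < block_dim; ++t) {
      process_rows_grid_stride(b, block_dim, t, grid_dim, num_rows, visits);
    }
  }
  return visits;
}

// Returns true if every row was processed exactly once.
bool every_row_visited_once(std::vector<int> const& visits)
{
  for (int count : visits) {
    if (count != 1) { return false; }
  }
  return true;
}
```

With a fixed grid of, say, 4 blocks of 32 threads, `emulate_launch(4, 32, 1000)` still covers all 1000 rows exactly once, because each emulated thread strides through the input instead of owning a single row. The same loop also handles inputs smaller than the grid, where the surplus threads simply do no work.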
