rapidsai/cudf

Use grid stride in CSV reader kernels

Open

#14,066 建立於 2023年9月8日

在 GitHub 查看
 (1 留言) (0 反應) (1 負責人)C++ (6,000 star) (735 fork)batch import
PerformancecuIOgood first issue

描述

Currently, the CSV reader parses data using a thread per row, and a separate thread is used for each row, regardless of the file size. Using a grid stride loop would allow kernels to launch with preset number of blocks even with large input.

This applies both to the parser and the data inference kernels.

貢獻者指南