rapidsai/cudf

[FEA] efficient way to truncate by month

Open

#18650, created on May 3, 2025

5 comments, 0 reactions, 1 assignee. Language: C++ (6,000 stars, 735 forks)
Labels: Python, feature request, good first issue, libcudf

Description

Is there an efficient way to truncate a Series by month / quarter in cuDF?

dt.floor (https://docs.rapids.ai/api/cudf/latest/user_guide/api_docs/api/cudf.core.series.datetimeproperties.floor/) doesn't support month / quarter / year frequencies.

In pandas I can either, depending on the dtype backend:

  • drop to numpy and use astype('datetime64[3M]')
  • drop to PyArrow and use pc.floor_temporal
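
For reference, the numpy route mentioned above can be sketched roughly as follows. This is a minimal illustration on a plain numpy array, not cuDF code; it relies on numpy anchoring multi-month units at 1970-01, so 3-month bins happen to line up with calendar quarters. The variable names are only for illustration.

```python
import numpy as np

# One timestamp, stored at nanosecond resolution as pandas would hold it.
values = np.array(["2020-02-04"], dtype="datetime64[ns]")

# Casting to a 3-month unit floors each value to the start of its 3-month bin
# (bins are counted from 1970-01, so they coincide with calendar quarters).
quarters = values.astype("datetime64[3M]")

# Cast back to day resolution to get an ordinary date at the bin start.
truncated = quarters.astype("datetime64[D]")

print(truncated)
```

With the PyArrow backend, pc.floor_temporal(arr, multiple=3, unit="month") plays the same role.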

Any suggestion for cuDF?

Example: given

import cudf
from datetime import datetime

s = cudf.Series([datetime(2020, 2, 4)])
s.dt.truncate('3 months')  # Not valid syntax, I know :)

I'd like to end up with

'2020-01-01'
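
To pin down the semantics being asked for, here is a small pure-Python sketch of the truncation arithmetic for a single timestamp. floor_to_months is a hypothetical helper written for this illustration, not an existing cuDF or pandas function, and it assumes the bin width divides 12 so that bins restart every January.

```python
from datetime import datetime


def floor_to_months(ts: datetime, n_months: int) -> datetime:
    """Hypothetical helper: floor ts to the start of its n_months-wide bin."""
    # Assumption: only widths that divide 12 are allowed, so bins align with the year.
    if n_months <= 0 or 12 % n_months != 0:
        raise ValueError("n_months must be a positive divisor of 12")
    month_index = ts.month - 1                    # 0-based month within the year
    bin_start = month_index - month_index % n_months
    return datetime(ts.year, bin_start + 1, 1)    # first day of the bin, at midnight


result = floor_to_months(datetime(2020, 2, 4), 3)
print(result.date())
```

The same year/month arithmetic could in principle be applied column-wise to the year and month components of a datetime column, though whether that counts as efficient on the GPU is exactly what this issue is asking.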
