rapidsai/cudf

[FEA] efficient way to truncate by month

Open

#18,650 opened on 2025年5月3日

GitHub で見る
 (5 comments) (0 reactions) (1 assignee)C++ (6,000 stars) (735 forks)batch import
Pythonfeature requestgood first issuelibcudf

説明

Is there an efficient way to truncate a Series by month / quarter in cuDF?

https://docs.rapids.ai/api/cudf/latest/user_guide/api_docs/api/cudf.core.series.datetimeproperties.floor/ dt.floor doesn't support month / year / quarter

In pandas I can either, depending on the dtype backend:

  • drop to numpy and use astype('datetime64["3M"]'))
  • drop to PyArrow and use pc.floor_temporal

Any suggestion for cuDF?

Example: given

import cudf
from datetime import datetime

s = cudf.Series([datetime(2020, 2, 4)])
s.dt.truncate('3 months')  # Not valid syntax, I know :)

I'd like to end up with

'2020-01-01'

コントリビューターガイド