rapidsai/cudf

[FEA] Implement find_valid_index and its permutations (first_ and last_)

Open

#1,480 opened on Apr 22, 2019

View on GitHub
 (4 comments) (0 reactions) (0 assignees)C++ (6,000 stars) (735 forks)batch import
Pythonfeature requestgood first issue

Description

Is your feature request related to a problem? Please describe. As a user, I'd like to be able to call last_valid_index or first_valid_index to get the index of the first/last non-null value in a Series.

Describe the solution you'd like I'd like to be able to do the following with cudf:

import pandas as pd
import numpy as np

ser1 = [1,2,3,4]
ser2 = [1,2,3,None]
pdf = pd.DataFrame({'a':ser1, 'b':ser2})
print(pdf.b.last_valid_index())
2

Additional context The pandas implementation is here. This will help with API compatibility.

Contributor guide