[BUG] Null Handling in where
#9328
Comments
This feels like a nullable dtype discrepancy or perhaps pandas/our documentation is in need of an update?
If |
Looks like you were correct @beckernick; the nulls are replaced after a call to cudf/cpp/include/cudf/copying.hpp (lines 769 to 770 in fd0b710).
A simple workaround here would be to simply assign |
We're equivalent to pandas nullable types here:
In that case, is the larger issue here that Pandas series still infer non-nullable types, while cuDF infers nullable? Based on Pandas docs, it seems like the recommendation here would be to explicitly provide https://pandas.pydata.org/pandas-docs/stable/user_guide/integer_na.html Although I do think it would be nice in the docs of both libraries to emphasize that |
Yes - I think that is what is happening.
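For illustration, here is a minimal pandas-only sketch of the inference difference discussed above. The sample values and variable names are made up for this example and are not taken from the original report. It assumes that cuDF lines up with the explicitly nullable case, as stated earlier in the thread.

```python
import pandas as pd

# Default inference: None in a numeric list becomes NaN and the dtype is float64.
s_default = pd.Series([1, None, 3])

# Explicit nullable dtype: the missing entry is stored as pd.NA and the dtype is Int64.
s_nullable = pd.Series([1, None, 3], dtype="Int64")

# Any comparison against NaN evaluates to False, so this mask is plain bool.
mask_default = s_default > 1

# Comparisons against pd.NA propagate the missing value into a nullable boolean mask.
mask_nullable = s_nullable > 1

# With the default dtype, the NaN row fails the condition and is replaced like any other.
replaced_default = s_default.where(mask_default, -1)

print(s_default.dtype, s_nullable.dtype)
print(mask_default.tolist())
print(mask_nullable.isna().tolist())
print(replaced_default.tolist())
```

With the float64 series the missing row silently behaves as a failed comparison, while the Int64 series keeps it as missing in the mask, which is where the two libraries can appear to disagree.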
Thanks for jumping in @charlesbluca and @brandon-b-miller. A doc update might be the resolution here.
Closes #9328
Authors:
- https://github.com/brandon-b-miller
Approvers:
- Vyas Ramasubramani (https://github.com/vyasr)
URL: #11276
Null handling in where does not reflect pandas null handling:
In the above, NaN is always treated as a value less than any int/float, etc.
cc @randerzander