-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] assert failed test_struct_self_join and test_computation_in_grpby_columns #5286
Comments
It looks like the issue is in this diff: https://github.com/rapidsai/cudf/compare/6c79b5902d55bab599731a9bded7e89b9c4875c5..65b1cbdeda9cab57243d0a98e646c860ef86039e#diff-50ba2711690aca8e4f28d7b491373a4dd76443127c8b452a77b6c1fe2388d9e3. There were some string changes here that could be related, so I am reverting those to confirm. |
Reverting: rapidsai/cudf#10673 fixes the test failure. It is specific to when rows with empty strings are joined, as regular projections are working fine. |
thanks for looking into this! I am wondering if we could add some UTs in cudfjni side so we could catch the error earlier~ |
Quick update, here's a minimum repro case in java (this test fails, where we should be getting a table with a single row/column with the empty string). I'll move to working on this in cuDF.
|
@pxLi I'll try, but this a chain of things. I have to have a string column with an empty string row, then I need to call contiguous split, and finally I should call partition. Removing rapidsai/cudf#10673 fixes the issue, removing contiguous split also fixes the issue, and if the row isn't a string or it is a non-empty string it all works. It seems we are assuming that "" (size 0 string) is null, so we are loosing track of the fact that it is a valid string. |
@abellina is this resolved? |
Almost. The cudf change is in, but we still need to re-enable the disabled tests. I'll be posting a PR shortly. |
Describe the bug
The failures occurred in integration tests, seems like related to recent cudf changes
join_test.py::test_struct_self_join[IGNORE_ORDER({'local': True})],
hash_aggregate_test.py::test_computation_in_grpby_columns[IGNORE_ORDER],
detailed log,
The text was updated successfully, but these errors were encountered: