Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] count reductions are failing on databricks because lack for Complete support #898

Closed
revans2 opened this issue Oct 1, 2020 · 5 comments · Fixed by #1217
Closed
Assignees
Labels
bug Something isn't working P0 Must have for release

Comments

@revans2
Copy link
Collaborator

revans2 commented Oct 1, 2020

Describe the bug
In databricks the tests for test_generic_reductions are failing. I traced this down to databricks using the Complete aggregation type, which our code does not properly support.

Steps/Code to reproduce bug
Run test_generic_reductions on databricks with the latest branch-0.3 code.

Expected behavior
Test should pass

This looks like it should be somewhat simple to fix in a stop gap, but I am not 100% sure that we are doing the right thing in all cases with this. Ideally we check all possible aggregation types instead of just checking for a single one, but that may be difficult.

The reason this works for count(*) is because it is turned into count(1) and we do a SUM reduction on it, so it comes out the same.

@revans2 revans2 added bug Something isn't working P0 Must have for release labels Oct 1, 2020
@kuhushukla
Copy link
Collaborator

I know that Databricks was doing something different with inserting Complete aggregates that @tgravescs put up a PR for in the past. I can try and look at this next week if @revans2 and @tgravescs think it is a good idea.

@revans2
Copy link
Collaborator Author

revans2 commented Oct 1, 2020

@abellina also had a look at it a little, but yes @kuhushukla if you want to take lead on fixing it that would be great. @tgravescs should we mark the tests as xfail for now? The aggregations should work fine if we don't coalesce the data to a single partition. I am doing that in the test so first and last get a deterministic result, but it should not be that common in the real world.

Because of that I really want us to fix this correctly and not just patch it. We should have mode be checked completely everywhere so we know 100% if we support that mode or not for reductions as well as aggregations.

@kuhushukla
Copy link
Collaborator

Thanks, will look into it and update in the coming days. +1 on xfailing until then.

@tgravescs
Copy link
Collaborator

yes I'm fine with xfailing since this is marked p1 for 0.3

@revans2
Copy link
Collaborator Author

revans2 commented Oct 1, 2020

I put up a patch to mark them as xfail here #899

I have not tested it on databricks yet, but I will soon.

@sameerz sameerz added this to the Oct 26 - Nov 6 milestone Oct 23, 2020
@sameerz sameerz modified the milestones: Oct 26 - Nov 6, Nov 9 - Nov 20 Nov 6, 2020
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this issue Nov 30, 2023
Signed-off-by: Peixin Li <pxli@nyu.edu>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working P0 Must have for release
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants