[BUG] count reductions are failing on Databricks because of a lack of Complete support #898

revans2 · 2020-10-01T16:57:59Z

Describe the bug
On Databricks, the tests in test_generic_reductions are failing. I traced this down to Databricks using the Complete aggregation mode, which our code does not properly support.

Steps/Code to reproduce bug
Run test_generic_reductions on Databricks with the latest branch-0.3 code.

Expected behavior
The test should pass.

This looks like it should be fairly simple to fix as a stopgap, but I am not 100% sure that we are doing the right thing in all cases. Ideally we would check all possible aggregation modes instead of checking for just a single one, but that may be difficult.

This works for count(*) because it is turned into count(1) and we do a SUM reduction on it, so the result comes out the same.
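To illustrate the count(*) observation, here is a minimal plain-Python sketch. It assumes that count has an update step (count the non-null inputs) and a merge step (sum the partial counts), and that the failure comes from the merge step being applied to raw input. The function names are hypothetical and are not the plugin's API.

```python
# Hypothetical model of a count aggregate; not the plugin's real code.

def count_update(values):
    """Update step: count the non-null raw input values."""
    return sum(1 for v in values if v is not None)

def count_merge(partials):
    """Merge step: add up partial counts produced by earlier update steps."""
    return sum(partials)

column = [10, None, 30, 40]
ones = [1] * len(column)  # count(*) rewritten as count(1): a literal 1 per row

# Correct handling of raw input: use the update step.
print(count_update(column))  # 3

# Assumed bug: the merge step (a SUM) is applied to raw input.
# Nulls are skipped here the way a SUM would skip them.
print(count_merge(v for v in column if v is not None))  # 80, not a count

# For count(1), summing the literal 1s happens to equal the row count,
# so the mistake is invisible.
print(count_merge(ones), count_update(ones))  # 4 4
```

Under these assumptions, count(1) hides the problem because a sum of ones equals the number of rows, while a count over a real column does not survive the same treatment.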

kuhushukla · 2020-10-01T17:03:46Z

I know that Databricks was doing something different with inserting Complete aggregates that @tgravescs put up a PR for in the past. I can try and look at this next week if @revans2 and @tgravescs think it is a good idea.

revans2 · 2020-10-01T17:41:45Z

@abellina also looked at it a little, but yes @kuhushukla, if you want to take the lead on fixing it that would be great. @tgravescs should we mark the tests as xfail for now? The aggregations should work fine if we don't coalesce the data to a single partition. I am coalescing in the test so that first and last get a deterministic result, but that should not be very common in the real world.

Because of that, I really want us to fix this correctly and not just patch it. We should check the aggregation mode exhaustively everywhere, so we know 100% whether or not we support a given mode for reductions as well as for aggregations.
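As a rough sketch of what checking every mode could look like, the following plain-Python model lists Spark's four aggregate modes (Partial, PartialMerge, Final, Complete) and maps each one to the step that should consume its input. The `AggregateMode` enum and `input_step` helper here are illustrative stand-ins, not the plugin's actual classes.

```python
from enum import Enum

class AggregateMode(Enum):
    """Illustrative stand-in for Spark's aggregate modes."""
    PARTIAL = "Partial"              # raw input -> aggregation buffer
    PARTIAL_MERGE = "PartialMerge"   # buffers -> aggregation buffer
    FINAL = "Final"                  # buffers -> final result
    COMPLETE = "Complete"            # raw input -> final result

# Every mode is listed explicitly, so a missing or new mode fails loudly
# instead of silently falling through to a default.
_INPUT_STEP = {
    AggregateMode.PARTIAL: "update",
    AggregateMode.COMPLETE: "update",
    AggregateMode.PARTIAL_MERGE: "merge",
    AggregateMode.FINAL: "merge",
}

def input_step(mode):
    """Return which step ("update" or "merge") should process the input."""
    try:
        return _INPUT_STEP[mode]
    except (KeyError, TypeError):
        raise ValueError(f"Unsupported aggregate mode: {mode!r}")

for mode in AggregateMode:
    print(mode.value, "->", input_step(mode))
```

The point of the explicit table is that Complete is handled on purpose (it reads raw input, so it needs the update step) rather than by accident, and anything unrecognized raises an error.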

kuhushukla · 2020-10-01T17:56:45Z

Thanks, will look into it and update in the coming days. +1 on xfailing until then.

tgravescs · 2020-10-01T18:14:59Z

Yes, I'm fine with xfailing since this is marked p1 for 0.3.

revans2 · 2020-10-01T18:55:47Z

I put up a patch to mark them as xfail here #899

I have not tested it on Databricks yet, but I will soon.

revans2 added the bug (Something isn't working) and P0 (Must have for release) labels Oct 1, 2020

sameerz assigned kuhushukla Oct 9, 2020

sameerz added this to the Oct 26 - Nov 6 milestone Oct 23, 2020

sameerz modified the milestones: Oct 26 - Nov 6, Nov 9 - Nov 20 Nov 6, 2020

sameerz modified the milestones: Nov 9 - Nov 20, Nov 23 - Dec 4 Nov 23, 2020

kuhushukla mentioned this issue Nov 30, 2020

Aggregate reductions in Complete mode should use updateExpressions #1217

Merged

kuhushukla closed this as completed in #1217 Nov 30, 2020

tgravescs pushed a commit to tgravescs/spark-rapids that referenced this issue Nov 30, 2023

Enable automerge 23.02 to 23.04 (NVIDIA#898)

ea13b86

Signed-off-by: Peixin Li <pxli@nyu.edu>
