fixUpJoinConsistency rule now works when AQE is enabled #676

andygrove · 2020-09-06T21:05:36Z

GpuOverrides applies a fixUpJoinConsistency rule to ensure that the inputs to a SortMergeJoin or ShuffledHashJoin are either both on CPU, or both on GPU. We can't support a mix of CPU and GPU because the hashing algorithms are not compatible, and therefore the join would produce incorrect results.

This PR adds a unit test for this rule, and also ensures that the rule is applied when AQE is enabled.
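To illustrate the rule described above, here is a minimal toy sketch. The `Device`, `Exchange`, and `Join` types and the `fixUp` helper are hypothetical and are not the plugin's actual classes. Falling back to CPU when the two sides disagree is an assumption made for illustration; the description only requires that both inputs end up on the same side.

```scala
// Toy model only: these types are hypothetical, not the spark-rapids classes.
sealed trait Device
case object Cpu extends Device
case object Gpu extends Device

// A shuffle exchange feeding one side of a join, tagged with where it runs.
final case class Exchange(name: String, device: Device)

// A shuffled join (SortMergeJoin or ShuffledHashJoin) with its two inputs.
final case class Join(left: Exchange, right: Exchange)

object JoinConsistencySketch {
  // If the two inputs disagree, move both to CPU (assumed fallback direction)
  // so that both sides are partitioned with the same hashing algorithm.
  def fixUp(join: Join): Join =
    if (join.left.device == join.right.device) join
    else Join(join.left.copy(device = Cpu), join.right.copy(device = Cpu))
}
```

With this sketch, `JoinConsistencySketch.fixUp(Join(Exchange("l", Gpu), Exchange("r", Cpu)))` yields a join whose two exchanges are both on `Cpu`, while a join that is already consistent is returned unchanged.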

This closes #631

Signed-off-by: Andy Grove <andygrove@nvidia.com>

abellina

Just had one question

abellina · 2020-09-07T00:48:37Z

sql-plugin/src/main/scala/com/nvidia/spark/rapids/RapidsMeta.scala

-      // since they already started to execute
+      // since they already started to execute, but we verify that they are both on CPU or
+      // both on GPU
+      if (queryStages.map(isGpuQueryStage).distinct.size == 2) {


So, for my edification: we can't get into this situation unless there is a bug, or perhaps a change in behavior in the AQE rules proper?

I think we could only get here due to a bug when planning query stages.
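For readers following along, the condition quoted in the diff can be sketched in isolation. The `QueryStage` case class and its `onGpu` flag are hypothetical stand-ins for illustration; only the `queryStages.map(isGpuQueryStage).distinct.size == 2` expression comes from the diff.

```scala
// Hypothetical stand-in for an AQE query stage; not the Spark class.
final case class QueryStage(id: Int, onGpu: Boolean)

object QueryStageCheckSketch {
  // Assumed predicate: reports whether a stage was planned for the GPU.
  def isGpuQueryStage(stage: QueryStage): Boolean = stage.onGpu

  // Same shape as the condition in the diff: mapping to booleans and taking
  // distinct values gives size 2 only when both true and false are present,
  // i.e. when the stages are a mix of CPU and GPU.
  def hasMixedStages(queryStages: Seq[QueryStage]): Boolean =
    queryStages.map(isGpuQueryStage).distinct.size == 2
}
```

Because query stages have already started to execute, a check like this can only detect the mix and report it; it cannot move a stage to the other side.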

Signed-off-by: Andy Grove <andygrove@nvidia.com>

andygrove · 2020-09-07T14:49:49Z

build

Signed-off-by: Andy Grove <andygrove@nvidia.com>

andygrove · 2020-09-08T14:27:47Z

build

andygrove · 2020-09-08T14:28:46Z

@abellina I was able to greatly simplify the logic in this PR. Please take a look when you can.

* fixUpJoinConsistency rule now works with AQE
  Signed-off-by: Andy Grove <andygrove@nvidia.com>
* Add comma to error message
  Signed-off-by: Andy Grove <andygrove@nvidia.com>
* Improved validation checks and error messages
  Signed-off-by: Andy Grove <andygrove@nvidia.com>
* bug fix: walk tree once to find shuffle exchanges and query stages
  Signed-off-by: Andy Grove <andygrove@nvidia.com>
* code simplification
  Signed-off-by: Andy Grove <andygrove@nvidia.com>

* first pass at a benchmark. Float only for now.
* signoff
  Signed-off-by: Mike Wilson <knobby@burntsheep.com>

Signed-off-by: Mike Wilson <knobby@burntsheep.com>

andygrove added 2 commits September 6, 2020 15:01

fixUpJoinConsistency rule now works with AQE

4f8ee04

Signed-off-by: Andy Grove <andygrove@nvidia.com>

Add comma to error message

d6b93cb

Signed-off-by: Andy Grove <andygrove@nvidia.com>

andygrove added this to the Aug 31 - Sep 11 milestone Sep 6, 2020

andygrove self-assigned this Sep 6, 2020

andygrove changed the title ~~[WIP] fixUpJoinConsistency rule now works when AQE is enabled~~ fixUpJoinConsistency rule now works when AQE is enabled Sep 6, 2020

andygrove mentioned this pull request Sep 6, 2020

Enable UCX + AQE #613

Merged

andygrove changed the title ~~fixUpJoinConsistency rule now works when AQE is enabled~~ [WIP] fixUpJoinConsistency rule now works when AQE is enabled Sep 6, 2020

Improved validation checks and error messages

dd6a5be

Signed-off-by: Andy Grove <andygrove@nvidia.com>

andygrove changed the title ~~[WIP] fixUpJoinConsistency rule now works when AQE is enabled~~ fixUpJoinConsistency rule now works when AQE is enabled Sep 6, 2020

abellina reviewed Sep 7, 2020

bug fix: walk tree once to find shuffle exchanges and query stages

c227dfc

Signed-off-by: Andy Grove <andygrove@nvidia.com>

code simplification

260067b

Signed-off-by: Andy Grove <andygrove@nvidia.com>

sameerz added the test Only impacts tests label Sep 8, 2020

revans2 approved these changes Sep 8, 2020

andygrove added the bug Something isn't working label Sep 8, 2020

andygrove merged commit 221e1c5 into NVIDIA:branch-0.2 Sep 8, 2020

andygrove deleted the fix-up-joins branch September 8, 2020 18:56
