Make shuffle run on CPU if we do a join where we read from bucketed table #785
Conversation
Signed-off-by: Thomas Graves <tgraves@nvidia.com>
build
I am going to file an issue to add some more bucketing tests. We should have an AQE one, for example, and probably some other datasources.
    false :: Nil
  }
case _ =>
  childPlans.flatMap(_.findBucketedReads())
This code won't recurse down into query stages that have already executed (because query stages are leaf nodes), but I think that's ok since we would already have tagged the initial plan.
The goal is to find the next closest shuffle or read. If we encounter a shuffle before we hit a read, that's fine, because the other code path should handle it. We only need to worry about an input in the same stage.
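The traversal described above can be sketched as follows. This is a hypothetical Python illustration, not the actual Scala implementation; the `PlanNode` class and node kinds are invented stand-ins for Spark's plan nodes. The key idea is that the walk stops at shuffle boundaries, since anything below a shuffle belongs to a different stage and is handled by the other code path.

```python
# Illustrative sketch of "find the nearest bucketed read without crossing a
# shuffle". PlanNode and the string kinds are hypothetical stand-ins for
# Spark's SparkPlan nodes.

class PlanNode:
    def __init__(self, kind, children=(), bucketed=False):
        self.kind = kind              # e.g. "scan", "shuffle", "join", "filter"
        self.children = list(children)
        self.bucketed = bucketed      # only meaningful for scans

def find_bucketed_reads(node):
    """True if a bucketed read is reachable without crossing a shuffle."""
    if node.kind == "scan":
        return node.bucketed
    if node.kind == "shuffle":
        # A shuffle repartitions the data, so anything below it is in a
        # different stage; the other code path handles that case.
        return False
    return any(find_bucketed_reads(c) for c in node.children)

# Example: a join whose left side is a bucketed scan in the same stage, and
# whose right side reads through a shuffle.
left = PlanNode("scan", bucketed=True)
right = PlanNode("shuffle", [PlanNode("scan", bucketed=False)])
join = PlanNode("join", [left, right])
print(find_bucketed_reads(join))  # True: the bucketed scan is in this stage
```

Note that a bucketed scan hidden below a shuffle would return `False` here, which matches the comment above: once a shuffle sits between the join and the read, the partitioning comes from the shuffle, not the bucketing.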
// then we need to make sure that all of them run on the CPU instead
if (bucketedReads || !shuffleExchanges.forall(canThisBeReplaced)) {
  val errMsg = if (bucketedReads) {
    "can't support shuffle on the GPU with bucketed reads!"
nit: It might be good to explain a little more that this is related to a join. But this is really just a nit.
Updated
Signed-off-by: Thomas Graves <tgraves@nvidia.com>
build
This one is targeting branch-0.3, should this be added to 0.2 too?
@tgravescs should this be targeted to branch-0.2?
oops it should be against 0.2
build
…able (NVIDIA#785) * Make shuffle run on CPU if we do a join where we read from bucketed table Signed-off-by: Thomas Graves <tgraves@nvidia.com>
…IDIA#785) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
closes #780
The issue is that when a join reads from two datasources and one of them is bucketed, the output partitioning does not match: the bucketed side is hashed on the CPU, while the other side, if it has a shuffle, is hashed and partitioned on the GPU. The two hash functions differ, so matching keys do not land in the same partition and we drop data.
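The mismatch can be shown with a toy simulation. This is illustrative only: the two hash functions below are arbitrary stand-ins for the CPU-side bucketing hash and the GPU-side shuffle hash, not the hashes Spark or the plugin actually use. The point is just that two different hash functions route the same key to different partitions, so matching rows never meet in the join.

```python
# Illustrative only: simulate why mixing two different hash partitioners
# silently drops rows from a co-partitioned join. crc32 and plain modulo
# are arbitrary stand-ins for the CPU bucketing hash and the GPU shuffle
# hash; the real hashes differ, which is the bug being fixed.
import zlib

NUM_PARTITIONS = 4

def cpu_hash(key):
    # stand-in for the CPU-side bucketing hash
    return zlib.crc32(str(key).encode()) % NUM_PARTITIONS

def gpu_hash(key):
    # stand-in for the GPU-side shuffle hash
    return key % NUM_PARTITIONS

keys = list(range(100))
# A row with key k on the bucketed side lands in partition cpu_hash(k);
# its match on the shuffled side lands in gpu_hash(k). If those differ,
# the two rows are never in the same partition and the join drops them.
mismatched = [k for k in keys if cpu_hash(k) != gpu_hash(k)]
print(f"{len(mismatched)} of {len(keys)} keys land in different partitions")
```

Forcing both sides through the same (CPU) hash restores the invariant that equal keys share a partition, which is exactly what this PR does for joins over bucketed reads.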
The fix is that if we find any joins that read from a bucketed table, we make sure any shuffles feeding that join happen on the CPU side.