
Use cudf to compute exact hash join output row sizes #3288

Merged: 1 commit into NVIDIA:branch-21.10 on Aug 27, 2021

Conversation

@jlowe (Member) commented on Aug 24, 2021

Fixes #2440

Removes the hash join code that estimates the join output size and replaces it with the cudf join output size APIs. This also removes the OOM catch-and-retry logic, since in theory it is no longer necessary now that we produce an exact size instead of an estimate.

Posting as a draft for the following reasons:

Signed-off-by: Jason Lowe <jlowe@nvidia.com>
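For context, a minimal sketch of the pattern this change moves to, written in Scala against the cudf Java bindings. The method names here (`innerJoinRowCount`, `innerJoinGatherMaps`) follow the cudf Java API of this era, but treat the exact signatures as assumptions; this is a simplified illustration, not the actual plugin code.

```scala
import ai.rapids.cudf.{GatherMap, Table}

// Sketch only: ask cudf for the exact join output row count first, then
// materialize the gather maps. Because the size is exact rather than an
// estimate, no OOM catch-and-retry loop around the join is needed.
def exactInnerJoin(
    leftKeys: Table,
    rightKeys: Table,
    compareNullsEqual: Boolean): Array[GatherMap] = {
  // Exact output row count from cudf, not an estimate (assumed method name).
  val outputRows: Long = leftKeys.innerJoinRowCount(rightKeys, compareNullsEqual)
  // A real implementation could use outputRows to decide whether to split
  // the batch before materializing the join output.
  leftKeys.innerJoinGatherMaps(rightKeys, compareNullsEqual)
}
```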
@jlowe jlowe added the cudf_dependency An issue or PR with this label depends on a new feature in cudf label Aug 24, 2021
@jlowe jlowe added this to the Aug 16 - Aug 27 milestone Aug 24, 2021
@jlowe jlowe self-assigned this Aug 24, 2021
@revans2 (Collaborator) left a comment


From the look of things it appears that the builtHash does not outlive a single batch being returned. The rest of the changes look good.

@jlowe (Member, Author) commented on Aug 24, 2021

From the look of things it appears that the builtHash does not outlive a single batch being returned.

Yeah, the built hash isn't spillable, and the code tries to make everything spillable before it produces a batch. Therefore I thought it prudent to throw away the built hash to avoid a potential OOM.
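A hypothetical sketch of the lifetime rule being described here; the names (`builtHash`, `buildJoinBatch`, `yieldBatch`) are illustrative only and do not correspond to the actual plugin code:

```scala
import org.apache.spark.sql.vectorized.ColumnarBatch

// Illustrative only: the built hash table is not spillable, so release it
// before the output batch is handed downstream, where the rest of the
// state has already been made spillable.
def yieldBatch(builtHash: AutoCloseable)(buildJoinBatch: => ColumnarBatch): ColumnarBatch = {
  try {
    buildJoinBatch // produce the join output while the hash table is still alive
  } finally {
    builtHash.close() // drop the non-spillable hash table before returning
  }
}
```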

@jlowe (Member, Author) commented on Aug 27, 2021

build

@jlowe (Member, Author) commented on Aug 27, 2021

I ran Q75, which has quite a few joins in it, at SF=100 on my local desktop. It's in the ballpark of 5% faster: around 33 seconds before and 31.5 seconds after. I verified with an Nsight Systems trace that the hash table is being reused between the output size and join calculations for an individual batch.
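One way to express the reuse the trace confirms is with a stateful built hash table, sketched below using the `HashJoin` class that appeared in the cudf Java bindings around this period. Its availability at the time of this PR and the exact signatures are assumptions; the code is a simplified illustration, not the plugin implementation.

```scala
import ai.rapids.cudf.{GatherMap, HashJoin, Table}

// Sketch: build the hash table once per build-side batch and reuse it for
// both the exact-size pass and the gather-map pass.
def joinReusingHash(probeKeys: Table, buildKeys: Table): Array[GatherMap] = {
  val builtHash = new HashJoin(buildKeys, true) // compareNullsEqual = true
  try {
    val outputRows = probeKeys.innerJoinRowCount(builtHash) // size pass
    probeKeys.innerJoinGatherMaps(builtHash, outputRows)    // join pass reuses the hash
  } finally {
    builtHash.close() // per the discussion above: not spillable, so free it promptly
  }
}
```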

@jlowe jlowe marked this pull request as ready for review August 27, 2021 15:34
@jlowe jlowe merged commit 25bad3d into NVIDIA:branch-21.10 Aug 27, 2021
@jlowe jlowe deleted the hash-join-output-size branch September 10, 2021 15:37
abellina added a commit to abellina/spark-rapids that referenced this pull request Sep 23, 2021
jlowe added a commit to jlowe/spark-rapids that referenced this pull request Sep 24, 2021
jlowe added a commit to jlowe/spark-rapids that referenced this pull request Sep 24, 2021
…3288)"

This reverts commit 25bad3d.

Signed-off-by: Jason Lowe <jlowe@nvidia.com>
tgravescs pushed a commit that referenced this pull request Sep 24, 2021
…#3657)

This reverts commit 25bad3d.

Signed-off-by: Jason Lowe <jlowe@nvidia.com>
Labels
cudf_dependency An issue or PR with this label depends on a new feature in cudf
Development

Successfully merging this pull request may close these issues.

[FEA] Use CUDF API for getting join output size
2 participants