
Adds pre/post steps for merge and update aggregate #3417

Merged: 16 commits merged into NVIDIA:branch-21.10 on Sep 14, 2021

Conversation

@abellina (Collaborator) commented Sep 9, 2021

This PR is a prequel/continuation of #3373.

This PR pulls in part of the work by @ttnghia; I added it to make sense of the extra processing applied to the aggregate function expressions, and to be able to test it all.

  • The refactor adds a pre/post step to updates and merges. An example of a "pre merge" step is casting, or creating a struct, as needed by MERGE_M2. The "pre update" case is not overloaded, so it is just the attribute reference as-is (a pass-through projection). The "post update" step can be used to cast after the update (as is done in GpuM2), and the "post merge" step runs after the merge, where a struct is decomposed and its fields are cast, as expected by Spark. See the sketch after this list.

  • These steps allow one set of casts to be removed from the grouped aggregates in aggregates.scala. I did not touch reduction aggregates in this PR; I can do that next, as it was not required for the stddev_pop work.

  • An implementation of stddev_pop (untested, other than some quick examples in a shell) is adapted from #3373 (Support stddev and variance aggregation families [databricks]) to demonstrate how the buffers are put together to produce the final result (sqrt(M2/n)).
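
To make the pre/post step idea concrete, here is a minimal sketch using plain Spark Catalyst expressions. It is not the spark-rapids code from this PR; the buffer attributes `n` and `m2`, the struct layout, and the types are illustrative assumptions only.

```scala
import org.apache.spark.sql.catalyst.expressions._
import org.apache.spark.sql.types._

object PrePostStepSketch {
  // Hypothetical aggregation buffer attributes for an M2-style aggregate.
  val n: AttributeReference  = AttributeReference("n", DoubleType, nullable = false)()
  val m2: AttributeReference = AttributeReference("m2", DoubleType, nullable = true)()

  // "pre update": nothing special is needed, so it is just the attribute
  // references as-is (a pass-through projection).
  val preUpdate: Seq[Expression] = Seq(n, m2)

  // "post update": cast the update results to the buffer types Spark expects
  // (trivial here, since the sketch already uses doubles).
  val postUpdate: Seq[Expression] = Seq(Cast(n, DoubleType), Cast(m2, DoubleType))

  // "pre merge": pack the partial buffers into a single struct column, the kind
  // of shape a merge aggregation such as MERGE_M2 could consume (layout made up).
  val preMerge: Seq[Expression] = Seq(
    CreateNamedStruct(Seq(Literal("n"), n, Literal("m2"), m2)))

  // "post merge": decompose the merged struct back into flat, correctly typed columns.
  val mergedStruct: AttributeReference = AttributeReference("merged",
    StructType(Seq(StructField("n", DoubleType), StructField("m2", DoubleType))),
    nullable = true)()
  val postMerge: Seq[Expression] = Seq(
    Cast(GetStructField(mergedStruct, 0, Some("n")), DoubleType),
    Cast(GetStructField(mergedStruct, 1, Some("m2")), DoubleType))

  // Final evaluation for stddev_pop: sqrt(M2 / n).
  val evaluate: Expression = Sqrt(Divide(m2, n))
}
```

The idea, as described above, is that the existing grouped-aggregate code runs the "pre" projection before the cuDF update/merge aggregation and the "post" projection after it, which is what allows the aggregate-specific casts to be pulled out of aggregates.scala.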

The code here really needs testing; as such, no GpuOverrides node is added in this PR. In other words, the code is there, but it is not being actively used. The two new projections for the pre/post steps are executed by existing aggregates.

I tested the diffs with the integration tests locally and on Databricks 8.2. I have not run on Databricks 7.3 yet, but I wanted to get this up to get some 👀. Note that on Databricks 8.2 I am noticing other issues with the tests (as did @revans2), especially when we run with the parallel setting. Tests were failing due to some unrelated bugs, so I'll re-run the tests and comment here tomorrow; we'll need some follow-ups for that.

abellina and others added 3 commits September 8, 2021 19:55
Signed-off-by: Alessandro Bellina <abellina@nvidia.com>
Co-authored-by: Nghia Truong <nghiatruong.vn@gmail.com>
@abellina (Collaborator, Author) commented Sep 9, 2021

build

@@ -995,7 +1019,7 @@ abstract class GpuBaseAggregateMeta[INPUT <: SparkPlan](
   override def convertToGpu(): GpuExec = {
     GpuHashAggregateExec(
       requiredChildDistributionExpressions.map(_.map(_.convertToGpu())),
-      groupingExpressions.map(_.convertToGpu()),
+      groupingExpressions.map(_.convertToGpu()).asInstanceOf[Seq[NamedExpression]],
Collaborator review comment:

Because of type erasure (how Java does generics), the sequence could contain things that are not NamedExpressions and the cast would happily pass. I would prefer to have us do something more like:

groupingExpressions.map(_.convertToGpu().asInstanceOf[NamedExpression])
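
A self-contained sketch of why that matters (the toy `Expr`/`Named` types below are stand-ins, not the Spark classes): casting the whole `Seq` is unchecked at runtime because the element type is erased, while casting each element fails fast on the first bad one.

```scala
object ErasureSketch {
  sealed trait Expr
  case class Named(name: String) extends Expr
  case class Lit(value: Int) extends Expr

  def main(args: Array[String]): Unit = {
    val exprs: Seq[Expr] = Seq(Named("a"), Lit(1))

    // Compiles and "succeeds": Seq[Named] and Seq[Expr] are the same class at
    // runtime, so the bad element only blows up later, far from the cause.
    val unchecked: Seq[Named] = exprs.asInstanceOf[Seq[Named]]
    println(unchecked.size) // prints 2, no error yet

    // Throws ClassCastException right here on Lit(1), which is the point of
    // the suggested per-element cast.
    val checked: Seq[Named] = exprs.map(_.asInstanceOf[Named])
    println(checked)
  }
}
```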

@abellina marked this pull request as ready for review on September 10, 2021 at 16:13
@sameerz added the 'task' label (work required that improves the product but is not user facing) on Sep 10, 2021
@abellina (Collaborator, Author):

I see tests passing on Databricks 7.3 and 8.2, and locally. I found a leak and addressed it with a626e65.

@abellina (Collaborator, Author):

build

@revans2 previously approved these changes Sep 14, 2021
@ttnghia (Collaborator) commented Sep 14, 2021

It seems that we have enough material to merge this PR. @abellina, before merging this please remove the standard deviation stuff (CudfM2, CudfMergeM2, GpuM2, GpuStddevPop), as it will be reworked and reviewed in a separate PR.

@abellina (Collaborator, Author) commented Sep 14, 2021

> It seems that we have enough material to merge this PR. @abellina, before merging this please remove the standard deviation stuff (CudfM2, CudfMergeM2, GpuM2, GpuStddevPop), as it will be reworked and reviewed in a separate PR.

I started doing this here: d4807c1, but on re-reviewing my code I saw that I had comments around this, because some of the changes are only for the M2 aggregates. Do you want me to remove those comments as well, or rework them so they don't refer to M2?

At this point, you could let this go in with the M2 reference implementation and change what you need to change, or just take the branch and do something on your own.

Or merge with the comments that point to future aggregates, if you are going to put your patch up soon.

@abellina (Collaborator, Author):

build

@ttnghia (Collaborator) commented Sep 14, 2021

>> It seems that we have enough material to merge this PR. @abellina, before merging this please remove the standard deviation stuff (CudfM2, CudfMergeM2, GpuM2, GpuStddevPop), as it will be reworked and reviewed in a separate PR.
>
> I started doing this here: d4807c1, but on re-reviewing my code I saw that I had comments around this, because some of the changes are only for the M2 aggregates. Do you want me to remove those comments as well, or rework them so they don't refer to M2?
>
> At this point, you could let this go in with the M2 reference implementation and change what you need to change, or just take the branch and do something on your own.
>
> Or merge with the comments that point to future aggregates, if you are going to put your patch up soon.

Thanks, I just merged in the code you removed and will continue working on it.

@abellina merged commit 4ae2aea into NVIDIA:branch-21.10 on Sep 14, 2021