[FEA] Support ScalarSubquery #1600

sperlingxx · 2021-01-27T03:40:29Z

Is your feature request related to a problem? Please describe.
ScalarSubquery, which turning the output of a (sub)query into a single scalar value, has yet supported under GPU. This feature is important for better performance.

Describe the solution you'd like
Implement GPU overrides for org.apack.spark.sql.execution.SubqueryExec and org.apack.spark.sql.execution.ScalarSubquery.

The text was updated successfully, but these errors were encountered:

revans2 · 2021-01-27T16:47:15Z

I am not sure we need to implement SubqueyExec for this use case. ScalarSubquery will run a query that produces a single result. One row of one column. It will run that sub-query and collect the result back to the driver.

https://github.com/apache/spark/blob/5718d64f3104f7a24a9d4b619748bcca03031c48/sql/core/src/main/scala/org/apache/spark/sql/execution/subquery.scala#L82-L96

But then it keeps the data cached in the expression and just returns that value as if it were a literal.

There is really almost no value at all in trying to make the various SubqueryExecBase implementations run on the GPU because it is returning just a single value. If it was more data then it might make since to do something like BroadcastExchangeExec so we can keep a large amount of the data columnar. Here the amount of data is likely small enough that there is no point.

sperlingxx added feature request New feature or request ? - Needs Triage Need team to review and classify labels Jan 27, 2021

sameerz assigned sperlingxx Jan 28, 2021

sameerz removed the ? - Needs Triage Need team to review and classify label Feb 2, 2021

sameerz added this to the Feb 1 - Feb 12 milestone Feb 2, 2021

sameerz linked a pull request Feb 2, 2021 that will close this issue

support GpuScalarSubquery #1639

Merged

sperlingxx closed this as completed in #1639 Feb 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Support ScalarSubquery #1600

[FEA] Support ScalarSubquery #1600

sperlingxx commented Jan 27, 2021

revans2 commented Jan 27, 2021

[FEA] Support ScalarSubquery #1600

[FEA] Support ScalarSubquery #1600

Comments

sperlingxx commented Jan 27, 2021

revans2 commented Jan 27, 2021