RAPIDS accelerated Spark Scala UDF support #1636

jlowe · 2021-01-29T23:59:17Z

Closes #1594.

This implements a GPU version of ScalaUDF, the expression Spark uses to represent Scala UDFs. Spark also uses this class for Java UDFs, but those are NOT supported by this change because, in that case, a lambda wrapper obscures the user's class. Support for Spark Java UDFs is tracked by #1635.
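For reviewers unfamiliar with the Java UDF limitation, here is a minimal, self-contained sketch of the effect. It does not use Spark or the plugin: `GpuCapable`, `JavaUdf1`, `PlusOne`, `wrap` and `looksAccelerated` are hypothetical names invented for illustration, and the method reference stands in for whatever wrapper the registration path applies. The point is only that once a UDF object is wrapped in a lambda, a type check against the wrapper no longer sees the interfaces of the user's class.

```java
import java.util.function.Function;

class LambdaWrapperDemo {
    // Hypothetical marker interface, standing in for "this UDF has a GPU implementation".
    interface GpuCapable {}

    // Hypothetical stand-in for a Java UDF interface with a single call method.
    interface JavaUdf1<T, R> {
        R call(T value);
    }

    // A user class that provides the row-wise logic and also carries the marker.
    static class PlusOne implements JavaUdf1<Integer, Integer>, GpuCapable {
        @Override
        public Integer call(Integer value) {
            return value + 1;
        }
    }

    // Mimics registration code that adapts the UDF to a generic function type.
    // The returned object is a new lambda instance, not the PlusOne instance.
    static Function<Integer, Integer> wrap(JavaUdf1<Integer, Integer> udf) {
        return udf::call;
    }

    // The kind of check a planner could use to detect an accelerated UDF.
    static boolean looksAccelerated(Object function) {
        return function instanceof GpuCapable;
    }

    public static void main(String[] args) {
        PlusOne udf = new PlusOne();
        Function<Integer, Integer> wrapped = wrap(udf);

        System.out.println(looksAccelerated(udf));     // true: the class is visible
        System.out.println(looksAccelerated(wrapped)); // false: only the lambda is visible
        System.out.println(wrapped.apply(41));         // 42: the row-wise logic still works
    }
}
```

The wrapped function still computes the right answer, so nothing breaks on the CPU path. What is lost is the ability to discover the extra interface by inspecting the function object, which is why the Java case needs separate handling.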

A working example of a RAPIDS accelerated Spark Scala UDF is also provided, which required adding the Scala version to the name of the udf-examples jar (and everywhere it was referenced). A unit test was added to exercise the example. Unlike the Hive UDF tests, it was not written as a Python test, because PySpark does not support registering Scala UDFs (it uses the Java UDF interface instead).

The RAPIDS accelerated UDF documentation has also been updated to reflect the new functionality.

Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe · 2021-01-29T23:59:51Z

build

integration_tests/pom.xml

Signed-off-by: Jason Lowe <jlowe@nvidia.com>

RAPIDS accelerated Spark Scala UDF support

d06722a

Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe added the "feature request" (New feature or request) and "SQL" (part of the SQL/Dataframe plugin) labels Jan 29, 2021

jlowe self-assigned this Jan 29, 2021

jlowe requested review from GaryShen2008, NvTimLiu, revans2 and tgravescs as code owners January 29, 2021 23:59

tgravescs approved these changes Feb 1, 2021

integration_tests/pom.xml (resolved)

jlowe mentioned this pull request Feb 1, 2021

Update CI builds for new udf-examples jar name #1640

Closed

jlowe added this to the Feb 1 - Feb 12 milestone Feb 1, 2021

revans2 approved these changes Feb 1, 2021

jlowe merged commit d6be108 into NVIDIA:branch-0.4 Feb 1, 2021

jlowe mentioned this pull request Feb 11, 2021

[BUG] Scala UDF compiler can decompile UDFs with RAPIDS implementation #1712

Closed

nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021

RAPIDS accelerated Spark Scala UDF support (NVIDIA#1636)

67d842d

Signed-off-by: Jason Lowe <jlowe@nvidia.com>

nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021

RAPIDS accelerated Spark Scala UDF support (NVIDIA#1636)

4ee4b9a

Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe deleted the scala-udf branch September 10, 2021 15:41
