Add an example RAPIDS-accelerated Hive UDF using native code #1472

jlowe · 2021-01-07T21:36:25Z

This adds an example of a Hive UDF that leverages native CUDA code to implement a RAPIDS-accelerated version. A Dockerfile is provided to show how to set up an environment for compiling the native UDF code against libcudf and the versions of cub, thrust, and libcudacxx used by libcudf.

An integration test is provided, but the user must supply the --rapids_udf_example_native flag in order to enable the test, as most nightly builds will not bother to perform the extra steps necessary to build the native code required by the new example.
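For readers unfamiliar with opt-in pytest flags, the gating described above could look roughly like the sketch below. It relies on pytest's standard `pytest_addoption`, `pytest_configure`, and `pytest_collection_modifyitems` hooks. The marker name `rapids_udf_example_native` and the helper `native_udf_tests_enabled` are assumptions made for illustration; this is not necessarily how this PR wires up the flag.

```python
# conftest.py (illustrative sketch, not the code from this PR)

FLAG = "--rapids_udf_example_native"
# Hypothetical marker name used to tag tests that need the native library.
MARKER = "rapids_udf_example_native"


def pytest_addoption(parser):
    """Register the opt-in flag; it stays off unless passed on the command line."""
    parser.addoption(
        FLAG,
        action="store_true",
        default=False,
        help="run tests that require the native UDF example to be built",
    )


def pytest_configure(config):
    """Declare the marker so pytest does not warn about an unknown mark."""
    config.addinivalue_line(
        "markers", f"{MARKER}: test requires the native UDF example library"
    )


def native_udf_tests_enabled(config):
    """Hypothetical helper: True when the user passed the opt-in flag."""
    return bool(config.getoption(FLAG))


def pytest_collection_modifyitems(config, items):
    """Skip every test carrying MARKER unless the flag was supplied."""
    if native_udf_tests_enabled(config):
        return
    import pytest  # only needed when there is something to skip

    skip_native = pytest.mark.skip(reason=f"need {FLAG} option to run")
    for item in items:
        if MARKER in item.keywords:
            item.add_marker(skip_native)
```

Under these assumptions, a test tagged with `@pytest.mark.rapids_udf_example_native` would be reported as skipped in a default run and would execute only when pytest is invoked with `--rapids_udf_example_native`.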

Documentation covering this native example and the implementation of RAPIDS-accelerated Hive UDFs in general will be provided in #1354.

Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe · 2021-01-07T21:37:02Z

build

tgravescs

Overall looks fine to me, but it would be good to get someone more familiar with the C++/CUDA side to review.
Do we want to put the cuda version in the name of the jar or library?
Do we want to add a README here to explain using the Dockerfile and the udf-native-examples profile, or is that in the other PR?

tgravescs · 2021-01-08T19:04:38Z

oops never mind, you already said docs later

jlowe · 2021-01-08T19:20:17Z

> Do we want to put the cuda version in the name of the jar or library?

The native code statically links the CUDA runtime, so we don't need to put the CUDA version in the jar name.

Add an example RAPIDS-accelerated Hive UDF using native code

01d9d18

Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe added this to the Jan 4 - Jan 15 milestone Jan 7, 2021

jlowe self-assigned this Jan 7, 2021

jlowe requested review from GaryShen2008, NvTimLiu, revans2 and tgravescs as code owners January 7, 2021 21:36

This was referenced Jan 8, 2021

Documentation for RAPIDS-accelerated Hive UDFs #1478

Merged

[FEA] Execute UDFs that provide a RAPIDS execution path #1351

Closed

tgravescs approved these changes Jan 8, 2021

sameerz added the task Work required that improves the product but is not user facing label Jan 11, 2021

jlowe merged commit 1f73c88 into NVIDIA:branch-0.4 Jan 11, 2021

nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021

Add an example RAPIDS-accelerated Hive UDF using native code (NVIDIA#…

f90233c

…1472) Signed-off-by: Jason Lowe <jlowe@nvidia.com>

nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021

Add an example RAPIDS-accelerated Hive UDF using native code (NVIDIA#…

935a1c4

…1472) Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe deleted the hive-native-udf branch September 10, 2021 15:41

tgravescs pushed a commit to tgravescs/spark-rapids that referenced this pull request Nov 30, 2023

Update submodule cudf to 3964950ba2fecf7f962917276058a6381d396246 (NV…

e8219ce

…IDIA#1472) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
