-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG][Databricks 12.2] GpuRowBasedHiveGenericUDF ClassCastException #8318
Comments
This is the key part of the stack trace:
This appears to be a bug in Databricks' Spark code. |
After more experimentation, I have a fix for the issue, but it changes the semantics of how Hive generic UDFs work. In deferredObjects(i).set(() => childRowAccessors(idx)(childrenRow)) In open-source Spark, when the UDF calls I can get the test passing on DBR 12.2 if I change the code to just set the value rather than providing a function: deferredObjects(i).set(childRowAccessors(idx)(childrenRow)) If I try the same code change with open-source Spark, it fails to compile, further confirming that this is a change in functionality in Databricks 12.2
|
@firestarman fyi |
Spark master (3.5.0) has the same new behavior, introduced in apache/spark#39555 |
Describe the bug
#8282 adds support for Databricks 12.2 but skips the test
row-based_udf_test.py::test_hive_empty_generic_udf
, which needs further investigation.Steps/Code to reproduce bug
Run the test on DB 12.2
Expected behavior
Should fall back to CPU or pass.
Environment details (please complete the following information)
N/A
Additional context
The text was updated successfully, but these errors were encountered: