-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] test_hash_multiple_mode_query failing #1185
Comments
I'm not sure this failure is specific to Databricks. |
Do you folks want me to take a look at this in case @tgravescs and @jlowe are not? |
@kuhushukla if you could look into this that'd be great. The nightly Spark 3.0.0 integration tests have been failing with this as well, it appears. |
Will update here asap. |
Seems related to NullType support IMO, as of the failing ones - at least one column is Null. |
I am unable to repro this locally even with the spark-3.0.0 artifact that is used by the nightly build. I will see what I can do further and pissbly add explain-all for debug if nothing works out. |
Thanks @kuhushukla It looks like the failure is happening on 3.0.1, and it is related to First.
I'll see if I can fix it. |
Yup it is my fault. I forgot to update the 3.0.1 shim layer to let first and last work with NullType. I got the 3.0.0 shim layer. I'll put up a PR shortly. |
…IDIA#1185) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Describe the bug
11:37:59 FAILED src/main/python/hash_aggregate_test.py::test_hash_multiple_mode_query[{'spark.rapids.sql.variableFloatAgg.enabled': 'true', 'spark.rapids.sql.hasNans': 'false', 'spark.rapids.sql.castStringToFloat.enabled': 'true'}-[('a', Null), ('b', Integer), ('c', Long)]][IGNORE_ORDER, INCOMPAT, APPROXIMATE_FLOAT]
11:37:59 FAILED src/main/python/hash_aggregate_test.py::test_hash_query_multiple_distincts_with_non_distinct[{'spark.rapids.sql.variableFloatAgg.enabled': 'true', 'spark.rapids.sql.hasNans': 'false', 'spark.rapids.sql.castStringToFloat.enabled': 'true'}-[('a', Null), ('b', Integer), ('c', Long)]][IGNORE_ORDER, INCOMPAT, APPROXIMATE_FLOAT]
11:37:59 FAILED src/main/python/hash_aggregate_test.py::test_hash_query_max_with_multiple_distincts[{'spark.rapids.sql.variableFloatAgg.enabled': 'true', 'spark.rapids.sql.hasNans': 'false', 'spark.rapids.sql.castStringToFloat.enabled': 'true'}-[('a', RepeatSeq(String)), ('b', Integer), ('c', Null)]][IGNORE_ORDER, INCOMPAT, APPROXIMATE_FLOAT]
One of the reasons is it looks like HashAggregate is not on the GPU:
The text was updated successfully, but these errors were encountered: