-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Audit] Add bucketed scan info in query plan of data source v1 [databricks] #4461
[Audit] Add bucketed scan info in query plan of data source v1 [databricks] #4461
Conversation
Signed-off-by: remzi <13716567376yh@gmail.com>
build |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just some nits. But, we need to understand and fix the failing test. We also should run these against databricks so we are sure it works there too.
assert "Bucketed: false (disabled by query planner)" in df._sc._jvm.PythonSQLUtils.explainString(df._jdf.queryExecution(), "simple") | ||
|
||
|
||
with_gpu_session(bucket_column_not_read) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why have 1 test for all of these instead of 4 separate tests? It does not look like you are saving any code space or anything like that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for your feedback! I will split it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is one failing test because the Spark version is before 3.1.0. I will add a skipif
mark to the test.
@allow_non_gpu(any=True) | ||
def test_explain_bucketed_scan(spark_tmp_table_factory): | ||
""" | ||
https://github.com/NVIDIA/spark-rapids/issues/3952 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A text explanation would be good to be able to get a high level idea of what is happening.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the feedback! I will add comments.
Signed-off-by: remzi <13716567376yh@gmail.com>
Signed-off-by: remzi <13716567376yh@gmail.com>
build |
Solve merge conflicts because shim layers are refactored. |
Signed-off-by: remzi <13716567376yh@gmail.com>
build |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks like some of the files might need to be updated for 2022 copyright. Other than that it looks good.
Thank you. I will update them |
Signed-off-by: remzi <13716567376yh@gmail.com>
build |
Signed-off-by: remzi <13716567376yh@gmail.com>
build |
Signed-off-by: remzi 13716567376yh@gmail.com
close #3952