Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Spark 2.x explain API with changes in 22.04 #5062

Merged
merged 24 commits into from
Mar 28, 2022

Conversation

tgravescs
Copy link
Collaborator

fixes #4908

This updates all the Spark 2.x explain api code to pull over checks from the changes to the 22.04 code.

I updated diff scripts and checks all the Overrides to try to catch anything new added.

I tested using the NDS queries. I diff'd the new output of 22.04 vs the 22.02 explain api output and the main differences now seem to be with ExistenceJoin being supported and then LeftSemi join. Unfortunately I also found a case the Spark 2.x planner inserts EqualNullSafe, which we don't support on the GPU, but in Spark 3.x it doesn't generate that, it uses other expressions so in Spark 3.x it does say it will run on the GPU. I documented that one.
I also diff'd the results vs explain on 3.1 and didn't see anything we missed.

@tgravescs tgravescs self-assigned this Mar 25, 2022
Signed-off-by: Thomas Graves <tgraves@nvidia.com>
@tgravescs
Copy link
Collaborator Author

build

@sameerz sameerz added this to the Mar 21 - Apr 1 milestone Mar 26, 2022
@tgravescs tgravescs merged commit f07a0aa into NVIDIA:branch-22.04 Mar 28, 2022
@tgravescs tgravescs deleted the spark2-branch2204 branch March 28, 2022 14:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Update spark 2.x explain code for 22.04
3 participants