Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix issues with canonicalization #623

Merged
merged 1 commit into from
Sep 2, 2020

Conversation

revans2
Copy link
Collaborator

@revans2 revans2 commented Aug 28, 2020

This fixes #606

While debugging this I ran into a number of errors in canonicalization of the plan that I fixed. The one that was specifically causing #606 was how children was set in GpuAggregateExpression. Spark does a lot of magic around Expression and SparkPlan, and if your children does not include parameters passed into your case class then some operations, like canonicalization, just skip them, and they stay unchanged.

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
@revans2 revans2 added bug Something isn't working performance A performance related task/issue labels Aug 28, 2020
@revans2 revans2 added this to the Aug 31 - Sep 11 milestone Aug 28, 2020
@revans2 revans2 self-assigned this Aug 28, 2020
@revans2
Copy link
Collaborator Author

revans2 commented Aug 28, 2020

build

@abellina abellina self-requested a review August 28, 2020 16:19
@revans2
Copy link
Collaborator Author

revans2 commented Sep 2, 2020

@abellina are there changes you want me to make?

@abellina
Copy link
Collaborator

abellina commented Sep 2, 2020

Sorry @revans2 I lost track of this one, nope. I am just going to resolve the conversations.

Copy link
Collaborator

@abellina abellina left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did a pass and compared against CPU. I do not see any surprises.

@revans2 revans2 merged commit 7943866 into NVIDIA:branch-0.2 Sep 2, 2020
@revans2 revans2 deleted the fix-canonicalize branch September 2, 2020 15:16
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this pull request Nov 30, 2023
…IDIA#623)

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working performance A performance related task/issue
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] Multiple scans for same data source with TPC-DS query59 with delta format
2 participants