Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Avoid empty batches on columnar to row conversion #1204

Merged
merged 3 commits into from
Dec 2, 2020

Conversation

revans2
Copy link
Collaborator

@revans2 revans2 commented Nov 25, 2020

The probably fixes #1195

I have not been able to reproduce the issue itself, but from what was showing it looks like an empty batch some how got into the column to row conversion. This filters out those empty batches so it should fix it.

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
@revans2 revans2 added this to the Nov 23 - Dec 4 milestone Nov 25, 2020
@revans2 revans2 self-assigned this Nov 25, 2020
@revans2
Copy link
Collaborator Author

revans2 commented Nov 25, 2020

build

@abellina
Copy link
Collaborator

I'll try this patch out @revans2

@abellina
Copy link
Collaborator

Good news, with this change q51 at 3TB passes for me. This is the one that showed the empty queue.

@jlowe
Copy link
Member

jlowe commented Nov 25, 2020

CI failed due to Spark 3.1 ShuffleOrigin change. Merging to latest on branch-0.3 should fix (and should keep prior approvals).

@sameerz sameerz added the bug Something isn't working label Nov 26, 2020
@revans2
Copy link
Collaborator Author

revans2 commented Dec 1, 2020

build

@abellina
Copy link
Collaborator

abellina commented Dec 2, 2020

This PR also needs @jlowe's fix for SortOrder.

@revans2
Copy link
Collaborator Author

revans2 commented Dec 2, 2020

build

@revans2 revans2 merged commit 15ac4bd into NVIDIA:branch-0.3 Dec 2, 2020
@revans2 revans2 deleted the avoid_empty_batchs_to_row branch December 2, 2020 18:43
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this pull request Nov 30, 2023
…IDIA#1204)

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] AcceleratedColumnarToRowIterator queue empty
4 participants