Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support launching Map Pandas UDF on empty partitions #9557

Merged

Conversation

firestarman
Copy link
Collaborator

@firestarman firestarman commented Oct 27, 2023

fixes #9480

This PR adds support of launching Map Pandas UDF on empty partitions to align with Spark's behavior.

So far I don't see other types of Pandas UDF will be called for empty partitions.

The test is copied from the example in the linked issue.

Signed-off-by: Firestarman <firestarmanllc@gmail.com>
@firestarman
Copy link
Collaborator Author

build

@sameerz sameerz added the bug Something isn't working label Oct 27, 2023
Signed-off-by: Firestarman <firestarmanllc@gmail.com>
@firestarman
Copy link
Collaborator Author

build

@firestarman firestarman merged commit 2e282f9 into NVIDIA:branch-23.12 Oct 30, 2023
36 of 37 checks passed
@firestarman firestarman deleted the run-pandas-udf-empty-part branch October 30, 2023 01:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] mapInPandas doesn't invoke udf on empty partitions
4 participants