Fix a hanging issue when processing empty data. #841

firestarman · 2020-09-24T01:51:46Z

The output iterator will wait on the batch queue when calling hasNext, and suppose to be waked up when the Python runner inserts something into the batch queue. But the insertion will never happen if the input data is empty. So it hangs forever.

The solution is to let the Python runner always wake up the output iterator after it finishes the data writing by calling the new added API finish().

Also added the test for it. The 'small_data' is small enough to let some tasks get no data when running.

Signed-off-by: Firestarman firestarmanllc@gmail.com

firestarman · 2020-09-24T08:21:59Z

build

revans2

Great work finding this.

firestarman · 2020-09-25T00:51:03Z

build

The output iterator will wait on the batch queue when calling `hasNext`, and suppose to be waked up when the Python runner inserts something into the batch queue. But the insertion will never happen if the input data is empty. So it hangs forever. The solution is to let the Python runner always wake up the output iterator after it finishes the data writing by calling the new added API `finish()`. Signed-off-by: Firestarman <firestarmanllc@gmail.com>

The 'small_data' is small enough to let some tasks get no data when running. Now only test this for the Scalar type who just implements the columnar pipeline. Signed-off-by: Firestarman <firestarmanllc@gmail.com>

firestarman · 2020-09-25T01:46:35Z

build

firestarman · 2020-09-25T01:52:29Z

@revans2 Added the test for it. Could you take another look?

firestarman · 2020-09-25T03:02:30Z

build

revans2 · 2020-09-25T13:06:42Z

build

* Fix a hanging issue when processing empty data. The output iterator will wait on the batch queue when calling `hasNext`, and suppose to be waked up when the Python runner inserts something into the batch queue. But the insertion will never happen if the input data is empty. So it hangs forever. The solution is to let the Python runner always wake up the output iterator after it finishes the data writing by calling the new added API `finish()`. Signed-off-by: Firestarman <firestarmanllc@gmail.com> * Add tests for processing empty data. The 'small_data' is small enough to let some tasks get no data when running. Now only test this for the Scalar type who just implements the columnar pipeline. Signed-off-by: Firestarman <firestarmanllc@gmail.com>

…IDIA#841) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com> Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>

firestarman requested a review from revans2 September 24, 2020 01:51

firestarman linked an issue Sep 24, 2020 that may be closed by this pull request

[BUG] udf_cudf_test::test_with_column fails with IPC error #750

Closed

sameerz added the bug Something isn't working label Sep 24, 2020

revans2 previously approved these changes Sep 24, 2020

View reviewed changes

firestarman added 2 commits September 25, 2020 09:10

Add tests for processing empty data.

fa7d545

The 'small_data' is small enough to let some tasks get no data when running. Now only test this for the Scalar type who just implements the columnar pipeline. Signed-off-by: Firestarman <firestarmanllc@gmail.com>

firestarman dismissed revans2’s stale review via fa7d545 September 25, 2020 01:17

firestarman force-pushed the fix-hang-issue branch from 2b334cd to fa7d545 Compare September 25, 2020 01:17

firestarman changed the title ~~[WIP] Fix a hanging issue when processing empty data.~~ Fix a hanging issue when processing empty data. Sep 25, 2020

revans2 approved these changes Sep 25, 2020

View reviewed changes

firestarman merged commit 779b9fa into NVIDIA:branch-0.3 Sep 25, 2020

firestarman deleted the fix-hang-issue branch September 25, 2020 23:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix a hanging issue when processing empty data. #841

Fix a hanging issue when processing empty data. #841

firestarman commented Sep 24, 2020 •

edited

Loading

firestarman commented Sep 24, 2020

revans2 left a comment

firestarman commented Sep 25, 2020

firestarman commented Sep 25, 2020

firestarman commented Sep 25, 2020

firestarman commented Sep 25, 2020

revans2 commented Sep 25, 2020

Fix a hanging issue when processing empty data. #841

Fix a hanging issue when processing empty data. #841

Conversation

firestarman commented Sep 24, 2020 • edited Loading

firestarman commented Sep 24, 2020

revans2 left a comment

Choose a reason for hiding this comment

firestarman commented Sep 25, 2020

firestarman commented Sep 25, 2020

firestarman commented Sep 25, 2020

firestarman commented Sep 25, 2020

revans2 commented Sep 25, 2020

firestarman commented Sep 24, 2020 •

edited

Loading