catch rmm::out_of_memory exceptions in jni #9525

Merged: 2 commits, Oct 27, 2021

rongou (Contributor) commented Oct 26, 2021

RMM now throws a more specific exception, `rmm::out_of_memory`, when it knows for sure that a resource is running out of memory. This PR catches that exception in the JNI layer and translates it to OOM errors, so that we don't hide other non-recoverable CUDA errors.
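
The idea described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `sketch::out_of_memory`, `sketch::cuda_error`, `sketch::jni_error`, and `sketch::translate` are hypothetical stand-ins so the example compiles without the RMM or JNI headers. It only shows the catch ordering, where the specific out-of-memory type is matched before the generic handler so that other errors are not reported as OOM.

```cpp
#include <exception>
#include <new>
#include <stdexcept>

namespace sketch {

// Hypothetical stand-in for rmm::out_of_memory: an allocation failure that is
// known to be a genuine out-of-memory condition.
struct out_of_memory : std::bad_alloc {
  const char* what() const noexcept override { return "out_of_memory"; }
};

// Hypothetical stand-in for any other, non-recoverable CUDA error.
struct cuda_error : std::runtime_error {
  using std::runtime_error::runtime_error;
};

// Hypothetical result type: which kind of Java-side error would be raised.
enum class jni_error { none, oom, fatal };

// Runs `fn` and classifies whatever it throws. The specific handler comes
// first; everything else derived from std::exception is treated as fatal.
template <typename Fn>
jni_error translate(Fn&& fn) {
  try {
    fn();
    return jni_error::none;
  } catch (const out_of_memory&) {
    return jni_error::oom;    // recoverable: caller may free memory and retry
  } catch (const std::exception&) {
    return jni_error::fatal;  // not an OOM: do not mask it as one
  }
}

}  // namespace sketch
```

In this sketch, `sketch::translate([] { throw sketch::out_of_memory{}; })` yields `jni_error::oom`, while a thrown `sketch::cuda_error` yields `jni_error::fatal`. A real JNI layer would throw the corresponding Java exception instead of returning an enum.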

@rongou added labels on Oct 26, 2021: 3 - Ready for Review, RMM, Java, Spark, improvement, non-breaking
@rongou rongou requested review from jlowe and revans2 October 26, 2021 16:48
@rongou rongou self-assigned this Oct 26, 2021
@rongou rongou requested a review from a team as a code owner October 26, 2021 16:48
codecov bot commented Oct 26, 2021

Codecov Report

Merging #9525 (c6c0eb7) into branch-21.12 (ab4bfaa) will decrease coverage by 0.12%.
The diff coverage is n/a.

@@               Coverage Diff                @@
##           branch-21.12    #9525      +/-   ##
================================================
- Coverage         10.79%   10.66%   -0.13%     
================================================
  Files               116      117       +1     
  Lines             18869    19725     +856     
================================================
+ Hits               2036     2104      +68     
- Misses            16833    17621     +788     
Impacted Files | Coverage Δ
python/dask_cudf/dask_cudf/sorting.py | 92.90% <0.00%> (-1.21%) ⬇️
python/cudf/cudf/io/csv.py | 0.00% <0.00%> (ø)
python/cudf/cudf/io/hdf.py | 0.00% <0.00%> (ø)
python/cudf/cudf/io/orc.py | 0.00% <0.00%> (ø)
python/cudf/cudf/__init__.py | 0.00% <0.00%> (ø)
python/cudf/cudf/_version.py | 0.00% <0.00%> (ø)
python/cudf/cudf/core/abc.py | 0.00% <0.00%> (ø)
python/cudf/cudf/api/types.py | 0.00% <0.00%> (ø)
python/cudf/cudf/io/dlpack.py | 0.00% <0.00%> (ø)
python/cudf/cudf/core/frame.py | 0.00% <0.00%> (ø)
... and 65 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 306e42f...c6c0eb7.

rongou (Contributor, Author) commented Oct 27, 2021

@gpucibot merge

@rapids-bot rapids-bot bot merged commit d8f23c1 into rapidsai:branch-21.12 Oct 27, 2021
@rongou rongou deleted the catch-oom branch November 23, 2021 17:24