Use thread-local to track CUDA device in JNI [skip ci] #6597

jlowe · 2020-10-26T14:49:44Z

Looking at a recent Nsight profile of a Java application using cudf, I noticed a lot of CPU samples in cudaGetDevice. This is caused by the auto_set_device calls made in each cudf JNI function to ensure each thread is using the same device used when RMM was initialized. Initially we thought calling cudaGetDevice would be "cheap enough", but this is apparently not the case, at least when profiling under Nsight.

This changes the cudf JNI code to track the thread's CUDA device in a thread local rather than needing to call cudaGetDevice on each cudf call to obtain it. This saved 0.5ms per Table.contiguousSplit call in a microbenchmark, and I noticed it also significantly reduced the time dilation we've seen in Nsight profiles of cudf Java applications.

GPUtester · 2020-10-26T14:50:18Z

Please update the changelog in order to start CI tests.

View the gpuCI docs here.

Use thread-local to track CUDA device in JNI

bd13cb1

jlowe added Java Affects Java cuDF API. 4 - Needs cuDF (Java) Reviewer labels Oct 26, 2020

jlowe requested a review from a team as a code owner October 26, 2020 14:49

jlowe self-assigned this Oct 26, 2020

changelog

507bd8b

abellina approved these changes Oct 26, 2020

View reviewed changes

revans2 approved these changes Oct 26, 2020

View reviewed changes

jlowe merged commit cbd2726 into rapidsai:branch-0.17 Oct 26, 2020

jlowe deleted the jni-cuda-device branch September 10, 2021 15:43

vyasr added 4 - Needs Review Waiting for reviewer to review or respond and removed 4 - Needs cuDF (Java) Reviewer labels Feb 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use thread-local to track CUDA device in JNI [skip ci] #6597

Use thread-local to track CUDA device in JNI [skip ci] #6597

jlowe commented Oct 26, 2020

GPUtester commented Oct 26, 2020

Use thread-local to track CUDA device in JNI [skip ci] #6597

Use thread-local to track CUDA device in JNI [skip ci] #6597

Conversation

jlowe commented Oct 26, 2020

GPUtester commented Oct 26, 2020