Unable to Visualize Multiple Concurrent Tasks on a GPU #756

Answered by lijinf2
mnlcarv asked this question in Q&A

Thank you for sharing the experimental results. The data is presented very clearly.

Allow me to offer an interpretation. Because your cluster has only one physical GPU, a single GPU task is expected. Internally, Spark RAPIDS ML repartitions the input Spark DataFrame so that the number of partitions equals the number of available GPUs, and only then invokes cuML and NCCL for GPU computation. cuML and NCCL currently assume one process per GPU, so the repartitioning happens whenever the partition count differs from the GPU count.
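To make the repartitioning step concrete, here is a minimal pure-Python sketch of the idea. It is not the Spark RAPIDS ML implementation: `repartition_for_gpus` is a hypothetical helper, and the round-robin assignment is an assumption standing in for whatever shuffle Spark actually performs. It only shows why several input partitions collapse into one GPU task when a single GPU is available.

```python
from typing import List, Sequence


def repartition_for_gpus(
    partitions: Sequence[Sequence[int]], num_gpus: int
) -> List[List[int]]:
    """Redistribute rows so that there is exactly one partition per GPU.

    Hypothetical helper for illustration only; round-robin assignment is
    an assumption, not the strategy Spark RAPIDS ML necessarily uses.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    # If the layout already matches the GPU count, nothing is moved.
    if len(partitions) == num_gpus:
        return [list(part) for part in partitions]
    # Otherwise flatten all rows and deal them out, one partition per GPU.
    rows = [row for part in partitions for row in part]
    return [rows[i::num_gpus] for i in range(num_gpus)]


# Four input partitions but one physical GPU: all rows end up in a single
# partition, which corresponds to a single GPU task.
input_partitions = [[0, 1], [2, 3], [4, 5], [6, 7]]
single_gpu = repartition_for_gpus(input_partitions, num_gpus=1)
two_gpus = repartition_for_gpus(input_partitions, num_gpus=2)
print(len(single_gpu), len(two_gpus))
```

In this sketch, the number of GPU tasks follows the GPU count rather than the partition count of the input data.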

In Case 2, the runtime is slightly slower than in Case 1. This suggests that the overh…

Answer selected by mnlcarv