[FEA] Make better use of pinned memory with Spark shuffle #4516

Closed
rongou opened this issue Jan 12, 2022 · 1 comment
Labels
cudf_dependency (an issue or PR with this label depends on a new feature in cudf), improve, performance (a performance-related task/issue)

Comments

@rongou
Collaborator

rongou commented Jan 12, 2022

Is your feature request related to a problem? Please describe.
With the RAPIDS shuffle manager (UCX) turned off, we don't use much pinned memory, and profiling shows there are a lot of cudaMemcpy calls done with pageable memory.

Describe the solution you'd like
Make better use of pinned memory so that most, if not all, cudaMemcpy calls take advantage of pinned memory.

Describe alternatives you've considered
With UCX enabled, increasing the size of the pinned memory pool seems to greatly boost performance. We may want to consider allowing the RAPIDS shuffle manager without UCX, or creating an external shuffle manager that caches GPU memory buffers.
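
For reference, the pinned pool size is set through the `spark.rapids.memory.pinnedPool.size` configuration key. Below is a minimal sketch of building the matching `spark-submit` flags. The helper name `pinned_pool_conf_args` is hypothetical and the `8g` value is only an illustration, not a number taken from the profiling mentioned above.

```python
from typing import List


def pinned_pool_conf_args(pool_size: str) -> List[str]:
    """Build spark-submit arguments that set the RAPIDS pinned memory pool size.

    `pool_size` uses Spark's size syntax (for example "8g"); the value passed
    below is illustrative and should be tuned per workload.
    """
    conf = {"spark.rapids.memory.pinnedPool.size": pool_size}
    args: List[str] = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


# Print the flags so they can be pasted into a spark-submit command line.
print(" ".join(pinned_pool_conf_args("8g")))
```

Comparing a profile taken with a small pool against one taken with a larger pool should show whether the remaining copies still fall back to pageable memory.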

Additional context
PR for improving pinned memory usage with UCX enabled: #4497

@rongou rongou added the feature request, ? - Needs Triage, performance, and improve labels Jan 12, 2022
@rongou rongou added this to the Jan 10 - Jan 28 milestone Jan 12, 2022
@rongou rongou self-assigned this Jan 12, 2022
@sameerz sameerz removed the ? - Needs Triage and feature request labels Jan 18, 2022
@rongou
Collaborator Author

rongou commented Jan 21, 2022

Started working on a host memory pool in RMM, but it's sort of blocked by the pending move to cuda::memory_resource, see rapidsai/rmm#618.
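
To illustrate what a host memory pool with a fallback path might look like, here is a plain-Python simulation. It is not RMM's API: `HostBufferPool` and its methods are hypothetical names, the buffers are ordinary bytearrays, and nothing is actually page-locked. It only shows the bookkeeping: serve requests from a fixed-capacity pool while space remains, and fall back to a non-pooled (pageable) buffer otherwise.

```python
class HostBufferPool:
    """Toy fixed-capacity host buffer pool with a fallback path.

    Simulation only: a real pool would hand out slices of one page-locked
    (pinned) allocation instead of creating bytearrays.
    """

    def __init__(self, capacity_bytes: int):
        self.capacity = capacity_bytes
        self.in_use = 0

    def allocate(self, size: int):
        """Return (buffer, from_pool); fall back when the pool is full."""
        if size <= self.capacity - self.in_use:
            self.in_use += size
            return bytearray(size), True
        # Stands in for a pageable allocation made outside the pool.
        return bytearray(size), False

    def release(self, buf, from_pool: bool) -> None:
        """Return pooled bytes so later requests can reuse the space."""
        if from_pool:
            self.in_use -= len(buf)


pool = HostBufferPool(capacity_bytes=1024)
a, a_pooled = pool.allocate(768)
b, b_pooled = pool.allocate(512)  # does not fit next to `a`: falls back
pool.release(a, a_pooled)
c, c_pooled = pool.allocate(512)  # fits once `a` has been released
print(a_pooled, b_pooled, c_pooled)
```

The fallback keeps allocation from failing when the pool is exhausted, at the cost of the slower pageable copies described in the issue.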

@sameerz sameerz removed this from the Jan 10 - Jan 28 milestone Jan 30, 2022
@rongou rongou removed their assignment Feb 11, 2022
@sameerz sameerz added the cudf_dependency label Mar 1, 2022
@jlowe jlowe closed this as not planned Feb 28, 2024