Replace cudaMemPrefetchAsync with rmm::prefetch #5

Open
wants to merge 20 commits into
base: feat/prefetching
from

Conversation

@mhaseeb123 mhaseeb123 commented Jul 12, 2024

This PR replaces the raw cudaMemPrefetchAsync call and its error checking with rmm::prefetch(), which implements the same functionality, to improve portability and avoid code duplication. rmm::prefetch() Item 1 is used because it applies to both device_buffers and device_vectors.
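
The pattern described above, one shared helper that owns the raw call and its error check, can be sketched in plain C++ without a GPU. Everything in the sketch namespace below is a hypothetical stand-in written for illustration: raw_prefetch_async is not cudaMemPrefetchAsync, sketch::prefetch is not rmm::prefetch, and neither signature is taken from this PR. The two std::vector arguments merely play the roles of a byte buffer and a typed vector.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

namespace sketch {

// Hypothetical status codes standing in for a C-style runtime API's return values.
enum class status { success, invalid_value };

// Records what the stand-in raw call was asked to do, so the effect is observable.
struct call_record {
  std::size_t calls = 0;
  std::size_t total_bytes = 0;
};

inline call_record& record()
{
  static call_record r;
  return r;
}

// Hypothetical stand-in for a raw prefetch call: takes a byte count, returns a status.
inline status raw_prefetch_async(void const* ptr, std::size_t bytes, int device)
{
  if (ptr == nullptr || device < 0) { return status::invalid_value; }
  record().calls += 1;
  record().total_bytes += bytes;
  return status::success;
}

// The single shared helper: converts an element count to bytes, makes the raw
// call, and checks the result, so call sites carry no error handling of their own.
template <typename T>
void prefetch(T const* ptr, std::size_t count, int device)
{
  status const result = raw_prefetch_async(ptr, count * sizeof(T), device);
  if (result != status::success) { throw std::runtime_error("prefetch failed"); }
}

// Usage: the same helper serves an untyped byte buffer and a typed vector.
// Returns the running total of bytes requested so far.
inline std::size_t prefetch_both(std::vector<unsigned char> const& buffer,
                                 std::vector<int> const& values,
                                 int device)
{
  assert(device >= 0);
  prefetch(buffer.data(), buffer.size(), device);
  prefetch(values.data(), values.size(), device);
  return record().total_bytes;
}

}  // namespace sketch
```

Because the helper accepts a pointer and an element count, any container that exposes both can use it, which is presumably the property the description refers to when it says one overload covers both container types.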

@mhaseeb123 mhaseeb123 changed the title from "Replace cudaMemPrefetchAsync with rmm::prefetch for portability and code duplication." to "Replace cudaMemPrefetchAsync with rmm::prefetch" Jul 12, 2024
@vyasr vyasr force-pushed the feat/prefetching branch 2 times, most recently from 7bc1366 to dc5a807 July 18, 2024 21:35
2 participants