[FEA] support 4 KiB alignment when reading shuffle buffers #2492

rongou · 2021-05-24T21:37:58Z

Is your feature request related to a problem? Please describe.
When GDS spill is enabled, we would like to perform aligned IO, i.e. reads and writes between GPU memory and NVMe drives should be aligned to 4 KiB. However, the current implementation in the shuffle server (specifically in BufferSendState) packs shuffle buffers into the UCX bounce buffer, making it impossible to do aligned reads.

Describe the solution you'd like
An option in the shuffle server to allow 4 KiB aligned reads of shuffle buffers.

Describe alternatives you've considered
An alternative is to unspill whole shuffle buffers back into GPU memory before streaming them to the UCX bounce buffer, but for large shuffle buffers this might cause more spilling, thus less efficient.

Additional context
The GPUDirect Storage best practices guide talks about aligned vs. unaligned IO (https://docs.nvidia.com/gpudirect-storage/best-practices-guide/index.html).

@abellina @jlowe

The text was updated successfully, but these errors were encountered:

sameerz · 2021-05-25T20:15:18Z

@rongou can we target this change to the 21.08 release, given that we are in burndown?

rongou added feature request New feature or request ? - Needs Triage Need team to review and classify labels May 24, 2021

sameerz added task Work required that improves the product but is not user facing and removed ? - Needs Triage Need team to review and classify feature request New feature or request labels May 25, 2021

rongou mentioned this issue Jun 4, 2021

[FEA] GDS Integration #1445

Closed

11 tasks

rongou closed this as completed Jan 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] support 4 KiB alignment when reading shuffle buffers #2492

[FEA] support 4 KiB alignment when reading shuffle buffers #2492

rongou commented May 24, 2021

sameerz commented May 25, 2021

[FEA] support 4 KiB alignment when reading shuffle buffers #2492

[FEA] support 4 KiB alignment when reading shuffle buffers #2492

Comments

rongou commented May 24, 2021

sameerz commented May 25, 2021