Support serializing packed tables directly for shuffle write #19

firestarman · 2024-06-17T03:50:38Z

This PR accelerates the normal shuffle path by partitioning and slicing tables on the GPU.
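As a rough, CPU-only illustration of what "partitioning and slicing" means here, the sketch below reorders rows so that each partition is contiguous and then cuts one slice per partition from the resulting offsets. The function names (`partition_rows`, `slice_partitions`) are hypothetical and plain Python lists stand in for GPU tables; this is not the code in the PR.

```python
from typing import List, Sequence, Tuple


def partition_rows(rows: Sequence, part_ids: Sequence[int],
                   num_parts: int) -> Tuple[list, List[int]]:
    """Reorder rows so each partition is contiguous.

    Returns the reordered rows and num_parts + 1 offsets, where partition i
    occupies reordered[offsets[i]:offsets[i + 1]].
    """
    # Count rows per partition.
    counts = [0] * num_parts
    for p in part_ids:
        counts[p] += 1

    # Prefix sum of the counts gives the start offset of each partition.
    offsets = [0] * (num_parts + 1)
    for i in range(num_parts):
        offsets[i + 1] = offsets[i] + counts[i]

    # Scatter each row to the next free position of its partition.
    cursor = offsets[:-1].copy()
    reordered = [None] * len(rows)
    for row, p in zip(rows, part_ids):
        reordered[cursor[p]] = row
        cursor[p] += 1
    return reordered, offsets


def slice_partitions(reordered: list, offsets: List[int]) -> List[list]:
    """Cut one contiguous slice per partition."""
    return [reordered[offsets[i]:offsets[i + 1]]
            for i in range(len(offsets) - 1)]
```

For example, rows `["a", "b", "c", "d", "e"]` with partition ids `[1, 0, 1, 2, 0]` and three partitions yield the slices `[["b", "e"], ["a", "c"], ["d"]]`.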

Each sliced table is already in a serializable form, so it can be written directly to the shuffle output stream, along with lightweight metadata (a TableMeta) that is used to rebuild the table on the shuffle read side.
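One way to picture "metadata plus packed buffer" on the wire is a length-prefixed frame, sketched below. The layout (4-byte metadata length, metadata, 8-byte payload length, payload) and the function name `write_packed_table` are assumptions made for illustration, and JSON is only a stand-in for the real TableMeta encoding; the PR's actual wire format may differ.

```python
import io
import json
import struct
from typing import BinaryIO


def write_packed_table(out: BinaryIO, meta: dict, payload: bytes) -> None:
    """Write one frame: metadata length, metadata, payload length, payload.

    `meta` is a stand-in for a TableMeta and `payload` for the contiguous
    buffer of an already-packed table, so no per-column serialization is
    needed at write time.
    """
    meta_bytes = json.dumps(meta).encode("utf-8")
    out.write(struct.pack(">I", len(meta_bytes)))  # 4-byte big-endian length
    out.write(meta_bytes)
    out.write(struct.pack(">Q", len(payload)))     # 8-byte big-endian length
    out.write(payload)


if __name__ == "__main__":
    buf = io.BytesIO()
    write_packed_table(buf, {"rows": 2}, b"\x01\x02")
    print(len(buf.getvalue()), "bytes written")
```

The point of the design is that the write side only copies bytes it already has; all the information needed to interpret them travels in the small header.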

On the shuffle read side, the newly introduced PackedTableIterator reads the tables from the shuffle input stream and rebuilds them on the GPU using the existing utilities (MetaUtils, GpuCompressedColumnVector). The existing GpuCoalesceBatches node then concatenates the batches for the downstream operators, similar to what the Rapids Shuffle does.
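The read path described above can be sketched as an iterator over framed tables followed by a coalescing step. Everything here is hypothetical: the frame layout (4-byte metadata length, JSON metadata, 8-byte payload length, payload), the helper names, and the use of byte concatenation in place of GPU table concatenation are assumptions for illustration, not the behavior of PackedTableIterator or GpuCoalesceBatches.

```python
import io
import json
import struct
from typing import BinaryIO, Iterable, Iterator, Tuple


def frame(meta: dict, payload: bytes) -> bytes:
    """Build one frame in the assumed layout (used to create sample input)."""
    meta_bytes = json.dumps(meta).encode("utf-8")
    return (struct.pack(">I", len(meta_bytes)) + meta_bytes +
            struct.pack(">Q", len(payload)) + payload)


def _read_exact(stream: BinaryIO, n: int) -> bytes:
    data = stream.read(n)
    if len(data) != n:
        raise EOFError("truncated shuffle stream")
    return data


def read_packed_tables(stream: BinaryIO) -> Iterator[Tuple[dict, bytes]]:
    """Yield (meta, payload) pairs until the stream is exhausted."""
    while True:
        head = stream.read(4)
        if not head:  # clean end of stream
            return
        if len(head) != 4:
            raise EOFError("truncated shuffle stream")
        (meta_len,) = struct.unpack(">I", head)
        meta = json.loads(_read_exact(stream, meta_len))
        (size,) = struct.unpack(">Q", _read_exact(stream, 8))
        yield meta, _read_exact(stream, size)


def coalesce(batches: Iterable[Tuple[dict, bytes]],
             target_bytes: int) -> Iterator[bytes]:
    """Group small batches until adding one more would exceed target_bytes."""
    pending, pending_size = [], 0
    for _meta, payload in batches:
        if pending and pending_size + len(payload) > target_bytes:
            yield b"".join(pending)
            pending, pending_size = [], 0
        pending.append(payload)
        pending_size += len(payload)
    if pending:
        yield b"".join(pending)
```

Feeding three frames with payloads `b"aa"`, `b"bbb"` and `b"cccc"` through `coalesce(read_packed_tables(stream), 5)` produces two batches, `b"aabbb"` and `b"cccc"`, which mirrors how many small shuffle blocks become fewer, larger batches for downstream operators.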

Signed-off-by: Firestarman <firestarmanllc@gmail.com>


firestarman added 2 commits June 17, 2024 11:42

Support serializing packed tables directly for shuffle write

4bb1230

--------- Signed-off-by: Firestarman <firestarmanllc@gmail.com>

add a missing metric

b037d50

Signed-off-by: Firestarman <firestarmanllc@gmail.com>

wjxiz1992 closed this Jun 18, 2024
