
Allow skipping host spill for a direct device->disk spill #9211

Merged
merged 12 commits into from
Sep 15, 2023

Conversation

@abellina (Collaborator) commented Sep 8, 2023

Contributes to #8881.

This is a work in progress that allows spill to skip the host tier. I still need to run it under a workload (by enabling the "skip host" mode locally), check for leaks, and make sure all the accounting is working correctly. The mode is left disabled by default because I think this is something that will be integrated later against the new HostAlloc API, but I can try integrating that in this PR if needed.

I did some minor refactoring because I didn't want to force a particular order between calling free and getting the size of the spilled RapidsBuffer in the source tier. I made that method a val, so we are guaranteed it is set until GC.

This also fixes #9223, an issue with 0-byte RapidsBuffers as they spill to disk. mmap on a 0-byte file segment is not valid, yet this can very well happen given, for example, a ColumnarBatch with 0 rows. The code in this PR will not allow these 0-byte buffers to be marked as spillable, so they will never reach disk.
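The 0-byte guard described above can be sketched roughly as follows. This is a minimal illustration, not the plugin's actual API: `BufferSketch` and `isSpillable` are hypothetical names standing in for the real RapidsBuffer bookkeeping.

```scala
// Hypothetical sketch: a 0-byte buffer must never be marked spillable,
// because mmap-ing a 0-length file segment is invalid. A ColumnarBatch
// with 0 rows can legitimately produce such a buffer.
case class BufferSketch(id: Long, sizeInBytes: Long)

def isSpillable(buf: BufferSketch): Boolean =
  // 0-byte buffers stay out of the spill framework, so they never reach disk
  buf.sizeInBytes > 0
```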

    throw new IllegalStateException("copying from buffer without device memory")

    stream: Cuda.Stream): Option[RapidsBufferBase] = {
      val wouldFit = trySpillToMaximumSize(other, stream)
      // TODO: this is disabled for now since subsequent work will tie this into
@abellina (Author):
@revans2 I think this is an integration point with the host allocator, so I left it as a TODO, disabled by default, so the code does what it used to do if we decide to merge this. Let me know if I should look into the APIs and try to integrate things, or if this is what you needed.
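The decision being gated by the TODO above can be sketched as follows. This is a simplified illustration under assumed names (`chooseSpillTarget`, `skipHostEnabled`, `wouldFitInHost` are not the real API): when skip-host mode is on, or when the buffer would not fit in the host store, the spill goes device-to-disk directly.

```scala
// Hypothetical sketch of the skip-host spill decision.
sealed trait SpillTier
case object HostTier extends SpillTier
case object DiskTier extends SpillTier

def chooseSpillTarget(skipHostEnabled: Boolean, wouldFitInHost: Boolean): SpillTier =
  if (skipHostEnabled || !wouldFitInHost) DiskTier // direct device -> disk spill
  else HostTier                                    // normal device -> host spill
```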

@abellina abellina changed the title Host spill skip to disk Allow skipping host spill for a direct device->disk spill Sep 8, 2023
@revans2 (Collaborator) left a comment

I did a quick pass and it is looking good. I want to spend some more time on it later to dig in a little deeper.

@sameerz added the reliability label (Features to improve reliability or bugs that severely impact the reliability of the plugin) Sep 9, 2023
@abellina (Author):
I have an issue with the posted code where I will lose a RapidsBuffer if it spills while I am spilling. This is because of how I grouped the buffers together and updated tiers all at once. I am testing a fix and will push today.
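The fix direction hinted at above can be sketched roughly like this: record each buffer's new tier one at a time under a lock, rather than batching all tier updates after the copies finish, so a concurrent spill of the same buffer is not lost. `TierCatalog` and its methods are illustrative names, not the plugin's real catalog.

```scala
import scala.collection.mutable

// Hypothetical sketch: per-buffer tier updates under a lock close the window
// in which a buffer spilled by another thread could be dropped from the
// bookkeeping.
class TierCatalog {
  private val tiers = mutable.Map.empty[Long, String]
  // record a buffer's tier immediately, not as part of a deferred batch
  def record(id: Long, tier: String): Unit = synchronized { tiers(id) = tier }
  def tierOf(id: Long): Option[String] = synchronized { tiers.get(id) }
}
```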

@abellina abellina marked this pull request as ready for review September 13, 2023 16:35
@abellina (Author):
@revans2 this is ready for another pass

@@ -659,38 +642,19 @@ class RapidsBufferCatalog(
logDebug(s"Skipping spilling $buffer ${buffer.id} to ${spillStore.name} as it is " +
s"already stored in multiple tiers")
}
>>>>>>> 740d1175060555ad373f30ed780a58215b6c3419
@revans2 (Collaborator):
I don't think the upmerge was done properly.

@abellina (Author) replied Sep 13, 2023:
You are right, this method needs to go away (it's now in the store). Fixed in 4916333.

@abellina (Author):
build

    // try one last time after locking the catalog (slow path)
    // if there is a lot of contention here, I would rather lock the world than
    // have tasks error out with "Unable to acquire"
    synchronized {
@abellina (Author):
This is not strictly necessary, but while debugging another issue I thought it wouldn't hurt. Before this change we would perform the attempts and throw, without locking the catalog and trying one last time. I don't have evidence of this saving a task that would otherwise fail, however.
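The retry-then-lock slow path described above can be sketched as follows. This is a simplified illustration under assumed names (`acquireWithRetry`, `catalogLock`, and the attempt count are not the real API; only the "Unable to acquire" message comes from the comment above).

```scala
// Hypothetical sketch: try the normal path a few times, then take the
// catalog-wide lock for one final attempt before failing the task.
def acquireWithRetry[T](catalogLock: AnyRef, attempts: Int)(
    tryAcquire: () => Option[T]): T = {
  var result: Option[T] = None
  var remaining = attempts
  while (result.isEmpty && remaining > 0) {
    result = tryAcquire()
    remaining -= 1
  }
  result.getOrElse {
    // slow path: lock the world rather than error out immediately
    catalogLock.synchronized {
      tryAcquire().getOrElse(
        throw new IllegalStateException("Unable to acquire"))
    }
  }
}
```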

@abellina (Author):
My IntelliJ was running without assertions, so I didn't see the test failures reported by CI. Fixed.

@abellina (Author):
build

@abellina abellina merged commit 34d615d into NVIDIA:branch-23.10 Sep 15, 2023
28 checks passed
@abellina abellina deleted the host_spill_skip_to_disk branch September 15, 2023 14:16
Labels
reliability Features to improve reliability or bugs that severely impact the reliability of the plugin

Successfully merging this pull request may close these issues.

[BUG] Failed to create memory map on query14_part1 at 100TB with spark.executor.cores=64