-
Notifications
You must be signed in to change notification settings - Fork 232
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Iterator to make it easier to work with a window of blocks in the RAPIDS shuffle #934
Iterator to make it easier to work with a window of blocks in the RAPIDS shuffle #934
Conversation
The code looks fine, I just don't like to add in code if it is not actively being used. |
@revans2 that's ok, I am fine closing this and trying to fit it in with another PR that targets mostly the receive/send states in the rapids shuffle. It's so far a pretty big diff 1.5K lines, so I can try reducing it. |
I am fine with putting this in if I know that there will be something coming shortly. So if you could put up a follow on PR that depends on this, then I would feel better about it, because then it is clear that you are breaking it up and that there will not be dead code for long. |
@revans2 marked it as WIP, as I put up the rest. It is going to take me a couple of days to get that second PR up likely. |
b247bec
to
dfff723
Compare
sql-plugin/src/main/scala/com/nvidia/spark/rapids/shuffle/WindowedBlockIterator.scala
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/shuffle/WindowedBlockIterator.scala
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/shuffle/WindowedBlockIterator.scala
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/shuffle/WindowedBlockIterator.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/shuffle/WindowedBlockIterator.scala
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, just a minor nit.
sql-plugin/src/main/scala/com/nvidia/spark/rapids/shuffle/WindowedBlockIterator.scala
Outdated
Show resolved
Hide resolved
build |
9a5c6d7
to
3422a3f
Compare
build |
I believe blossom-ci is stuck, something about a volume having issues. I can try and kill the job and start CI over. |
build |
1 similar comment
build |
Signed-off-by: Alessandro Bellina <abellina@nvidia.com>
Signed-off-by: Alessandro Bellina <abellina@nvidia.com>
3422a3f
to
8ba3160
Compare
Rebased to pick up CI fix for 3.1.0 |
build |
…IDS shuffle (NVIDIA#934) * Adds iterator to make it easier to work with ranges of blocks Signed-off-by: Alessandro Bellina <abellina@nvidia.com>
…IDS shuffle (NVIDIA#934) * Adds iterator to make it easier to work with ranges of blocks Signed-off-by: Alessandro Bellina <abellina@nvidia.com>
…IDS shuffle (NVIDIA#934) * Adds iterator to make it easier to work with ranges of blocks Signed-off-by: Alessandro Bellina <abellina@nvidia.com>
…IDIA#934) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Signed-off-by: Alessandro Bellina abellina@nvidia.com
This introduces an iterator that will be used in subsequent commits for the rapids shuffle. Adding it separate from them to reduce the size of PRs. If reviewers want me to put up a bigger PR with the rest of this I can, but I am trying to keep it reviewable.
The iterator, given a set of blocks will return on
next()
ranges of blocks that pack a window (of bytes)The main advantage of the approach supported by this iterator is that we utilize bounce buffers more efficiently, and also reduce the amount of calls to the underlying protocol.