
feat(kad): reuse outbound streams instead of closing them #3474

Closed
wants to merge 7 commits

Conversation

@nazar-pc (Contributor) commented Feb 15, 2023

Description

This implements outgoing stream reuse in Kademlia by storing the substream for future reuse after a message was processed successfully, instead of closing the substream right away.
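
A rough sketch of the idea, using hypothetical, simplified types rather than the actual handler code:

```rust
/// Simplified stand-in for the handler's outbound substream states
/// (names are illustrative, not the real `handler.rs` definitions).
enum OutboundSubstreamState<S> {
    /// A request is currently being written to or awaited on the substream.
    InProgress(S),
    /// The previous request completed successfully; the substream is parked
    /// for reuse instead of being driven to a terminal closing state.
    Idle(S),
    /// Flushing and closing the substream (previously the only way to finish).
    Closing(S),
}

/// For the next outbound message, prefer an idle substream over a new one.
fn reuse_or_open<S>(
    substreams: &mut Vec<OutboundSubstreamState<S>>,
    start_request: impl FnOnce(S) -> OutboundSubstreamState<S>,
    open_new_substream: impl FnOnce(),
) {
    if let Some(idx) = substreams
        .iter()
        .position(|s| matches!(s, OutboundSubstreamState::Idle(_)))
    {
        if let OutboundSubstreamState::Idle(stream) = substreams.swap_remove(idx) {
            substreams.push(start_request(stream));
        }
    } else {
        // No reusable substream: fall back to opening a fresh one.
        open_new_substream();
    }
}
```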

Notes

This is, unfortunately, not working properly, and after spending a few hours on it I could use external help.

Links to any relevant issues

This will be useful for #3468 (regardless of which version of it ends up landing).

Open Questions

None

Change checklist

  • I have performed a self-review of my own code
  • I have made corresponding changes to the documentation
  • I have added tests that prove my fix is effective or that my feature works
  • A changelog entry has been made in the appropriate crates

@nazar-pc marked this pull request as ready for review on February 17, 2023 00:36

@nazar-pc (Contributor, Author) commented

I think this is ready for review; please take a look. The changes are relatively straightforward.

Review threads on protocols/kad/src/handler.rs (several), protocols/kad/src/protocol.rs, and protocols/kad/CHANGELOG.md, all outdated and resolved.
@thomaseizinger changed the title from "feat(kad): Outgoing stream reuse" to "feat(kad): reuse outbound streams instead of closing them" on Feb 17, 2023

@nazar-pc (Contributor, Author) commented

Applied suggestions, diff is much smaller now.

While it is easy to read, it feels a bit wrong to swap something with a poisoned value and put the value back when it didn't match. This pattern is used in other places, so I adopted it for consistency, but doing memory copies on every iteration just to simplify the code might be less efficient than checking for a match first and only updating the value when necessary. On the other hand, the compiler may be smart enough to optimize the copies away since the code is quite straightforward; I didn't benchmark to compare.
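
For illustration, the pattern described above looks roughly like this (a hypothetical `State` enum stands in for the real substream states):

```rust
use std::mem;

enum State {
    Idle(Vec<u8>), // stand-in for a reusable substream
    Busy(Vec<u8>),
    Poisoned, // temporary placeholder while a value is taken out by value
}

/// Swap every element with `Poisoned`, inspect it by value, and either
/// transform it or write the original back. The swap happens even for
/// elements that don't match, which is the copying being discussed.
fn claim_idle(states: &mut [State]) -> bool {
    for slot in states.iter_mut() {
        match mem::replace(slot, State::Poisoned) {
            State::Idle(stream) => {
                *slot = State::Busy(stream); // reuse the substream
                return true;
            }
            other => *slot = other, // no match: put the original value back
        }
    }
    false
}
```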

@thomaseizinger (Contributor) left a review

One more request, otherwise LGTM.

I'd also like @mxinden to approve this. The gist is: by reusing substreams here, as the spec allows, we can make use of yamux's backpressure, because a stream's buffer will eventually fill up if the remote is slower at reading AddProvider messages than we are at writing them.

Together with a solution for backpressuring how fast we read new messages from the stream, this should allow us to saturate the network or at least get us closer to it.

Review threads on protocols/kad/src/handler.rs (one outdated; both resolved).
@thomaseizinger (Contributor) commented

> Applied suggestions, diff is much smaller now.
>
> While it is easy to read, it feels a bit wrong to swap something with a poisoned value and put the value back when it didn't match. This pattern is used in other places, so I adopted it for consistency, but doing memory copies on every iteration just to simplify the code might be less efficient than checking for a match first and only updating the value when necessary. On the other hand, the compiler may be smart enough to optimize the copies away since the code is quite straightforward; I didn't benchmark to compare.

I think we could avoid the copies by first doing a find on the list and only swapping the one that matches!(s, OutboundSubstreamState::Idle). That also creates an unreachable path, but that one would at least be fairly local (see the sketch below).

Feel free to leave as is though.
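
A sketch of that suggestion, continuing the hypothetical `State` enum from the earlier sketch: only the matching element is swapped, and the impossible path is confined to one `unreachable!` arm.

```rust
use std::mem;

// Uses the hypothetical `State` enum from the earlier sketch.
fn claim_idle_with_find(states: &mut [State]) -> bool {
    let Some(slot) = states.iter_mut().find(|s| matches!(s, State::Idle(_))) else {
        return false;
    };
    match mem::replace(slot, State::Poisoned) {
        State::Idle(stream) => {
            *slot = State::Busy(stream);
            true
        }
        // The "fairly local" unreachable path: `find` just matched `Idle`.
        _ => unreachable!("element was checked to be `State::Idle`"),
    }
}
```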

@thomaseizinger previously approved these changes on Feb 17, 2023

@thomaseizinger (Contributor) left a review

Clippy fails, otherwise LGTM.

@thomaseizinger (Contributor) commented

Waiting for @mxinden now.

@mxinden (Member) left a review

I am sorry for only reviewing this change now. Thanks for the proposal. @nazar-pc, can you please expand on the benefit of this change?

> The gist is: by reusing substreams here, as the spec allows, we can make use of yamux's backpressure, because a stream's buffer will eventually fill up if the remote is slower at reading AddProvider messages than we are at writing them.

The default initial Yamux stream receive window is 256 KiB. With this change there is never more than one Kademlia request in flight at once on a single stream. I don't think a single Kademlia request ever exceeds the stream receive window size (256 KiB), so I don't think the Yamux backpressure mechanism will ever kick in.

> Together with a solution for backpressuring how fast we read new messages from the stream, this should allow us to saturate the network or at least get us closer to it.

Looking at this from a Kademlia perspective only, I would be surprised if one could saturate a link with Kademlia requests alone, though that is only an intuition.

I would like to move away from reusing streams across all libp2p protocols. Muxers should provide backpressure on new-stream creation (e.g. QUIC does). A consumer should backpressure on the number of in-flight requests, which would equal the number of open streams, through the muxer implementation. For per-protocol backpressure see libp2p/specs#520.

@thomaseizinger (Contributor) commented

> > The gist is: by reusing substreams here, as the spec allows, we can make use of yamux's backpressure, because a stream's buffer will eventually fill up if the remote is slower at reading AddProvider messages than we are at writing them.
>
> The default initial Yamux stream receive window is 256 KiB. With this change there is never more than one Kademlia request in flight at once on a single stream.

Incorrect, the opposite is the intention. AddProvider does not have a response, so the state machine is complete after sending the message. With this change, we will instantly reuse the stream and eventually fill up the buffer if the other party is slower than us.
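
A generic sketch of that mechanism (plain `futures::Sink`, not the actual handler code): pipelining fire-and-forget messages on one reused stream means each send only completes while the stream can buffer more, so a slow reader eventually stalls the writer.

```rust
use futures::{Sink, SinkExt};

/// Pipeline response-less messages (like AddProvider) on a single stream.
/// `send` waits until the sink accepts the item; once the stream's send
/// buffer / flow-control window is exhausted by a slow remote, this loop
/// stalls, and that stall is the backpressure.
async fn pipeline<S, M>(stream: &mut S, messages: Vec<M>) -> Result<(), S::Error>
where
    S: Sink<M> + Unpin,
{
    for msg in messages {
        stream.send(msg).await?;
    }
    Ok(())
}
```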

> I would like to move away from reusing streams across all libp2p protocols. Muxers should provide backpressure on new-stream creation (e.g. QUIC does). A consumer should backpressure on the number of in-flight requests, which would equal the number of open streams, through the muxer implementation. For per-protocol backpressure see libp2p/specs#520.

I would like to move away from it too. But it is going to take a while before we can design and ship backpressure via status codes.

I would argue that it doesn't matter how we ship the feature. We have a spec-compliant solution here. Once we have backpressure via error codes or something else working, we should be able to transparently (for the user) change the implementation.

This gives us the best of both worlds: a working solution today, and it buys us time to design something more general.

@nazar-pc (Contributor, Author) commented

> I would like to move away from reusing streams across all libp2p protocols. Muxers should provide backpressure on new-stream creation (e.g. QUIC does). A consumer should backpressure on the number of in-flight requests, which would equal the number of open streams, through the muxer implementation. For per-protocol backpressure see libp2p/specs#520.

I'm curious: what is the benefit of re-creating streams over and over again rather than creating one once and using it for as long as necessary? It sounds less efficient and trickier to manage.

@mxinden (Member) commented Feb 20, 2023

> > The gist is: by reusing substreams here, as the spec allows, we can make use of yamux's backpressure, because a stream's buffer will eventually fill up if the remote is slower at reading AddProvider messages than we are at writing them.
>
> > The default initial Yamux stream receive window is 256 KiB. With this change there is never more than one Kademlia request in flight at once on a single stream.
>
> Incorrect, the opposite is the intention. AddProvider does not have a response, so the state machine is complete after sending the message. With this change, we will instantly reuse the stream and eventually fill up the buffer if the other party is slower than us.

Good point.

> > I would like to move away from reusing streams across all libp2p protocols. Muxers should provide backpressure on new-stream creation (e.g. QUIC does). A consumer should backpressure on the number of in-flight requests, which would equal the number of open streams, through the muxer implementation. For per-protocol backpressure see libp2p/specs#520.
>
> I would like to move away from it too. But it is going to take a while before we can design and ship backpressure via status codes.

I don't think we need status codes. libp2p-quic (i.e. quinn) gives us a way to limit the number of inbound streams; I think the default is 100. As long as we don't drop an inbound stream, the remote is eventually backpressured by that limit if we use a new stream per request.
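
For reference, the muxer-level knob looks roughly like this in quinn; libp2p-quic applies such limits internally, so the wiring shown here is illustrative:

```rust
/// Illustrative quinn transport configuration (not libp2p-quic's actual code).
fn transport_config() -> quinn::TransportConfig {
    let mut cfg = quinn::TransportConfig::default();
    // Cap how many bidirectional streams the remote may have open at once.
    // If we keep inbound streams alive instead of dropping them, a sender
    // that opens one stream per request is blocked from opening more once
    // the limit is reached: that is the backpressure.
    cfg.max_concurrent_bidi_streams(quinn::VarInt::from_u32(100));
    cfg
}
```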

> > I would like to move away from reusing streams across all libp2p protocols. Muxers should provide backpressure on new-stream creation (e.g. QUIC does). A consumer should backpressure on the number of in-flight requests, which would equal the number of open streams, through the muxer implementation. For per-protocol backpressure see libp2p/specs#520.
>
> I'm curious: what is the benefit of re-creating streams over and over again rather than creating one once and using it for as long as necessary?

When reusing streams, how does a producer know how many concurrent requests it can send? In other words, how does it know how many streams it can open and keep open? I would prefer not to keep the magic number 32 that we have today.

When not reusing streams, the consumer provides a dynamic limit on the number of streams via the multiplexer's stream-creation backpressure mechanism: by keeping streams alive on the inbound side, the outbound side is backpressured from creating new outbound streams, i.e. new outbound requests.

When reusing streams and sending requests without a response (here the AddProvider case), how do you prevent head-of-line blocking?

When not reusing streams, the underlying transport can do retransmits on a per-stream basis, thus one request is not head-of-line blocked by another.

> It sounds less efficient

I would treat streams as free, at least from a networking perspective. What makes you think they are expensive?

@nazar-pc (Contributor, Author) commented

> I would prefer not to keep the magic number 32 that we have today.

Fair, I'm certainly on board with this.

> When reusing streams and sending requests without a response (here the AddProvider case), how do you prevent head-of-line blocking?
>
> When not reusing streams, the underlying transport can do retransmits on a per-stream basis, thus one request is not head-of-line blocked by another.

I'm not as familiar with the implementation, but I assumed that the whole point of streams within the same TCP connection is that one stream doesn't block another. Otherwise, what is the point?

> I would treat streams as free, at least from a networking perspective. What makes you think they are expensive?

Just intuitively, it feels like some sort of negotiation is necessary, so when we're talking about tens or hundreds of concurrent requests on each peer for hours, it adds up. Especially because messages are small, the overhead will be larger percentage-wise. I might be completely wrong though; I'm not familiar with the implementation details yet.

@thomaseizinger (Contributor) commented

> > > I would like to move away from reusing streams across all libp2p protocols. Muxers should provide backpressure on new-stream creation (e.g. QUIC does). A consumer should backpressure on the number of in-flight requests, which would equal the number of open streams, through the muxer implementation. For per-protocol backpressure see libp2p/specs#520.
> >
> > I would like to move away from it too. But it is going to take a while before we can design and ship backpressure via status codes.
>
> I don't think we need status codes. libp2p-quic (i.e. quinn) gives us a way to limit the number of inbound streams; I think the default is 100. As long as we don't drop an inbound stream, the remote is eventually backpressured by that limit if we use a new stream per request.

But our QUIC implementation is in alpha status and thus not recommended for production, I guess?

In general, I don't think these efforts are incompatible with each other.

Until we have a backpressure mechanism across all transports and muxers, we will need to keep the limit to mitigate DoS attacks. As long as we have the limit, we will drop new inbound streams during bursts of requests that have no response, because the client-side limit of 32 gets "out of sync" with the server side.

Why can't we provide backpressure through reuse in Kademlia now and switch to backpressure through new streams at a later point? Both are spec-compliant because streams CAN be reused but don't have to be.

@thomaseizinger (Contributor) commented

> > When reusing streams and sending requests without a response (here the AddProvider case), how do you prevent head-of-line blocking?
> >
> > When not reusing streams, the underlying transport can do retransmits on a per-stream basis, thus one request is not head-of-line blocked by another.
>
> I'm not as familiar with the implementation, but I assumed that the whole point of streams within the same TCP connection is that one stream doesn't block another. Otherwise, what is the point?

Streams avoid HoL blocking between each other. Once you start to pipeline requests within a stream, you have HoL blocking again. In fact, the HoL blocking is desirable in this case because it creates a form of backpressure.

> > I would treat streams as free, at least from a networking perspective. What makes you think they are expensive?
>
> Just intuitively, it feels like some sort of negotiation is necessary, so when we're talking about tens or hundreds of concurrent requests on each peer for hours, it adds up. Especially because messages are small, the overhead will be larger percentage-wise. I might be completely wrong though; I'm not familiar with the implementation details yet.

No negotiation is necessary for new streams. QUIC operates on a locally-known budget. Yamux only sets a flag on a new payload frame to indicate a new stream. Thus, the only overhead is the muxer's framing, which applies to new and existing streams alike.

Together with upgrade::Version::V1Lazy, the multistream-select handshake for new streams costs zero round trips, only a few bytes of overhead. That is a small price to pay for the benefit of dynamically scaling resource usage.
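
For context, V1Lazy is selected when upgrading the transport. A typical builder looks roughly like this; constructor names shift between libp2p releases, so treat the details as illustrative:

```rust
use libp2p::core::{muxing::StreamMuxerBox, transport::Boxed, upgrade};
use libp2p::{identity, noise, tcp, yamux, PeerId, Transport};

/// Illustrative transport setup; module and constructor names vary by release.
fn build_transport(keypair: &identity::Keypair) -> Boxed<(PeerId, StreamMuxerBox)> {
    tcp::async_io::Transport::new(tcp::Config::default())
        // V1Lazy: optimistically assume the remote supports the protocol,
        // saving a round trip on every new stream for a few extra bytes.
        .upgrade(upgrade::Version::V1Lazy)
        .authenticate(noise::Config::new(keypair).expect("valid keypair"))
        .multiplex(yamux::Config::default())
        .boxed()
}
```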

@mergify (bot) commented Mar 2, 2023

This pull request has merge conflicts. Could you please resolve them @nazar-pc? 🙏

@mergify (bot) commented Apr 11, 2023

This pull request has merge conflicts. Could you please resolve them @nazar-pc? 🙏

@mxinden (Member) commented May 30, 2023

Closing here as I think we reached consensus that reusing a stream is not the way forward. Please comment in case you disagree.

@mxinden closed this on May 30, 2023.
@nazar-pc deleted the kad-stream-reuse branch on June 12, 2023.