Add an UDP listening endpoint to prove reachability #563

tomaka · 2020-03-02T12:18:59Z

With the intent to solve #564

The idea is to add a small UDP listening endpoint that answers requests. Then we can tackle #564 by sending out these requests.

cc'ing @romanb @arkpar @kirushik

tomaka · 2020-03-02T12:23:16Z

I can see two possible ways to design the protocol:

We can copy the ICMP echo (ping) message, or the Kademlia PING message, and just have the sender emit an arbitrary payload which the recipient echoes back in its answer.
Or we can provide more security by proving that the node owns its key.

If we go with the second option, after a quick brainstorm I came up with a packet containing:

A flag indicating whether this is a request or a response
A timestamp of the emission
Some implementation-specific bytes
The node's network public key.
A signature of hash(concat(flag, timestamp, random_bytes)) using that key.

Recipient sends back an answer containing the bytes that the sender sent.
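
A minimal sketch of how the packet brainstormed above could be laid out on the wire. Everything beyond the five listed fields is an assumption made for illustration: the type and method names, the millisecond timestamp, the one-byte length prefix, and the 32-byte key and 64-byte signature sizes (as for Ed25519). Hashing and signing are left out because they need a crypto crate; `signing_payload` only builds the `concat(flag, timestamp, random_bytes)` input.

```rust
use std::convert::{TryFrom, TryInto};

/// Hypothetical packet: flag | timestamp | len | random_bytes | public_key | signature.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct PingPacket {
    /// false = request, true = response.
    pub is_response: bool,
    /// Emission time; milliseconds since the Unix epoch is an assumption.
    pub timestamp_ms: u64,
    /// The arbitrary bytes the recipient echoes back (at most 255 here).
    pub random_bytes: Vec<u8>,
    /// The node's network public key (32 bytes assumed).
    pub public_key: [u8; 32],
    /// Signature over hash(signing_payload()) (64 bytes assumed).
    pub signature: [u8; 64],
}

impl PingPacket {
    /// Bytes to be hashed and signed: concat(flag, timestamp, random_bytes).
    pub fn signing_payload(&self) -> Vec<u8> {
        let mut out = Vec::with_capacity(9 + self.random_bytes.len());
        out.push(self.is_response as u8);
        out.extend_from_slice(&self.timestamp_ms.to_be_bytes());
        out.extend_from_slice(&self.random_bytes);
        out
    }

    /// Serializes the packet; returns None if random_bytes exceeds 255 bytes.
    pub fn encode(&self) -> Option<Vec<u8>> {
        let len = u8::try_from(self.random_bytes.len()).ok()?;
        let mut out = Vec::with_capacity(10 + self.random_bytes.len() + 32 + 64);
        out.push(self.is_response as u8);
        out.extend_from_slice(&self.timestamp_ms.to_be_bytes());
        out.push(len);
        out.extend_from_slice(&self.random_bytes);
        out.extend_from_slice(&self.public_key);
        out.extend_from_slice(&self.signature);
        Some(out)
    }

    /// Parses a packet; returns None on any length or flag mismatch.
    pub fn decode(bytes: &[u8]) -> Option<Self> {
        if bytes.len() < 10 {
            return None;
        }
        let is_response = match bytes[0] {
            0 => false,
            1 => true,
            _ => return None,
        };
        let timestamp_ms = u64::from_be_bytes(bytes[1..9].try_into().ok()?);
        let len = bytes[9] as usize;
        let rest = &bytes[10..];
        if rest.len() != len + 32 + 64 {
            return None;
        }
        let (random, rest) = rest.split_at(len);
        let (pk, sig) = rest.split_at(32);
        Some(Self {
            is_response,
            timestamp_ms,
            random_bytes: random.to_vec(),
            public_key: pk.try_into().ok()?,
            signature: sig.try_into().ok()?,
        })
    }
}
```

A receiver would decode the datagram, verify the signature over the hash of `signing_payload()` with whatever signature scheme the node key uses, and only then answer.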

romanb · 2020-03-02T12:42:34Z

Something that I don't understand is the following: The addresses reported by a peer are multiaddresses. A multiaddress specifies the transport protocol to use. To check whether the address is reachable, we surely must use the transport specified in the address? I.e. in the context of #564 I don't even see how there can be such a choice of transport protocol?

tomaka · 2020-03-02T12:56:26Z

> I.e. in the context of #564 I don't even see how there can be such a choice of transport protocol?

To me, that's why we're going to put this code in Substrate and not libp2p.
Substrate enforces that the transport is TCP/IP (or WebSockets).
Considering that Substrate doesn't support any other protocol, if we receive a multiaddress that isn't /ip/.../tcp/...(/ws), we can instantly consider it unreachable without going through any UDP pinging.
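
To illustrate the early rejection described above, here is a rough sketch that works on the textual form of a multiaddress. The function name is hypothetical, and reading `/ip/` as `/ip4/` or `/ip6/` is an assumption; real code would more likely match on the parsed protocol components of a multiaddress type rather than split strings, and might accept further forms (DNS names, for instance) that this sketch rejects.

```rust
use std::net::{Ipv4Addr, Ipv6Addr};

/// Returns true if `addr` has the shape /ip4|ip6/<host>/tcp/<port>, optionally
/// followed by /ws. Anything else would be considered unreachable right away,
/// without any UDP pinging.
pub fn is_supported_transport(addr: &str) -> bool {
    // "/ip4/1.2.3.4/tcp/30333" splits into ["", "ip4", "1.2.3.4", "tcp", "30333"].
    let parts: Vec<&str> = addr.split('/').collect();
    match parts.len() {
        5 => {}
        6 if parts[5] == "ws" => {}
        _ => return false,
    }
    if !parts[0].is_empty() || parts[3] != "tcp" {
        return false;
    }
    let host_ok = match parts[1] {
        "ip4" => parts[2].parse::<Ipv4Addr>().is_ok(),
        "ip6" => parts[2].parse::<Ipv6Addr>().is_ok(),
        _ => false,
    };
    host_ok && parts[4].parse::<u16>().is_ok()
}
```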

In the long term, though, we can probably remove this feature once we have switched to QUIC?

kirushik · 2020-03-02T18:15:53Z

(Here I assume that dialing UDP on the same port as your TCP listeners is a good design and a feature we want to keep; it's far from obvious, but for now let's ignore that)

There are two major threat categories here:
a) malicious peer M would falsely identify to our node N, forcing N to send undesired UDP pings to a victim node V;
b) malicious peer M would ping N with a return address of their choice, so N would send undesired UDP pongs to V.

Threat (a) can't be fully mitigated, since sending UDP packets to sometimes-unreachable endpoints is the explicit goal of the design; we should still take care of the following:

we should set a reasonable limit on the total number of addresses a peer can send us through Identify;
we shouldn't retry pings;
we shouldn't ping the same address too frequently even if it was advertised from multiple (potentially malicious) peers;
we should rate-limit pinging, ideally by both "maximal pings sent per second" and "maximal allowed pings in-flight" metrics (we must clean up the in-flight pings tracker after a while, though — otherwise M can "ping-starve" us, occupying all the inflight tracking slots with unreachable addresses);
we shouldn't ping addresses in private subnetworks (doesn't make much sense if we're talking about globally-visible DHT records anyway);
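
The rate-limiting points in the list above could be sketched roughly as follows. This is only an illustration under assumptions: the type, its fields and the fixed one-second window are all hypothetical, and the caller passes `now` explicitly so the logic stays deterministic. It covers the per-second cap, the in-flight cap with expiry (so unreachable addresses cannot hold slots forever), and the per-address minimum interval, which also prevents immediate retries.

```rust
use std::collections::HashMap;
use std::net::SocketAddr;
use std::time::{Duration, Instant};

/// Hypothetical limiter deciding whether a UDP ping may be sent.
pub struct PingLimiter {
    max_per_second: usize,
    max_in_flight: usize,
    in_flight_ttl: Duration,
    min_interval_per_addr: Duration,
    /// Pings sent but not yet answered, with their send time.
    in_flight: HashMap<SocketAddr, Instant>,
    /// Last time each address was pinged, whoever advertised it.
    /// A real implementation would also need to bound this map.
    last_pinged: HashMap<SocketAddr, Instant>,
    window_start: Instant,
    sent_in_window: usize,
}

impl PingLimiter {
    pub fn new(
        max_per_second: usize,
        max_in_flight: usize,
        in_flight_ttl: Duration,
        min_interval_per_addr: Duration,
        now: Instant,
    ) -> Self {
        PingLimiter {
            max_per_second,
            max_in_flight,
            in_flight_ttl,
            min_interval_per_addr,
            in_flight: HashMap::new(),
            last_pinged: HashMap::new(),
            window_start: now,
            sent_in_window: 0,
        }
    }

    /// Returns true and records the ping if sending to `addr` is allowed now.
    pub fn try_ping(&mut self, addr: SocketAddr, now: Instant) -> bool {
        // Forget in-flight pings older than the TTL to avoid "ping-starvation".
        let ttl = self.in_flight_ttl;
        self.in_flight.retain(|_, sent| now.duration_since(*sent) < ttl);

        // Start a new one-second window if the current one has elapsed.
        if now.duration_since(self.window_start) >= Duration::from_secs(1) {
            self.window_start = now;
            self.sent_in_window = 0;
        }
        if self.sent_in_window >= self.max_per_second {
            return false;
        }
        if self.in_flight.len() >= self.max_in_flight {
            return false;
        }
        if let Some(last) = self.last_pinged.get(&addr) {
            if now.duration_since(*last) < self.min_interval_per_addr {
                return false;
            }
        }
        self.sent_in_window += 1;
        self.in_flight.insert(addr, now);
        self.last_pinged.insert(addr, now);
        true
    }

    /// Frees the in-flight slot; returns false if no ping was pending for `addr`.
    pub fn on_pong(&mut self, addr: &SocketAddr) -> bool {
        self.in_flight.remove(addr).is_some()
    }
}
```

The limit on addresses per Identify message and the private-subnet check would sit in front of this, before an address is ever handed to `try_ping`.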

The same list also improves our defenses against (b) when applied to PONGs (since those are general measures protecting the ability to send UDP from abuse), but we would need some additional points:

PONG packet should be shorter than PING, otherwise UDP amplification attacks would be possible;
we need some protection against replay of PING packets, probably by adding an expiration field and ignoring pings whose expiration time is either in the past or too far (say, more than 10-15 seconds) in the future. Careful: this would make the identification+ping protocol sensitive to accurate clock sync.
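
As a sketch of the expiration check suggested above: the function and constant names are hypothetical, timestamps are assumed to be whole seconds since the Unix epoch, and 15 seconds is just the upper end of the range mentioned.

```rust
/// Hypothetical upper bound on how far in the future an expiration may lie.
pub const MAX_FUTURE_SECS: u64 = 15;

/// Accepts a PING only if its expiration is not in the past and not more than
/// MAX_FUTURE_SECS ahead of the local clock.
pub fn expiration_is_acceptable(expiration_secs: u64, now_secs: u64) -> bool {
    expiration_secs >= now_secs && expiration_secs - now_secs <= MAX_FUTURE_SECS
}
```

Both bounds depend on the local clock, which is where the clock-sync sensitivity comes from: a receiver whose clock runs 20 seconds behind would reject every honest ping as too far in the future.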

burdges · 2020-03-02T19:03:02Z

If I understand, this proves nothing about the identity of the node reached, except perhaps that substrate runs there, right?

arkpar · 2020-03-02T19:29:10Z

@burdges pongs should be signed by the key that matches expected peer id.

devp2p design has a few strong points on rate limiting and replay protection already.
For reference:
Current version that includes timestamp in the ping message:
https://github.com/ethereum/devp2p/blob/master/discv4.md
Proposed version:
https://github.com/ethereum/devp2p/blob/master/discv5/discv5.md
I'd recommend whoever's going to work on this to study devp2p discovery first.

twittner · 2020-04-20T14:24:08Z

Given that we support multiple listen addresses we would need to bind one UDP socket to each address in order to answer PINGs to those addresses if they are reported via identify.
A PONG over UDP would also not prove that the service is reachable over TCP, so at best we get some indication that the address might work. In other words, we just want better heuristics to decide which addresses to consider and which ones to ignore. To achieve that, we could filter addresses based on the address ranges they belong to. This could be done in libp2p-identify which would produce more relevant results.

Currently every address reported via libp2p-identify is inserted into the DHT which thus contains a multitude of unreachable addresses such as from 127.0.0.0/8 or 10.0.0.0/8. Issue #5099 suggested a dedicated service over UDP to gauge the reachability of an address, which would however incur extra I/O costs and be of limited use. As an alternative and simpler tactic, this PR only allows global IP addresses to be inserted into the DHT unless an explicit command-line flag `--allow-non-global-addresses-in-dht` is given or a node is started with `--dev`. This opt-in behaviour is meant to allow site-local networks to still make use of a DHT.
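
For illustration, a range-based filter of this kind might look like the sketch below. It is not the code from the PR: the function name is hypothetical, and since the standard library's `is_global` methods are not stable, it approximates "global" by excluding a handful of well-known non-global ranges.

```rust
use std::net::IpAddr;

/// Approximation of "globally routable": rejects loopback, private,
/// link-local and similar ranges. Not an exhaustive list of special ranges.
pub fn is_probably_global(ip: IpAddr) -> bool {
    match ip {
        IpAddr::V4(v4) => {
            !(v4.is_unspecified()
                || v4.is_loopback()      // 127.0.0.0/8
                || v4.is_private()       // 10.0.0.0/8, 172.16.0.0/12, 192.168.0.0/16
                || v4.is_link_local()    // 169.254.0.0/16
                || v4.is_broadcast()
                || v4.is_documentation())
        }
        IpAddr::V6(v6) => {
            let first = v6.segments()[0];
            let unique_local = (first & 0xfe00) == 0xfc00; // fc00::/7
            let link_local = (first & 0xffc0) == 0xfe80; // fe80::/10
            !(v6.is_unspecified() || v6.is_loopback() || unique_local || link_local)
        }
    }
}
```

A node started with the opt-in flag or with `--dev` would simply skip this check before inserting an address into the DHT.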


tomaka · 2020-04-29T12:34:27Z

To expand on the previous comment, the problem we have is: if a node connects to us through TCP, how do you know which UDP port to ping?

Do we hard-code a specific port into the client? This means that you could, for example, no longer start two nodes on the same machine, and it is generally frowned upon to not make ports configurable.

But if we make the port configurable, then we need to somehow communicate to remotes which UDP port to try. This means designing a new protocol just for that.

burdges · 2020-04-29T12:45:49Z

There are claims that merely opening a QUIC connection might prove heavier than you'd like, but computationally the crypto for opening a QUIC connection should only cost twice the unencrypted signed message proposed here. We could gain other stuff from doing that if either idling QUIC connections proves cheap or if we used them to initiate 0-RTT.


tomaka added the J0-enhancement label Mar 2, 2020

tomaka assigned twittner Apr 17, 2020

This was referenced Apr 20, 2020

Discv5 interest? libp2p/rust-libp2p#1551

Closed

Verify that addresses reported by identify are correct before inserting them in the DHT #564

Open

twittner mentioned this issue Apr 22, 2020

network: Only insert global addresses into the DHT. paritytech/substrate#5735

Merged

bkchr unassigned twittner Apr 26, 2022

altonen added the U4-some_day_maybe label Dec 14, 2022

altonen transferred this issue from paritytech/substrate Aug 24, 2023

the-right-joyce added I5-enhancement An additional feature request. and removed J0-enhancement labels Aug 25, 2023

claravanstaden pushed a commit to Snowfork/polkadot-sdk that referenced this issue Dec 8, 2023

Added e2e tests for One-click Parachain (paritytech#563)

11c8302

* rebased main and fixed tests * Added doc comments * changed error handling to log on failure * fixed new ethereum tests

helin6 pushed a commit to boolnetwork/polkadot-sdk that referenced this issue Feb 5, 2024

Versioned runtimes fixes (paritytech#563)

7e6992f

* Use versioned `extrinsic_filter` on pending transactions rpc * Use versioned `current_block` in eth_call when no gas limit * fmt

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Mar 26, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

500e06e

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Mar 27, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

364e99b

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 8, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

4d3573d

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 8, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

0b789ea

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 8, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

1e2457f

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 8, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

676adf6

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 8, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

25f38e0

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 9, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

9c2d732

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 9, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

8f1903b

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 9, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

71c5ba5

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 9, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

442cc36

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 9, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

825ab1f

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 9, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

0d5c387

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 10, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

f14675c

serban300 pushed a commit to serban300/polkadot-sdk that referenced this issue Apr 10, 2024

Bump serde_json from 1.0.59 to 1.0.60 (paritytech#563)

be02175

bkchr pushed a commit that referenced this issue Apr 10, 2024

Bump serde_json from 1.0.59 to 1.0.60 (#563)

091a9ba

github-actions bot mentioned this issue Jun 5, 2024

Update polkadot-sdk from v1.10.0 to v1.11.0 moondance-labs/tanssi#577

Closed
