Support running the relay chain node as an extra process #545

bkchr · 2021-07-21T14:53:10Z

The Vision

Currently every parachain node includes a relay chain node. This relay chain node is required to get certain information about the relay chain that is important for the parachain, for example to query the current best parachain block. While this makes it relatively easy to run a parachain node, it also brings some problems, like the high compilation time, as we need to compile both the parachain node and the relay chain node. Another, bigger problem is that parachain developers are required to update their collators whenever a new relay chain release requires a timely update, for example because a new host function was added or something in the relay chain client code was fixed.

So, it would be nice to have the relay chain node running as an extra process. Collator operators would just run an extra relay chain node (that could maybe even be shared between multiple parachain nodes, but this is not an initial requirement!) and could freely update the relay chain node. The relay chain node itself could maybe directly provide the functionality required by Cumulus to connect to it, or we provide some sort of wrapper (probably the best way for the first implementation).

The Plan

This feature would be implemented in the following order:

1. Refactor all usages of the polkadot client to have them behind some common trait or maybe multiple traits. So, we should not have any reference to polkadot-service or polkadot-client in any of the "low-level" functionality of Cumulus. It should only use these interfaces to talk to the relay chain.
2. Write an implementation of these traits for the "in-node relay chain" so that we are back on par with the current implementation.
3. Research the best way to implement the inter-process communication. Maybe some sort of JSON-RPC over https://crates.io/crates/parity-tokio-ipc or whatever.
4. Implement the wrapper and make it work. Running the relay chain as an external process should always be optional. So, if the feature is compiled in, it should be enabled via some CLI flag or something.
5. ...
6. Profit :P
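To illustrate the first two steps of the plan, here is a minimal sketch of what "hiding the relay chain behind a trait" could look like. Everything in it is hypothetical: the trait `RelayChainAccess`, the type `InProcessRelayChain`, and the simplified `u64` hashes are invented for illustration and are not the actual Cumulus interface. It is also synchronous for brevity, whereas a real interface would likely be async.

```rust
use std::collections::BTreeMap;

/// Illustrative stand-ins; a real node would use 32-byte hashes and typed IDs.
pub type RelayHash = u64;
pub type ParaId = u32;

/// Errors a relay chain backend may report (illustrative only).
#[derive(Debug, PartialEq)]
pub enum RelayChainError {
    UnknownBlock(RelayHash),
}

/// Hypothetical interface (not the actual Cumulus trait). Low-level code
/// depends only on this trait, never on a concrete relay chain client.
pub trait RelayChainAccess {
    /// Hash of the best relay chain block known to the backend.
    fn best_block(&self) -> Result<RelayHash, RelayChainError>;
    /// Encoded head of `para` as seen at relay chain block `at`, if any.
    fn para_head(&self, at: RelayHash, para: ParaId)
        -> Result<Option<Vec<u8>>, RelayChainError>;
}

/// Stand-in for the "in-node relay chain": answers from local state.
/// An out-of-process variant would implement the same trait over IPC or RPC.
pub struct InProcessRelayChain {
    best: RelayHash,
    heads: BTreeMap<RelayHash, BTreeMap<ParaId, Vec<u8>>>,
}

impl InProcessRelayChain {
    /// Creates a backend that knows a single block, `best`, with no para heads.
    pub fn new(best: RelayHash) -> Self {
        let mut heads = BTreeMap::new();
        heads.insert(best, BTreeMap::new());
        Self { best, heads }
    }

    /// Records the head of `para` at relay chain block `at`.
    pub fn insert_para_head(&mut self, at: RelayHash, para: ParaId, head: Vec<u8>) {
        self.heads.entry(at).or_default().insert(para, head);
    }
}

impl RelayChainAccess for InProcessRelayChain {
    fn best_block(&self) -> Result<RelayHash, RelayChainError> {
        Ok(self.best)
    }

    fn para_head(&self, at: RelayHash, para: ParaId)
        -> Result<Option<Vec<u8>>, RelayChainError>
    {
        let block = self.heads.get(&at).ok_or(RelayChainError::UnknownBlock(at))?;
        Ok(block.get(&para).cloned())
    }
}

/// Example of "low-level" code that only talks to the trait object.
pub fn para_head_at_best(relay: &dyn RelayChainAccess, para: ParaId)
    -> Result<Option<Vec<u8>>, RelayChainError>
{
    let best = relay.best_block()?;
    relay.para_head(best, para)
}
```

With this shape, swapping the in-node backend for an external one should only require a second implementation of the trait; callers such as `para_head_at_best` stay unchanged.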

Open Questions

If you want to help us out and contribute to this issue, in this section you can find open questions and tasks where we would appreciate any input.

Here you can find the board with specific sub-tasks to this milestone:
https://github.com/orgs/paritytech/projects/18/views/8

xlc · 2021-07-21T21:59:21Z

This should also make it possible to share a relaychain node between multiple parachain nodes?

Will this make it possible to allow relaychain part and parachain part using different versions of Substrate?

nuke-web3 · 2021-07-22T01:21:39Z

Would you be able to have remote relay-nodes that collators could connect to? If so, what requirements (latency, bandwidth, etc.) should be outlined?

bkchr · 2021-07-22T09:28:50Z

> This should also make it possible to share a relaychain node between multiple parachain nodes?

Yes, as written above. However, I don't see this as an initial requirement for this issue.

> Will this make it possible to allow relaychain part and parachain part using different versions of Substrate?

This is the whole point, or better: parachain collators should not be required to update when the relay chain requires a node update.

bkchr · 2021-07-22T09:30:49Z

> Would you be able to have remote relay-nodes that collators could connect to? If so, what requirements (latency, bandwidth, etc.) should be outlined?

As written in the issue, maybe. As this is not an initial requirement, I don't think we need to outline any of that.

bkchr · 2021-07-22T09:34:35Z

I also just realized that this will probably be a little bit more complicated for collators, as collators do not only read data from the relay chain. They also get called by the overseer when they need to produce a new PoV and give it back to the overseer. This would be a little bit more time-critical, but should also be solvable.

skunert · 2022-03-02T10:43:21Z

After #963 was merged, it is now possible to start a parachain full node by passing the address to a relay chain full node. The parachain node will not internally create a relay-chain node but fetch all needed data via RPC. Be aware that we are viewing this as an experimental feature currently.
Note: Collation is not supported at this time

Example command (assumes relay chain full node running locally on ws-port 9944):
polkadot-collator --tmp --relay-chain-rpc-url "ws://localhost:9944"

crystalin · 2022-03-02T15:33:00Z

I know this is not directly the target of this change, but in the one relay node => many parachain nodes scenario, it would be better if the parachain nodes could specify multiple --relay-chain-rpc-url values for redundancy (for example, 20 parachain nodes pointing to the same 2 relay nodes).
This would allow the parachain nodes to keep syncing if one of the relay nodes goes down.

purestaketdb · 2022-04-07T18:04:49Z

Similar to @crystalin's feedback, the two main use cases we are interested in are:

Not requiring updates to the collators when the relay chain version changes. This is not initially supported per the comments of July 2021.
Allowing RPC servers to avoid running their own relay chains. However, for resilience, it would be preferable if either a set of relays could be designated OR if it would be supported/safe to run a set of relay RPCs behind a load balancer.

skunert · 2022-04-11T11:20:07Z

Thanks @crystalin and @purestaketdb for the feedback. Having multiple relay chain nodes to connect to is something I have thought about. I can look into it once the collation over RPC feature is ready.
The main challenge will be to sort out subscription handling for this. Currently, we are listening to RPC subscriptions that notify us about new blocks on the relay chain. To switch relay-chain nodes on the fly, we need to gracefully continue a new subscription where the old one left off.
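As a rough illustration of "continuing where the old subscription left off", the sketch below tracks the last relay chain block number that was delivered and, when a notification from a newly connected node skips ahead, reports the gap that would need to be backfilled. The `BlockCursor` type is hypothetical, and the sketch deliberately ignores forks and works on block numbers only, which a real implementation could not do.

```rust
/// Hypothetical helper: remembers the last relay chain block number that
/// was handed to the consumer, independent of which node reported it.
pub struct BlockCursor {
    last_seen: Option<u32>,
}

impl BlockCursor {
    pub fn new() -> Self {
        Self { last_seen: None }
    }

    /// Called for every "new block" notification, from the old or the new
    /// subscription. Returns the block numbers the consumer should process,
    /// in order: any blocks missed since the last notification, followed by
    /// the notified block itself. Duplicate or older notifications yield an
    /// empty list.
    pub fn advance(&mut self, notified: u32) -> Vec<u32> {
        let start = match self.last_seen {
            // First notification ever: nothing to backfill.
            None => notified,
            // Already processed (e.g. the new node lags behind): skip.
            Some(last) if notified <= last => return Vec::new(),
            // Normal case or gap: continue right after the last seen block.
            Some(last) => last + 1,
        };
        self.last_seen = Some(notified);
        (start..=notified).collect()
    }
}
```

For example, if blocks 10 and 11 were seen on the old subscription and the new node's first notification is block 14, `advance(14)` returns 12, 13 and 14, so the missed blocks can be fetched explicitly before processing continues.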

skunert · 2022-10-10T08:52:45Z

Experimental support for relay chain collators is now merged. You can try this by passing the --relay-chain-rpc-url argument together with --collator. Network-related relay chain args are still respected, so you can pass an extra set of bootnodes or other configs.

Next phase is additional testing to discover potential problems.

Road forward:

Investigate having a set of relay chain nodes as suggested
Improve error handling: network stability is currently assumed and we are not resilient against failing requests (we recommend running the relay chain full node locally)
Look into improved logging and debuggability
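The first two road-forward items (a set of relay chain nodes, resilience against failing requests) can be pictured with a small failover helper. This is only a sketch under stated assumptions: `request_with_failover` is a hypothetical function, the actual request is passed in as a closure so that no real RPC client API is implied, and the endpoints are tried strictly in order without retries or backoff.

```rust
/// Hypothetical helper: runs `request` against each endpoint in order and
/// returns the first successful response. If every endpoint fails, the
/// collected (endpoint, error) pairs are returned so the caller can log them.
pub fn request_with_failover<T, E, F>(
    endpoints: &[&str],
    mut request: F,
) -> Result<T, Vec<(String, E)>>
where
    F: FnMut(&str) -> Result<T, E>,
{
    let mut failures = Vec::new();
    for &url in endpoints {
        match request(url) {
            // First endpoint that answers wins.
            Ok(response) => return Ok(response),
            // Remember the failure and move on to the next endpoint.
            Err(err) => failures.push((url.to_string(), err)),
        }
    }
    Err(failures)
}
```

Called with two made-up endpoints such as `ws://relay-a:9944` and `ws://relay-b:9944`, a request that fails on the first node is transparently answered by the second. Note that this only covers one-shot requests; subscriptions need the additional handover logic discussed earlier in the thread.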

skunert · 2023-01-17T17:42:03Z

The points of the last comment have been addressed in #1880. Future enhancements can have their separate issues.

bkchr added F8-enhancement 🎁 labels Jul 21, 2021

bkchr assigned skunert Oct 1, 2021

nuke-web3 mentioned this issue Oct 21, 2021

[Content] Update docs on Arcive vs. full node polkadot-developers/substrate-docs#406

Closed

bkchr mentioned this issue Oct 28, 2021

Feature gate client side of parachain inherent #708

Closed

skunert mentioned this issue Dec 6, 2021

Introduce interface for relay chain interaction #835

Merged

bkchr mentioned this issue Jan 28, 2022

Support running full nodes with overseer enabled paritytech/polkadot#4763

Closed

skunert mentioned this issue Feb 7, 2022

Introduce rpc client for relay chain full node #963

Merged

Garandor mentioned this issue Feb 8, 2022

Expose more ports in dockerfile Manta-Network/Manta#373

Merged

skunert mentioned this issue Feb 15, 2022

Enable collation with external relay chain full node #989

Closed

liuchengxu mentioned this issue Apr 17, 2022

Merge subspace-executor executable into subspace-node autonomys/subspace#366

Merged

the-right-joyce added J0-enhancement An additional feature request. and removed F8-enhancement 🎁 labels Aug 12, 2022

skunert closed this as completed Jan 17, 2023
