what happens during network partition? #146

pdeva · 2017-06-16T17:51:03Z

the vldb paper just says bloomberg prevents network partitions from happening in its own network.

but in a cloud environment, partitions will indeed occur.

in that case will the client throw an error?

if so, then there has to be error handling logic for all calls to comdb2. how does it then simplify coding vs something like postgres? from Alex's talk I was under the assumption that the whole point of comdb2 was to not require tons of error handling.

mponomar · 2017-06-16T18:14:31Z

There's no magic in the world, of course. Partitions occur. But you can guard against them. There's code in Comdb2 to use multiple networks, for example. If a node gets disconnected from a node/set of nodes over one network, it can switch to another.
You can guard against losing half of your nodes (and having a true split brain) by having > 2 sites. As long as some majority of the nodes can still interconnect, they will form a cluster. API code will retry connecting to nodes until it hits a node that's part of a new majority, and resume operating. This happens without application code involvement - no retries necessary. Failures like this usually occur between sites (citation needed, I know).
At a slightly higher cost, you can enable HA mode. In this mode, the API will record all operations in your transaction as well as the database state (log sequence number is sufficient) and replay them against a different node in case of failure. This protects you against having to retry if the node you're connected to fails in the middle of a query or a transaction. We have a lovely demo of a running query/transaction where we kill the database, and the transaction/query resumes when it comes back.
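To make the record-and-replay idea concrete, here is a minimal sketch of the general technique, not Comdb2's actual client code. `FakeNode`, `ReplayingClient`, and `NodeDown` are hypothetical names invented for illustration, and the LSN is modeled as a plain integer. The client remembers the starting LSN and every statement in the transaction; if the node fails, it opens the transaction at the same LSN on another node and replays the recorded statements.

```python
from dataclasses import dataclass, field
from typing import List, Optional


class NodeDown(Exception):
    """Raised by a (hypothetical) node that fails mid-transaction."""


@dataclass
class FakeNode:
    """Stand-in for a database node; not a real Comdb2 interface."""
    name: str
    fail_after: Optional[int] = None  # fail once this many statements have run
    snapshot_lsn: Optional[int] = None
    executed: List[str] = field(default_factory=list)

    def begin(self, snapshot_lsn: int) -> None:
        self.snapshot_lsn = snapshot_lsn
        self.executed = []

    def execute(self, sql: str) -> None:
        if self.fail_after is not None and len(self.executed) >= self.fail_after:
            raise NodeDown(self.name)
        self.executed.append(sql)


class ReplayingClient:
    """Records the transaction so it can be replayed on another node."""

    def __init__(self, nodes: List[FakeNode]) -> None:
        self.nodes = list(nodes)
        self.current = 0
        self.snapshot_lsn: Optional[int] = None
        self.log: List[str] = []

    def begin(self, snapshot_lsn: int) -> None:
        self.snapshot_lsn = snapshot_lsn
        self.log = []
        self.nodes[self.current].begin(snapshot_lsn)

    def execute(self, sql: str) -> None:
        self.log.append(sql)  # record before sending, so it is replayable
        try:
            self.nodes[self.current].execute(sql)
        except NodeDown:
            self._replay_elsewhere()

    def _replay_elsewhere(self) -> None:
        # Move to the next node, restart at the recorded LSN, replay the log.
        while self.current + 1 < len(self.nodes):
            self.current += 1
            node = self.nodes[self.current]
            try:
                node.begin(self.snapshot_lsn)
                for stmt in self.log:
                    node.execute(stmt)
                return
            except NodeDown:
                continue
        raise NodeDown("no node left to replay against")
```

The application only calls `begin` and `execute`; the failover happens inside the client, which is the property described above.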

There's still no magic, unfortunately. If you have 2 sites, and the only link between them breaks, there isn't much of a choice. Applications that can still reach the half of the cluster that has the old master will continue to work, as long as exactly half the nodes are connected (this relaxes the requirement from needing a majority to operate to needing a majority only to elect). Applications that can't reach that half are unavailable. If the number of connected systems falls below half, the entire cluster becomes unavailable. This is necessary to maintain consistency. It's an unfortunate choice application/database developers are forced to make.
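The rule in this paragraph can be written down as a small piece of arithmetic. The function below is only a model of one reading of the description (the side holding the old master needs at least half of the nodes, and a side without it needs a strict majority to elect a new one). The name `partition_side_available` and its parameters are made up for illustration and are not part of Comdb2.

```python
def partition_side_available(total_nodes: int, connected: int,
                             has_old_master: bool) -> bool:
    """Model of whether one side of a partition can keep serving requests.

    Assumption: a side that still has the old master keeps operating while
    it sees at least half of the cluster; a side without the master must
    see a strict majority so that it can elect a new one.
    """
    if has_old_master:
        return 2 * connected >= total_nodes
    return 2 * connected > total_nodes


# Two sites of 3 nodes each, and the only link between them breaks:
master_side = partition_side_available(6, 3, has_old_master=True)
other_side = partition_side_available(6, 3, has_old_master=False)

# Three sites of 3 nodes each, and one whole site is lost:
survivors = partition_side_available(9, 6, has_old_master=False)
```

With two sites only the side with the old master stays up, while with three sites the six surviving nodes still form a majority even if the lost site held the master.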

So in summary: Comdb2 has ways of masking errors that occur with some of the nodes of a cluster. If you maintain multiple redundant networks, and have enough sites that the failure of one site doesn't reduce the capacity of the cluster below half, the cluster remains available.

We can certainly document this in more detail. If you have questions about specific items/points, we'll be sure to include those in the docs.

pdeva · 2017-06-16T18:18:50Z

you used the term 'site' in your answer. what does it mean?
is a 'site' == 'a database node'?

akshatsikarwar · 2017-06-16T18:20:39Z

More like a datacenter. Consider figure 1 from https://bloomberg.github.io/comdb2/transaction_model.html
3 sites of 3 nodes each.

mponomar · 2017-06-16T18:22:41Z

If you're really interested in testing network partitions, there's a decent test included here

pdeva · 2017-06-16T18:27:04Z

is data synchronously replicated among all 'sites'?
wouldn't that be extremely high latency since each 'site' can be in a different geographical region, e.g. one in 'us east' and one in 'japan'?

akshatsikarwar · 2017-06-16T18:31:06Z

Yes and yes.
You can set up async replication (sync to same dc, async to others) but that comes with usual trade-offs.

mponomar · 2017-06-16T18:32:59Z

Or you can make it fully async (with the usual trade-offs, once again).

pdeva · 2017-06-16T19:03:07Z

API code will retry connecting to nodes

does 'API Code' mean the client jdbc driver here?

akshatsikarwar · 2017-06-16T19:03:47Z

Yes

mponomar · 2017-06-16T19:04:58Z

There are C and Java (JDBC) implementations of the protocol included. Both of them will retry. If there are other implementations, they may not.
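For anyone writing another client implementation, the retry behavior could look roughly like the sketch below. This is a generic illustration, not the logic of the bundled C or JDBC clients. `connect_with_retry`, `try_connect`, and `max_rounds` are hypothetical names, and `try_connect` stands in for whatever call tells you a node accepted the connection and belongs to a working majority.

```python
from typing import Callable, Sequence


def connect_with_retry(nodes: Sequence[str],
                       try_connect: Callable[[str], bool],
                       max_rounds: int = 3) -> str:
    """Cycle through the known nodes until one accepts the connection.

    Returns the name of the first node for which try_connect() is true,
    or raises ConnectionError after max_rounds passes over the list.
    """
    for _ in range(max_rounds):
        for node in nodes:
            if try_connect(node):
                return node
    raise ConnectionError("no reachable node after %d rounds" % max_rounds)


# Example: only n3 is on the majority side of the partition.
reachable = {"n3"}
chosen = connect_with_retry(["n1", "n2", "n3"], lambda n: n in reachable)
```

A real client would likely add a delay between rounds; it is left out here to keep the sketch short.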

mponomar closed this as completed Jun 16, 2017

akshatsikarwar added the question label Jun 16, 2017

This was referenced Jun 17, 2017

High Availability Docs #49

Open

questions on Linearizable isolation level #157

Closed
