On 30 Dec 2022, at 15:14, Nathan Angelacos <nangel(a)tetrasec.net> wrote:
On Fri, 2022-12-30 at 14:26 +0100, Olle E. Johansson wrote:
Dear Kamailians!
I’m trying to figure out DMQ and see how it reacts to server outages
and downtime - the base protocol.
<snip>
Same here. The best I could figure is "it works when it works, until
it doesn't." In our tests, the "edge cases" got weird.
Anyone who wants to explain how DMQ works in failure scenarios - there
are two of us who are interested!
There’s a core protocol that updates each node about node status - active, pending,
disabled, timeout - according to the source code. I haven’t seen timeout in my tests.
On top of that there are other implementations using this - htable and usrloc are two
examples - and each app has implemented its own protocol on top of the DMQ bus.
I’m trying to focus on the core protocol at this point.
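
To make the four statuses a bit more concrete, here is a rough Python sketch that
groups a node list by status. It is only an illustration: the "uri;status=value"
line format, the example addresses and every name in it (NodeStatus,
parse_node_line, summarize) are assumptions of mine, not something taken from the
module source or confirmed in this thread.

```python
from enum import Enum


class NodeStatus(Enum):
    # The four statuses named above; the string values are an assumption.
    ACTIVE = "active"
    PENDING = "pending"
    DISABLED = "disabled"
    TIMEOUT = "timeout"


def parse_node_line(line):
    """Split a hypothetical 'uri;status=value' line into (uri, NodeStatus)."""
    uri, _, params = line.strip().partition(";")
    for param in params.split(";"):
        name, _, value = param.partition("=")
        if name.strip().lower() == "status":
            # Raises ValueError if the value is not one of the four statuses.
            return uri, NodeStatus(value.strip().lower())
    raise ValueError("no status parameter in %r" % line)


def summarize(lines):
    """Group node URIs by their reported status, skipping blank lines."""
    by_status = {status: [] for status in NodeStatus}
    for line in lines:
        if line.strip():
            uri, status = parse_node_line(line)
            by_status[status].append(uri)
    return by_status


if __name__ == "__main__":
    # Made-up node list for illustration only.
    sample = [
        "sip:10.0.0.1:5060;status=active",
        "sip:10.0.0.2:5060;status=pending",
        "sip:10.0.0.3:5060;status=disabled",
    ]
    for status, uris in summarize(sample).items():
        print(status.value, uris)
```

Running something like this against the node list seen by each server during an
outage test would show whether any node ever ends up in the timeout bucket.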
/O