race condition between on_node_down and queue.declare #9804
-
Describe the bug
In a three-node RabbitMQ cluster with nodes A, B, and C, both B and C execute on_node_down(A) after node A goes down. on_node_down() removes non-HA classic queues that were declared on node A. While such a queue (Q) is not yet deleted, any request to it is rejected with a 'suspended by supervisor' message, but once one node completes the deletion of Q, a client may declare it again. This introduces a race condition between on_node_down(A) running on multiple nodes and the re-declaration of Q, which can result in the queue being silently deleted from Mnesia while a client is consuming from it. As a side effect, after the channel to such a silently deleted queue is closed and the queue is re-declared, the queue is deleted again after the x-expires timeout, because the x-expires timer starts once the channel is closed.
Reproduction steps
Expected behavior
Queues not silently disappearing after declaration.
Additional context
We have a stable reproduction of this issue with RabbitMQ 3.8.16 on Erlang/OTP 23 and OpenStack/oslo-messaging as a client.
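For concreteness, here is a minimal sketch of the kind of declaration involved (an assumption on our part: a pika-style channel object is used, and the helper name, queue name, and 10-minute TTL are illustrative, not taken from the report):

```python
# Sketch (assumption): a non-HA classic queue declared with x-expires,
# the setup described in the report. `channel` is any pika-style channel.
def declare_expiring_queue(channel, name, ttl_ms=600_000):
    # x-expires: the broker deletes the queue once it has been "unused"
    # (no online consumers, no basic.get, no re-declaration) for ttl_ms.
    # Closing the consuming channel starts that timer, which is why the
    # re-declared queue in the report expires again after the channel closes.
    return channel.queue_declare(
        queue=name,
        arguments={"x-expires": ttl_ms},
    )
```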
Replies: 3 comments
-
RabbitMQ 3.8 reached EOL well over a year ago. This is a known behavior and there is no short-term solution for non-mirrored queue types. RabbitMQ cannot know whether a queue declaration is coming in the near future. Even if it were to delay the cleanup of non-mirrored/transient queues, the same race would still exist after that initial delay. This is a race condition between two operations that cannot be synchronized, because they are initiated by two (or more) different applications on different hosts. So there are a few options:
The same problem exists for connections with a very large number of exclusive queues.
The point is that for as long as transient queues exist and can be deleted in response to a client-initiated event of any sort, this situation cannot be avoided.
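The interleaving behind the report can be replayed deterministically in a simplified model (an assumption on our part: the dict below stands in for Mnesia, and each node's cleanup works from the snapshot of queues it took when node A died; this is not broker code):

```python
# Deterministic replay of the race: two nodes run on_node_down(A) from
# stale snapshots while a client re-declares the queue in between.
registry = {"Q": "node_a"}        # non-HA classic queue homed on node A

snapshot_b = dict(registry)       # B and C both observe A going down
snapshot_c = dict(registry)

# Node B's on_node_down(A): delete queues that were homed on A.
for q, home in snapshot_b.items():
    if home == "node_a":
        registry.pop(q, None)

# The client sees Q is gone and re-declares it on a surviving node.
registry["Q"] = "node_b"

# Node C's on_node_down(A) now runs from its *stale* snapshot and deletes
# the freshly re-declared Q as well: the silent disappearance.
for q, home in snapshot_c.items():
    if home == "node_a":
        registry.pop(q, None)

print("Q" in registry)   # False: the client's queue vanished underneath it
```

The two cleanup loops and the re-declaration come from independent processes on different hosts, which is why no ordering between them can be enforced.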
-
If you use queue expiration, you must be OK with queues being deleted at some point. Channels can run into an exception and be closed, or all consumers on them can be cancelled (an online consumer is what keeps the TTL from taking effect). If you cannot accept queue deletion by the TTL mechanism in such cases, do not use queue TTL: you very explicitly tell RabbitMQ "if this queue is unused at some point, delete it". The definition of "used" is simple and specific: the queue has online consumers. You could probably overprovision consumers and use Single Active Consumer to avoid parallel processing, but that sounds like the wrong thing to do. If you cannot afford queue transience, use quorum queues with three replicas.
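Assuming a pika-style channel again, the suggested switch to quorum queues amounts to one extra declaration argument (x-queue-type is the documented argument; the helper name is illustrative):

```python
# Sketch (assumption): declaring the queue as a quorum queue so it has
# replicas on multiple nodes and is not deleted when one node goes down.
def declare_quorum_queue(channel, name):
    return channel.queue_declare(
        queue=name,
        durable=True,                          # quorum queues must be durable
        arguments={"x-queue-type": "quorum"},
    )
```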
-
One more option would be to develop a non-replicated queue type that always migrates between nodes, even if that means losing data when its hosting node goes down: always choosing availability over consistency, so that deleting such queues when their home node is down would not be necessary by design. It would be a reasonable feature to add, but we won't start working on it until 4.0 ships in 2024. Right now a lot of other things are much more important, and shipping them would benefit far more deployments.