Compare commits


24 Commits
main ... master

Author SHA1 Message Date
Flatnotes e2912d0067 Autocommit action=CREATE on file=plop.md detected 2024-05-07 06:52:39 +00:00
Flatnotes 6d649875b8 Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-06 12:10:11 +00:00
Flatnotes ebb2341c50 Autocommit action=MODIFY on file=README.md detected 2024-05-05 20:48:28 +00:00
Flatnotes 92d3588542 Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-05 20:45:15 +00:00
Flatnotes dab4bb0364 Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-05 20:39:51 +00:00
Flatnotes d9da4532b9 Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-05 20:38:39 +00:00
Flatnotes a49b0f28be Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-05 20:37:16 +00:00
Flatnotes c4205ebe06 Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-05 20:34:11 +00:00
Flatnotes fee991dfd3 Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-05 20:33:39 +00:00
Flatnotes 4c3f3195cd Autocommit action=MODIFY on file=Follow the RabbitMQ - Plan.md detected 2024-05-05 20:32:11 +00:00
Flatnotes ab0f6574fd Autocommit action=MODIFY on file=RabbitMQ recent improvments.md detected 2024-05-05 20:31:50 +00:00
Flatnotes eeff712455 Autocommit action=MODIFY on file=Plan.md detected 2024-05-05 20:29:10 +00:00
Flatnotes 0866e4a7e0 Autocommit action=MODIFY on file=Plan.md detected 2024-05-05 20:06:10 +00:00
Flatnotes 01c3c6d78d Autocommit action=MODIFY on file=Plan.md detected 2024-05-05 19:23:01 +00:00
Flatnotes 6eef026b69 Autocommit action=MODIFY on file=Plan.md detected 2024-05-05 19:14:12 +00:00
Flatnotes 30632f97c6 Autocommit action=MODIFY on file=Plan.md detected 2024-05-05 18:48:10 +00:00
Flatnotes a322b387de Autocommit action=CREATE on file=Plan.md detected 2024-05-05 18:32:13 +00:00
Flatnotes 2d68da8612 Autocommit action=MODIFY on file=1. Follow the Rabbitmq.md detected 2024-05-05 18:28:33 +00:00
Flatnotes 20611ec853 Autocommit action=MODIFY on file=1. Follow the Rabbitmq.md detected 2024-05-05 18:27:49 +00:00
Flatnotes 854a37994b Autocommit action=CREATE on file=oslo.messaging - How it works with rabbit.md detected 2024-05-05 18:09:43 +00:00
Flatnotes 1014610a8b Autocommit action=MODIFY on file=1. Follow the Rabbitmq.md detected 2024-05-05 18:07:21 +00:00
Flatnotes 4fe6fc0eb4 Autocommit action=MODIFY on file=1. Follow the Rabbitmq.md detected 2024-05-05 17:29:39 +00:00
Flatnotes 12ba3c160f Autocommit action=MODIFY on file=1. Follow the Rabbitmq.md detected 2024-05-05 17:27:41 +00:00
Flatnotes 06509916b8 Autocommit action=CREATE on file=1. Follow the Rabbitmq.md detected 2024-05-05 17:27:41 +00:00
6 changed files with 213 additions and 0 deletions

.gitignore vendored Normal file (+2)

@@ -0,0 +1,2 @@
.flatnotes
[flatnotes]*Changelog.md

Follow the RabbitMQ - Plan.md

@@ -0,0 +1,69 @@
> Overview of the size of the infra we operate
- Intro
> What kind of issues we faced with rabbit
> Is it a RabbitMQ setup issue or an OpenStack issue?
* Issues with rabbit?
* flap when rolling out agent / deploying new agent version
* even crash on big regions
* network flap / rabbit partition
* pause-minority helped crash the cluster
* reset cluster was ... the solution
> Which methods did we use to troubleshoot those issues
> Observability, tools
* What's going on with rabbit?
* What we deployed to help troubleshoot issues
* reproduce workload with rabbit perftest
* oslo.metrics
* rabbitmq exporter / grafana dashboards
* smokeping between nodes
* rabbitspy
* What did we learn?
* rabbitmq really does not like large queue/connection churn
* identified issues were mostly related to neutron
* rabbit ddos
* too many queue declares
* too much tcp connection churn
* fanout mechanism: 1 message published, duplicated to N queues
* Nova rpc usage is clearly != neutron's
> Before going further, let's take some time to understand how oslo.messaging works
> How RPC is implemented in OpenStack
> [[ oslo.messaging - How it works with rabbit]]
* Under the hood?
* pub/sub mechanism
* subscriber: RPC server topic=name
* setup class endpoints
* create queues / setup consumer thread
* publish with rpc provided methods
* call - reply (topic / transient for reply)
* cast (topic queue)
* cast / fanout=true (fanout queue)
* talk about the transient stuff
* notifications for external use: kafka
> What we did to put rabbits back to their holes
* Journey to get a stable infra.
* Infra improvements
* split rabbit-neutron / rabbit-*
* scale problematic clusters to 5 nodes
* Upgrade to 3.10+
* quorum queues recommended
* set the partition strategy back to pause-minority
* oslo.messaging improvements
* fixed queue naming to avoid queue churn
* heartbeat in pthread fix
* move from HA queues to quorum queues
* fix to autodelete broken quorum queues
* replace 'fanout' queues with stream queues
* greatly reduce the number of queues
* patch to avoid tcp reconnection when a queue is deleted (kombu/oslo)
* reduce the queues declared by an RPC server (from 3 by default to only 1)
* use the same connection for multiple topics
> ...
- Conclusion
- when rabbitmq is used for what it is designed for, it works better
- going further?
- let's write an oslo.messaging driver for another backend?

@@ -0,0 +1 @@
[git](https://git.cosmao.info/ju/openinfraday/src/branch/master/Follow%20the%20RabbitMQ%20-%20Plan.md)

RabbitMQ recent improvments.md

@@ -0,0 +1,78 @@
RabbitMQ is a key component in OpenStack deployments.
Both nova and neutron heavily rely on it for internal communication (between the agents running on computes and the APIs running on the control plane).
RabbitMQ clustering is a must-have to let operators manage the lifecycle of RabbitMQ. This is also true when RabbitMQ runs in a Kubernetes environment.
OpenStack components consume RabbitMQ through oslo.messaging.
Some recent improvements have been made to oslo.messaging to allow better scaling and management of RabbitMQ queues.
**Here is a list of what we did on the OVH side to achieve better stability at large scale.**
* Better eventlet / green thread management
The AMQP protocol relies on "heartbeats" to keep idle connections open.
Two patches were made in oslo.messaging to send heartbeats correctly:
the first patch sends heartbeats more often, to respect the protocol definition;
the second patch uses native threads instead of green threads to send heartbeats.
Green threads could be paused by eventlet under some circumstances, leading to connections being dropped by RabbitMQ because of missed heartbeats.
While dropping and creating a new connection is not a big deal on a small deployment, at large scale it leads to some message loss and a lot of TCP churn.
***Both patches are merged upstream and available by default.***
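To illustrate the idea behind the second patch (this is a minimal sketch, not the actual oslo.messaging code; the broker URL and interval are made up), the heartbeat loop has to live in a native OS thread so that eventlet cannot pause it:

```python
import threading
import time

import kombu

# Hypothetical connection: AMQP heartbeat negotiated at 60 seconds.
conn = kombu.Connection('amqp://guest:guest@rabbit.example//', heartbeat=60)
conn.connect()

def heartbeat_loop(connection, interval):
    # A native thread keeps servicing heartbeats even if the eventlet hub
    # is stuck running a long green thread elsewhere.
    while True:
        connection.heartbeat_check()  # send/verify AMQP heartbeats
        time.sleep(interval)

# Check at twice the heartbeat rate (every 30s for a 60s heartbeat) so a
# single late wakeup does not make the broker drop the connection.
threading.Thread(target=heartbeat_loop, args=(conn, 30), daemon=True).start()
```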
* Replace classic HA with quorum
RabbitMQ is moving away from classic HA queues and replacing them with quorum queues (based on the Raft algorithm).
This is a huge improvement on the RabbitMQ side: it allows better scalability as well as data redundancy.
Quorum queues were only partially implemented in oslo.messaging.
OVH wrote a patch to finish this implementation (for 'transient' queues).
**Using quorum queues is not yet the default and we would like to enable this by default.**
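At the AMQP level, switching a queue to the quorum type is just an extra declare argument. A minimal kombu sketch (the queue name and URL are illustrative, not oslo.messaging internals):

```python
import kombu

conn = kombu.Connection('amqp://guest:guest@rabbit.example//')
channel = conn.channel()

queue = kombu.Queue(
    'example_rpc_queue',                          # hypothetical queue name
    durable=True,                                 # quorum queues must be durable
    queue_arguments={'x-queue-type': 'quorum'},   # classic/HA -> quorum
)
queue(channel).declare()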
* Consistent queue naming
oslo.messaging relied on random queue naming.
While this does not seem to be a problem on small deployments, it has two bad side effects:
* it is harder to figure out which service created a specific queue
* as soon as you restart your services, new random queues are created, leaving a lot of orphaned queues in RabbitMQ
These side effects are highly visible at large scale, and even more so when using quorum queues.
**We wrote a patch for oslo.messaging to stop using random names.**
It is now merged upstream, but disabled by default.
We would like to enable this by default in the future.
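A rough sketch of the difference (the names below are invented for illustration; the real oslo.messaging naming scheme differs):

```python
import socket
import uuid

# Before: a new random reply queue at every service start; after a restart
# the old queue is left orphaned on the broker.
random_reply_queue = 'reply_' + uuid.uuid4().hex

# After: a name built from stable identifiers, so a restarted service picks
# the same queue up again instead of abandoning it.
stable_reply_queue = 'reply_{}_{}'.format('neutron-dhcp-agent',
                                          socket.gethostname())
```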
* Reduce the number of queues
Both neutron and nova rely heavily on RabbitMQ communication.
While nova sends the most messages (5x more than neutron), neutron creates the most queues (10x more than nova).
RabbitMQ is a message broker, not a queue broker.
Neutron creates a lot of queues without even using them (neutron instantiates oslo.messaging for one queue, but oslo.messaging creates multiple queues for multiple purposes, even if neutron does not need them).
With a high number of queues, RabbitMQ does not work correctly (timeouts, CPU usage, network usage, etc.).
OVH wrote some patches to reduce the number of queues created by neutron, by patching both oslo.messaging and neutron code (we divided the number of neutron queues by 5).
**We would like to push this upstream.**
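Roughly, a single RPC server on a topic ends up consuming from several queues; the names below are an approximation of the rabbit driver's scheme, shown only to give an idea of where the extra queues come from:

```python
# For Target(topic='demo_topic', server='host-1') the rabbit driver consumes,
# by default, from something like:
default_rpc_server_queues = [
    'demo_topic',                  # shared topic queue (round-robin workers)
    'demo_topic.host-1',           # per-server queue (directed messages)
    'demo_topic_fanout_<random>',  # fanout queue (broadcasts), one per server
]
# Many neutron agents only ever receive directed or fanout messages, so the
# other queues sit unused; the patches aim to declare only the ones needed.
```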
* Replace classic fanouts with streams
Both neutron and nova rely on fanout queues to send messages to all computes.
Neutron mostly uses this to trigger a security group update or any other update on an object (populating the remote cache).
When classic queues were used for this, messages were replicated in one queue per compute.
If you had a region with 2k computes, you would send 2k identical messages to 2k queues (1 message per queue). This is not efficient at all.
**OVH wrote a patch to rely on "stream" queues to replace classic fanouts.**
With stream queues, all computes listen to the same queue, so only 1 message is sent to 1 queue and is received by 2k computes.
This also reduces the number of queues on RabbitMQ.
These patches are merged upstream but disabled by default.
**We would like to enable this by default.**
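A minimal kombu sketch of the pattern (queue name, URL and payload are made up; the real wiring lives inside oslo.messaging): one durable stream queue shared by all computes, so the publisher sends a single copy of each message.

```python
import kombu

conn = kombu.Connection('amqp://guest:guest@rabbit.example//')
channel = conn.channel()

# One stream queue that every compute consumes from, instead of one classic
# fanout queue per compute.
stream_queue = kombu.Queue(
    'example_fanout_stream',
    durable=True,
    queue_arguments={'x-queue-type': 'stream'},
)
stream_queue(channel).declare()

# The publisher sends a single message; all connected consumers read it from
# the same stream.
producer = conn.Producer(channel)
producer.publish({'event': 'security_group_updated'},
                 routing_key='example_fanout_stream')
```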
* Get rid of 'transient' queues
oslo.messaging distinguishes 'transient' queues from other queues, but this distinction no longer makes sense.
Neutron and nova expect all queues to be fully replicated and highly available.
There is no transient concept in nova / neutron code.
This concept leads to bad practices when managing a RabbitMQ cluster, e.g. not replicating the transient queues, which is bad for both nova and neutron.
OVH stopped distinguishing transient queues and manages all queues in a highly available fashion (using quorum queues).
This allows us to stop a RabbitMQ server in the cluster without any impact on the service.
We would like to patch oslo.messaging in the future to stop considering some queues as transient.
This would simplify the code a lot.

oslo.messaging - How it works with rabbit.md

@@ -0,0 +1,39 @@
# Messaging in Openstack
## oslo_messaging
In the PCI infra, oslo_messaging is configured with:
- the rabbitmq driver for RPC server/agent communication
- the kafka and log drivers for notifications (sending events to third-party apps)
### RPC implementation in rabbitmq
[RPC in openstack](https://docs.openstack.org/oslo.messaging/stein/reference/rpcclient.html) is implemented using oslo_messaging library.
!!! note "tldr"
    - rpc call()
        - blocking call to invoke a method on a topic, with 1 reply expected
    - rpc cast()
        - invoke a method on a topic in 'best effort' mode, without a reply. If fanout=true, the message is broadcast to all topic consumers
In RabbitMQ, a message is published to a queue using an exchange/routing_key.
Consumers are directly connected to a queue and read messages from it.
An oslo.messaging 'topic' is almost equivalent to a RabbitMQ queue.
With an rpc call, the message is sent to RabbitMQ through exchange=target.exchange queue={target.topic}.{target.server}.
The response is sent back to the caller using exchange=target.exchange queue={message.reply_queue}.
With an rpc cast fanout=false, it is the same, but there is no reply mechanism.
With an rpc cast fanout=true, the message is sent to RabbitMQ through exchange=target.exchange queue={target.topic}_fanout.
For rpc call and rpc cast (fanout=false), we use quorum queues (1 publisher / 1 consumer).
For rpc cast (fanout=true), stream queues are used because their purpose is to broadcast messages (1 publisher / N consumers).
On startup, every server/agent declares the queues it will consume from. If a queue does not exist on the rabbit cluster, it is created.
The same goes for the publishing side with the exchange.
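To make the call / cast / fanout cases above concrete, here is a small self-contained sketch using the public oslo.messaging API (the broker URL, topic and method names are made up for the example):

```python
import oslo_messaging
from oslo_config import cfg

# Hypothetical transport; in a real service the URL comes from configuration.
transport = oslo_messaging.get_rpc_transport(
    cfg.CONF, url='rabbit://guest:guest@rabbit.example:5672/')

class DemoEndpoint(object):
    def ping(self, ctxt, payload):
        return payload

# RPC server: consumes from the queues derived from topic/server.
server_target = oslo_messaging.Target(topic='demo_topic', server='host-1')
server = oslo_messaging.get_rpc_server(
    transport, server_target, [DemoEndpoint()], executor='threading')
server.start()

client = oslo_messaging.RPCClient(
    transport, oslo_messaging.Target(topic='demo_topic'))

ctxt = {}
# call(): blocking, one reply expected (comes back on a reply queue).
print(client.call(ctxt, 'ping', payload='pong'))
# cast(): best effort on the topic queue, no reply.
client.cast(ctxt, 'ping', payload='pong')
# fanout cast: broadcast to every consumer of the topic.
client.prepare(fanout=True).cast(ctxt, 'ping', payload='pong')
```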

plop.md Normal file (+24)

@@ -0,0 +1,24 @@
```
<section>image presentation</section>
<!-- Intro -->
<section>
  <section>Intro</section>
  <section>Vertical Slide 1</section>
</section>
<section data-markdown data-separator="^\n---\n$" data-separator-vertical="^\n--\n$">
  <textarea data-template>
    ### Issues with rabbit ?
    --
    ### common
    - flap when rolling out agent / deploying new agent version
      - even crash on big regions
    - network flap / rabbit partition
    - pause-minority helped crash the cluster
    - reset cluster was ... the solution
  </textarea>
</section>
```