We have a Percona Xtradb Cluster 8.0.36-28.1 with 3 nodes running with ProxySQL in front.
In PMM we see a lot of Flow Control Message Sent each day.
I have found some anormalies:
Node A
show status like 'wsrep_cluster%';
Variable_name Value
wsrep_cluster_weight 3
wsrep_cluster_capabilities
wsrep_cluster_conf_id 18446744073709551615
wsrep_cluster_size 0
wsrep_cluster_state_uuid
wsrep_cluster_status Primary
Node B
show status like 'wsrep_cluster%';
Variable_name Value
wsrep_cluster_weight 3
wsrep_cluster_capabilities
wsrep_cluster_conf_id 22
wsrep_cluster_size 3
wsrep_cluster_state_uuid 00000000-0000-0000-0000-000000000000
wsrep_cluster_status Primary
Node C
show status like 'wsrep_cluster%';
Variable_name Value
wsrep_cluster_weight 3
wsrep_cluster_capabilities
wsrep_cluster_conf_id 22
wsrep_cluster_size 3
wsrep_cluster_state_uuid 00000000-0000-0000-0000-000000000000
wsrep_cluster_status Primary
So wsrep_cluster_size 0 looks wrong.
All wsrep_cluster_conf_id expected to be 22.
And wsrep_cluster_state_uuid expected to be equal.
The flow control occurs several times a day. Even a simple one like
ALTER statements (or any other DDL) will stop all server activity (even for the tables in which the DDL is not running|) until the DDL is complete and all nodes are back in sync
Aside from DDLs you should check “fc_limit” and “recv_queue” to check if flow control is happening