Advise on application connectivity to PXC cluster deployed on nodes spread across two regions

VithalAkunuri · July 7, 2025, 2:28pm

Hi,
We have PXC 8.0 cluster deployed on Kubernetes as stateful set 5 with multi master on nodes spread across two regions. Separate services were created on each region to load balance the traffic across the available PXC pods in the cluster. HAProxy was also deployed on the cluster.

Can you please advise on the below?

Is this recommended to create separate services in each region to allow applications connect to database within the same region?
If yes, How does resiliency work in case one of the pod goes offline and comes back after say 30 mins? As the pod joins back the service in Kubernetes, does the data gets replicated automatically as part of pod startup before traffic starts routed by Kubernetes service?
If not, do you recommend to have application connect to haproxy service ( as its already deployed ) so that writes are directed to only one node and resiliency is automatically taken care of ? Currently, application doesn’t have the configuration to segregate write and read operations.
High SQL commits ( ~100msecs )are observed , how can I debug further ?
PMM is not enabled yet, do you recommend to have it enabled on production environment?

Thank you!

CTutte · July 7, 2025, 2:52pm

Hi VithalAkunuri,

I will only comment about the PXC stuff and leave the K8s issues for someone else to reply.

Deploying PXC in multiple regions will affect performance severely and is strongly discouraged. For reference there is a blogpost about this: https://www.percona.com/blog/how-not-to-do-mysql-high-availability-geographic-node-distribution-with-galera-based-replication-misuse/

The reason is that all nodes in the topology needs to communicate in real time (not only when write comes through) so all activity will be funneled and slowed down to the network speed. For refrence you can read more about this on this blogpost https://www.percona.com/blog/investigating-mysql-replication-latency-in-percona-xtradb-cluster/

Regards

matthewb · July 7, 2025, 11:17pm

This is because your cluster is split between regions. The fastest you can commit is equal to the slowest latency between any 2 nodes. Put all 5 nodes into the same region, and configure haproxy to write to a single node.

To maximize PXC, you will need to make this modification. HAProxy does not understand SQL, so the app must decide. Or you can deploy ProxySQL which does understand SQL, and can route connections based on SELECT or not.

VithalAkunuri · July 8, 2025, 3:51pm

Thanks @matthewb for your reply. Can you please comment if its recommended to use Kubernetes service for application connection ?

matthewb · July 8, 2025, 4:33pm

Yes, use K8S service for application connections. The K8S service knows when/if pods restart and automatically maintains the backend mapping of IP->pod.

VithalAkunuri · July 9, 2025, 8:08am

Thanks @matthewb

After the pod restart , does it immediately accepts the write requests ? The fact that the pod may not be in sync with other nodes of the cluster for a brief period of time , this node could be in non-primary until the replication completes?

matthewb · July 10, 2025, 11:22pm

I believe the operator manages that with the service. You would need to test to be sure.

Topic		Replies	Views
PXC cluster for mysql is not choosing the secondary as primary Percona XtraDB Cluster 5.x mysql , percona	2	356	September 10, 2024
Cross-site two way (Or more than two) replication Percona Operator for MySQL percona	6	526	May 2, 2024
PXC load balancing preference and raid question Percona XtraDB Cluster 8.x	10	108	March 14, 2025
PXC and HAProxy HA on geo scale Percona XtraDB Cluster 5.x	4	1240	July 1, 2013
Can there be any challenges in PXC if 2 of its nodes are in GCP mumbai region and 1 node in Delhi region? Percona XtraDB Cluster 5.x percona	2	99	September 6, 2024

Advise on application connectivity to PXC cluster deployed on nodes spread across two regions

Related topics