Percona XtraDB Cluster susceptible to disk-bound nodes

Miggles · January 7, 2015, 1:38pm

We have a three-node cluster, but for the purposes of experimentation I’m only sending traffic to one node (which I’ll call the active node; the other two are inactive).

If I perform disk-heavy operations on the inactive nodes, the active node gets bogged down behind a lot of pending WSREP commits.

All three nodes are on RAID10 EBS volumes; however, only one of them is running with Provisioned IOPS. I’m going to replace the storage on the other two nodes so that they use Provisioned IOPS as well and then repeat the experiments, but I was wondering whether there is something else I should be looking into.

Cheers

przemek · January 27, 2015, 3:11am

Ideally, all the nodes in a PXC cluster should be identical in terms of hardware. Galera keeps replication lag bounded, so the write throughput of the whole cluster is limited by the slowest node. If you overload one node, it will be slower at applying writesets and may trigger Flow Control, which pauses replication across the cluster until that node catches up:
http://www.percona.com/blog/2013/05/02/galera-flow-control-in-percona-xtradb-cluster-for-mysql/
You can, however, allow a particular node to fall behind by enabling desync mode on it (wsrep_desync=ON). See some examples:
http://www.percona.com/blog/2013/10/08/taking-backups-percona-xtradb-cluster-without-stalls-flow-control/
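To check whether a disk-bound node is actually the one throttling the cluster, you can look at its wsrep status counters. Below is a rough Python sketch, not an official tool: it assumes you have already fetched the output of SHOW GLOBAL STATUS LIKE 'wsrep_%' into a dict of strings by whatever client you use. The function name flow_control_report and the 0.1 paused-fraction threshold are my own illustrative choices; the status variable names and the wsrep_desync statements are the standard ones.

```python
# Statements to let one node fall behind without triggering Flow Control,
# and to bring it back into normal operation afterwards.
DESYNC_ON = "SET GLOBAL wsrep_desync = ON"
DESYNC_OFF = "SET GLOBAL wsrep_desync = OFF"


def flow_control_report(status, paused_threshold=0.1):
    """Summarise Flow Control indicators for one node.

    status: dict mapping wsrep status variable names to their string values,
            as returned by SHOW GLOBAL STATUS LIKE 'wsrep_%'.
    paused_threshold: hypothetical cut-off (assumption) above which we call
            the cluster "stalled"; tune it for your workload.
    """
    # Fraction of time replication was paused since the last FLUSH STATUS.
    paused = float(status.get("wsrep_flow_control_paused", 0))
    # Writesets currently waiting to be applied on this node.
    recv_queue = int(status.get("wsrep_local_recv_queue", 0))
    # Number of Flow Control pause messages this node has sent.
    fc_sent = int(status.get("wsrep_flow_control_sent", 0))

    return {
        "paused_fraction": paused,
        "recv_queue": recv_queue,
        "fc_sent": fc_sent,
        # A non-zero counter means this node has asked the others to pause.
        "throttling_cluster": fc_sent > 0,
        "cluster_stalled": paused >= paused_threshold,
    }


if __name__ == "__main__":
    # Made-up sample values for illustration only.
    sample = {
        "wsrep_flow_control_paused": "0.35",
        "wsrep_local_recv_queue": "120",
        "wsrep_flow_control_sent": "14",
    }
    print(flow_control_report(sample))
    print(DESYNC_ON)
```

Run this against each inactive node while the disk-heavy job is going. If throttling_cluster comes back true there, that node is the one sending Flow Control pauses, and either matching its storage to the active node or setting wsrep_desync on it for the duration of the job should relieve the active node.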
