Cluster failure following a filesystem-level error

unixronin · April 18, 2017, 2:21pm

We have a three-node cluster, on VMs using ExtremeIO storage for the data filesystem, which suffered a failure this morning. The event that triggered the failure appears to have been a storage-level error which caused node 3 to fail to create a new binlog file, in response to which mysqld declared that it was ceasing all logging. Some time afterward, nodes 1 and 2 experienced simultaneous failures to commit a set of updates, declared themselves inconsistent, and shut down, whereupon node 3 lost quorum and declared itself non-primary.

Galera does use ROW replication data, as we all know. At what level does Galera obtain the data, and at what level does logging get shut off in response to a storage-level failure as described here? Would mysqld disabling all logging cause Galera replication from node 3 to fail? Our working theory at present is that nodes 1 and 2 failed because they attempted to update rows which had been written by node 3 but never replicated to nodes 1 and 2, because the binary logging failure on node 3 also disabled outgoing Galera replication from node 3. Does this hypothesis make sense?

przemek · April 25, 2017, 7:07am

First of all, binary logs are not required in Galera, although they are useful for PITR-capable backups, for example. Either way, binary logs are not used for replication in a Galera cluster.
If nodes 1 and 2 failed because they could not write to disk, then it is normal that they had to abort. Even a standalone MySQL+InnoDB instance will not work with its filesystem in read-only mode unless it has been specifically prepared for that case beforehand.
As two nodes out of three failed in an unclean way, the remaining node, even if it was healthy in terms of hardware, had to stop accepting queries because it lost quorum. But if this third node was still OK, you could force it to become primary again by manually bootstrapping it (this can be done online).
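
To illustrate why the third node went non-primary, here is a minimal sketch of the quorum rule in Python. It is a simplified model, not Galera's actual code: the function name `has_quorum` and the node names are made up for illustration, and every node is assumed to have the default weight of 1. The rule modeled is that the surviving component stays primary only if it holds more than half of the votes of the previous primary component, after subtracting nodes that left gracefully.

```python
def has_quorum(previous_members, left_gracefully, current_members, weights=None):
    """Simplified model of the Galera quorum check (illustrative only).

    previous_members: nodes in the last primary component
    left_gracefully:  nodes known to have shut down cleanly
    current_members:  nodes that can still see each other
    weights:          optional per-node weight, default 1 (assumption)
    """
    weights = weights or {}

    def total(nodes):
        return sum(weights.get(node, 1) for node in nodes)

    # Primary only if the survivors hold MORE than half of the remaining votes.
    return total(current_members) * 2 > total(previous_members) - total(left_gracefully)


cluster = {"node1", "node2", "node3"}

# Nodes 1 and 2 crash uncleanly: node 3 holds 1 of 3 votes and loses quorum.
print(has_quorum(cluster, set(), {"node3"}))

# Had nodes 1 and 2 shut down cleanly, node 3 would have stayed primary.
print(has_quorum(cluster, {"node1", "node2"}, {"node3"}))
```

If the surviving node is known to hold good data, the online bootstrap mentioned above is normally done by running `SET GLOBAL wsrep_provider_options='pc.bootstrap=YES';` on that node. Check the documentation for your PXC version before relying on it.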

The idea of High Availability with Galera (PXC) is that each node should run on independent hardware, so a single storage-level failure should NOT affect a majority of the nodes at the same time.
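
As a rough way to check a layout against that principle, the sketch below flags any shared storage backend whose loss would leave the surviving nodes without more than half of the cluster. The function name and the `array-a` / `array-b` labels are hypothetical, equal node weights are assumed, and the actual node-to-storage mapping of the cluster in this thread is not known.

```python
from collections import Counter


def risky_failure_domains(node_to_domain):
    """Return failure domains whose loss would cost the cluster its quorum.

    node_to_domain maps a node name to the hardware it depends on,
    e.g. a storage array (labels here are hypothetical).
    """
    total = len(node_to_domain)
    per_domain = Counter(node_to_domain.values())
    risky = []
    for domain, count in per_domain.items():
        survivors = total - count
        # Survivors need MORE than half of all nodes to remain primary.
        if survivors * 2 <= total:
            risky.append(domain)
    return sorted(risky)


# Two of three nodes share one array: losing it takes the cluster down.
print(risky_failure_domains(
    {"node1": "array-a", "node2": "array-a", "node3": "array-b"}
))

# Each node on its own array: no single storage failure breaks quorum.
print(risky_failure_domains(
    {"node1": "array-a", "node2": "array-b", "node3": "array-c"}
))
```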
