ALTER TABLE without downtime in a cluster

Edward_Hibbert · February 22, 2017, 5:08pm

I’m struggling to work out how to perform ALTER TABLES without down time in a cluster, even with the nice pt-online-schema-change tool.

If there are many other tables which have foreign key references to the table that you are altering, then my experience is that you’ll get a lot of downtime at the end (presumably even if it selects drop_swap). This also doesn’t seem to play nicely when there are multiple nodes running.

So for large tables with many key references to them, I’ve ended up reducing the cluster to a single node, doing an ALTER TABLE and not using pt-online-schema-change, then forcing SSTs to the other nodes to bring them back in by deleting grastate.dat.

This isn’t great - significant down time. Am I missing something? Yes, maybe I’m missing rolling schema upgrades (RSU).

I’ve not tried this yet - but it looks like the process would be:

take a node out of the load balancer that sends requests to it
set global wsrep_OSU_method=RSU;
do the ALTER TABLE
set global wsrep_OSU_method=TOI;
put the node back into the load balancer
repeat for the other nodes

Is that broadly what people do with RSU?

One special case here is the use of ALTER TABLE to reclaim InnoDB space. Is what I outline above a way to reclaim space without down time?

General hints and tips welcome.

Edward_Hibbert · February 23, 2017, 2:18am

Well, I tried RSU but it didn’t work very well.

set global wsrep_OSU_method=RSU; on one node
OPTIMIZE table on that node
the other nodes also start performing the OPTIMIZE, so the cluster locks up

Am I missing something?

(Edit) I later came across this, which suggests that set global won’t work:

Prior to Percona XtraDB Cluster 5.6.24-25.11, wsrep_OSU_method was only a global variable. Current behavior is now consistent with MySQL behavior for variables that have both global and session scope. This means if you want to change the variable in current session, you need to do it with: SET wsrep_OSU_method (without the GLOBALkeyword). Setting the variable with SET GLOBAL wsrep_OSU_method will change the variable globally but it won’t have effect on the current session.

Edward_Hibbert · February 23, 2017, 11:14am

Following up, just in case anyone else google and finds this - using SET wsrep_OSU_method did the trick, i.e. omitting the GLOBAL. This allows me to alter a table on one, and presumably then the others in turn.

So basically use RSU for any backwards compatible changes affecting large tables rather than pt-online-schema-change.

Topic		Replies	Views
Rolling Schema Upgrade does not work as described in the dokumentation Percona XtraDB Cluster 8.x	4	47	August 6, 2024
pre-commit stage Percona XtraDB Cluster 5.x	3	644	March 30, 2017
Error Code: 1213. WSREP with concurrent DML when DDL is executed Percona XtraDB Cluster 5.x	3	1279	December 12, 2018
Altering range partition at higload table in Galera multimaster cluster Percona XtraDB Cluster 5.x	0	1059	March 22, 2019
ALTER TABLE locks up cluster Percona XtraDB Cluster 5.x	0	700	January 14, 2015

ALTER TABLE without downtime in a cluster

Related topics