Recommended actions for PXC in case maintenance windows

Francisco_Ruben_Jime · March 13, 2023, 5:55pm

Hi all, Good day!

We had implemented 3 PXC clusters, running each on a different OCP cluster. Like the diagram attached.

What are the best practices in the following scenarios?

We need to take the OCP Cluster hosting PXC1 offline for maintenance but do not want to disrupt database services.

Is there an automatic way to switch over pxc, I mean, to change from Source to Replica and from Replica to Source? Or what is your recommended approach for this needs.

Current pxc and ocp version included in the diagram.

matthewb · March 13, 2023, 8:52pm

Hello @Francisco_Ruben_Jime,
Just to clarify, do you have 3-node clusters in each OCP cluster (total 9 servers)? Or just 1 PXC, with 3-nodes (total 3 servers), each node in a different OCP?

Your diagram seems to show the former, 3 separate clusters, using async replication to keep 2 clusters in sync with the primary cluster. If this is the case, I would do the following:

Stop replication from PXC1->PXC3
Establish replication from PXC2->PXC3
Shut down PXC1 as needed based on your plan
Move applications to PXC2

The tricky part comes with PXC1 is back online. You will need to configure replication from PXC2->PXC1 in order to sync all the changes that occurred on PXC2 while PXC1 was offline

Francisco_Ruben_Jime · March 13, 2023, 9:57pm

Your supposition was right, We have 3-node clusters in each OCP cluster (total 9 servers).

Analyzing your suggested action path… and I apologize in advance if I am doing so basic questions, This is our very first implementation of the Percona Operator for MySQL, hopefully not the last one.

When you mention: stopping the replication is just as simple as removing the replicationChannels section from the cr.yaml file, or involve a pause: true of the PXC cluster?

Your step number two, involves first Stop replication from PXC2, and after that configure as a isSource: true for PXC2 and finally configure replication from PXC2 to PXC3.

The step number three, involve a pause: true of the PXC1 cluster, right? for a graceful poweroff of the running instances.

I think we understood the logic of your suggested steps, I have a final doubt, all those steps can be safely without any db service disruption? Is there a plan to do the mentioned process automatically? a sort of Orchestrator: MySQL Replication Topology Manager, for Percona Operator for MySQL.

matthewb · March 14, 2023, 1:29am

I must apologize, I am not familiar enough with K8S and our operator to give guidance on those aspects within the config files. The steps I gave were a high-level process.

You have a highly complex and unusual setup. There isn’t anything I know of that can do this automatically. If you were using a standard MySQL replication, then yes, Orchestrator would be able to handle this via some drag-n-drop. But since you are using PXC in a multi-cluster setup, Orchestrator can’t help here.

Topic		Replies	Views
Replication Cluster MySQL & MariaDB	5	765	January 9, 2024
Migrate from percona server 8.0 to percona xtradb cluster MySQL & MariaDB percona	2	42	March 4, 2025
Percona XtraDB Cluster Operator cross-site replication failover Percona Operator for MySQL	3	961	October 19, 2023
Integrate replication to percona operator pxc cluster frm existing mysql cluster Percona Operator for MySQL	21	1990	October 20, 2022
Upgrading Percona 5.7 to 8.0 Using Asynchronous Replication Percona XtraDB Cluster 8.x mysql , percona , new-release	1	1332	April 10, 2023

Recommended actions for PXC in case maintenance windows

Related topics