Hi guys! We are configuring a backup policy in our cluster. We’ve defined two storage types (s3 and pvc). When we run an on demand backup, we get this error in pod logs:
+ peer-list -on-start=/usr/bin/get-pxc-state -service=cluster1-pxc
2020/09/10 14:37:27 Peer finder enter
2020/09/10 14:37:27 Determined Domain to be pxc.svc.cluster.local
2020/09/10 14:37:27 Peer list updated
was []
now [cluster1-pxc-0.cluster1-pxc.pxc.svc.cluster.local cluster1-pxc-1.cluster1-pxc.pxc.svc.cluster.local cluster1-pxc-2.cluster1-pxc.pxc.svc.cluster.local]
2020/09/10 14:37:27 execing: /usr/bin/get-pxc-state with stdin: cluster1-pxc-0.cluster1-pxc.pxc.svc.cluster.local
cluster1-pxc-1.cluster1-pxc.pxc.svc.cluster.local
cluster1-pxc-2.cluster1-pxc.pxc.svc.cluster.local
2020/09/10 14:37:27
node:cluster1-pxc-0.cluster1-pxc.pxc.svc.cluster.local:wsrep_ready:ON:wsrep_connected:ON:wsrep_local_state_comment:Donor/Desynced:wsrep_cluster_status:Primary
node:cluster1-pxc-1.cluster1-pxc.pxc.svc.cluster.local:wsrep_ready:ON:wsrep_connected:ON:wsrep_local_state_comment:Donor/Desynced:wsrep_cluster_status:Primary
node:cluster1-pxc-2.cluster1-pxc.pxc.svc.cluster.local:wsrep_ready:ON:wsrep_connected:ON:wsrep_local_state_comment:Donor/Desynced:wsrep_cluster_status:Primary
2020/09/10 14:37:28 Peer finder exiting
[ERROR] Cannot find node for backup
+ echo '[ERROR] Cannot find node for backup'
+ exit 1
Do you have any clue about what is happening? Many thanks in advance! Marc
I see the same error also with storageName: fs-pvc. The database in the instance is still empty. I don’t think it takes so long until full sync status is reached.
But anyway, I’m not able to see any unsynced or synced cluster status anywhere.
@mygov As you can see from the log you have three PXC nodes and two of them are in ‘Donor’ state(you have the backups in running state). Donor could not been chosen for backup. The first pxc-0 node has ‘Synced’ state but it is the first ‘primary’ pod which accepts the writes. So, it also can’t be used for backup. Did you make the several backups at the same time?
I deleted the manual backup jobs before I started a new one. So I will not see any active jobs or running pods which indicate that something is still going on. It would be great to see some status somewhere when the cluster is ready to proceed with another backupjob.
After trying to start again a backup in about 10-20 minutes, the backup runs successfully.
The reason for starting the backupjobs one behind the other was an invalid s3 bucket configuration and a server timeout.
Hi, iam having the same issue changed the endpointurl’s but still the error is same and s3 credentials are working for other application backup jobs. Using the latest 1.10.0 pxc.
2022/01/27 03:13:22
node:cluster1-pxc-db-pxc-0.cluster1-pxc-db-pxc.pxo.svc.cluster.local:wsrep_ready:ON:wsrep_connected:ON:wsrep_local_state_comment:Synced:wsrep_cluster_status:Primary:wsrep_cluster_size:3
node:cluster1-pxc-db-pxc-1.cluster1-pxc-db-pxc.pxo.svc.cluster.local:wsrep_ready:ON:wsrep_connected:ON:wsrep_local_state_comment:Donor/Desynced:wsrep_cluster_status:Primary:wsrep_cluster_size:3
node:cluster1-pxc-db-pxc-2.cluster1-pxc-db-pxc.pxo.svc.cluster.local:wsrep_ready:ON:wsrep_connected:ON:wsrep_local_state_comment:Donor/Desynced:wsrep_cluster_status:Primary:wsrep_cluster_size:3
2022/01/27 03:13:23 Peer finder exiting