on RHEL 6
Percona-XtraDB-Cluster-server-56-5.6.15-25.5.759.rhel6.x86_64
(with Percona-XtraDB-Cluster-galera-3-3.5-1.216.rhel6.x86_64)
is running a 3 node cluster fine
upgrading one node to
Percona-XtraDB-Cluster-shared-56-5.6.19-25.6.824.el6.x86_64
(and to Percona-XtraDB-Cluster-galera-3-3.6-1.3190.rhel6.x86_6)
breaks it, SST is working but then:
2014-08-11 16:25:45 32720 [Note] WSREP: Member 2.0 (xdb8071) requested state transfer from ‘xdb8069’. Selected 1.0 (xdb8069)(SYNCED) as donor.
2014-08-11 16:25:45 32720 [Note] WSREP: Shifting PRIMARY → JOINER (TO: 6558)
2014-08-11 16:25:45 32720 [Note] WSREP: Requesting state transfer: success, donor: 1
WSREP_SST: [INFO] Proceeding with SST (20140811 16:25:45.992)
WSREP_SST: [INFO] Cleaning the existing datadir (20140811 16:25:45.995)
removed /data1/mysql/data/gvwstate.dat' WSREP_SST: [INFO] Cleaning the binlog directory /data/mysql/logs as well (20140811 16:25:46.017) rm: cannot remove
/data/mysql/logs/.index’: No such file or directory
WSREP_SST: [INFO] Evaluating socat -u TCP-LISTEN:4444,reuseaddr stdio | xbstream -x; RC=( ${PIPESTATUS[@]} ) (20140811 16:25:46.024)
2014-08-11 16:26:26 32720 [Note] WSREP: 1.0 (xdb8069): State transfer to 2.0 (xdb8071) complete.
2014-08-11 16:26:26 32720 [Note] WSREP: Member 1.0 (xdb8069) synced with group.
ls: cannot access mysql-binlog-8071.: No such file or directory
WSREP_SST: [INFO] Preparing the backup at /data1/mysql/data/ (20140811 16:26:26.548)
WSREP_SST: [INFO] Evaluating innobackupex --no-version-check --apply-log $rebuildcmd ${DATA} &>${DATA}/innobackup.prepare.log (20140811 16:26:26.551)
WSREP_SST: [INFO] Total time on joiner: 0 seconds (20140811 16:26:35.265)
WSREP_SST: [INFO] Removing the sst_in_progress file (20140811 16:26:35.268)
2014-08-11 16:26:35 32720 [Note] WSREP: SST complete, seqno: 6558
2014-08-11 16:26:35 32720 [Note] Plugin ‘FEDERATED’ is disabled.
2014-08-11 16:26:35 32720 [Note] InnoDB: Using atomics to ref count buffer pool pages
2014-08-11 16:26:35 32720 [Note] InnoDB: The InnoDB memory heap is disabled
2014-08-11 16:26:35 32720 [Note] InnoDB: Mutexes and rw_locks use GCC atomic builtins
2014-08-11 16:26:35 32720 [Note] InnoDB: Compressed tables use zlib 1.2.3
2014-08-11 16:26:35 32720 [Note] InnoDB: Using Linux native AIO
2014-08-11 16:26:35 32720 [Note] InnoDB: Using CPU crc32 instructions
2014-08-11 16:26:35 32720 [Note] InnoDB: Initializing buffer pool, size = 39.1G
2014-08-11 16:26:37 32720 [Note] InnoDB: Completed initialization of buffer pool
2014-08-11 16:26:37 32720 [Note] InnoDB: Highest supported file format is Barracuda.
2014-08-11 16:26:38 32720 [Note] InnoDB: 128 rollback segment(s) are active.
2014-08-11 16:26:38 32720 [Note] InnoDB: Waiting for purge to start
2014-08-11 16:26:38 32720 [Note] InnoDB: Percona XtraDB (http://www.percona.com) 5.6.19-67.0 started; log sequence number 3816493669
2014-08-11 16:26:38 32720 [ERROR] Aborting
2014-08-11 16:26:40 32720 [Note] WSREP: Closing send monitor…
2014-08-11 16:26:40 32720 [Note] WSREP: Closed send monitor.
…
2014-08-11 16:26:44 32720 [Note] InnoDB: Shutdown completed; log sequence number 3816496970
log of the donor node:
2014-08-11 16:26:26 3587 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
WSREP_SST: [INFO] Total time on donor: 0 seconds (20140811 16:26:26.530)
WSREP_SST: [INFO] Cleaning up temporary directories (20140811 16:26:26.536)
2014-08-11 16:26:40 3587 [Note] WSREP: declaring 409fe6e9-0da2-11e4-a8dc-f6c52645ad47 stable
2014-08-11 16:26:40 3587 [Note] WSREP: (5df674bc-0da2-11e4-8264-afe0e5387779, ‘tcp://0.0.0.0:4567’) address ‘tcp://10.64.218.40:4567’ pointing to uuid 5df674bc-0da2-11e4-8264-afe0e5387779 is blacklisted, s
kipping
2014-08-11 16:26:40 3587 [Note] WSREP: (5df674bc-0da2-11e4-8264-afe0e5387779, ‘tcp://0.0.0.0:4567’) address ‘tcp://10.64.218.40:4567’ pointing to uuid 5df674bc-0da2-11e4-8264-afe0e5387779 is blacklisted, s
kipping
2014-08-11 16:26:40 3587 [Note] WSREP: (5df674bc-0da2-11e4-8264-afe0e5387779, ‘tcp://0.0.0.0:4567’) turning message relay requesting on, nonlive peers: tcp://10.64.218.42:4567
2014-08-11 16:26:40 3587 [Note] WSREP: Node 409fe6e9-0da2-11e4-a8dc-f6c52645ad47 state prim
2014-08-11 16:26:40 3587 [Note] WSREP: view(view_id(PRIM,409fe6e9-0da2-11e4-a8dc-f6c52645ad47,34) memb {
409fe6e9-0da2-11e4-a8dc-f6c52645ad47,0
5df674bc-0da2-11e4-8264-afe0e5387779,0
} joined {
} left {
} partitioned {
5eb5d970-2163-11e4-ad72-7bd016470afd,0
})
2014-08-11 16:26:40 3587 [Note] WSREP: forgetting 5eb5d970-2163-11e4-ad72-7bd016470afd (tcp://10.64.218.42:4567)
2014-08-11 16:26:40 3587 [Note] WSREP: (5df674bc-0da2-11e4-8264-afe0e5387779, ‘tcp://0.0.0.0:4567’) address ‘tcp://10.64.218.40:4567’ pointing to uuid 5df674bc-0da2-11e4-8264-afe0e5387779 is blacklisted, s
kipping
2014-08-11 16:26:40 3587 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 1, memb_num = 2
2014-08-11 16:26:40 3587 [Note] WSREP: (5df674bc-0da2-11e4-8264-afe0e5387779, ‘tcp://0.0.0.0:4567’) turning message relay requesting off
2014-08-11 16:26:40 3587 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
2014-08-11 16:26:40 3587 [Note] WSREP: STATE EXCHANGE: sent state msg: 82731576-2163-11e4-9a53-87f667275baa
2014-08-11 16:26:40 3587 [Note] WSREP: STATE EXCHANGE: got state msg: 82731576-2163-11e4-9a53-87f667275baa from 0 (xdb8070)
2014-08-11 16:26:40 3587 [Note] WSREP: STATE EXCHANGE: got state msg: 82731576-2163-11e4-9a53-87f667275baa from 1 (xdb8069)
2014-08-11 16:26:40 3587 [Note] WSREP: Quorum results:
version = 3,
component = PRIMARY,
conf_id = 32,
members = 2/2 (joined/total),
act_id = 6558,
last_appl. = 6527,
protocols = 0/5/2 (gcs/repl/appl),
group UUID = fd641b3d-0d07-11e4-bb59-bfa063c9be5f
2014-08-11 16:26:40 3587 [Note] WSREP: Flow-control interval: [23, 23]
2014-08-11 16:26:40 3587 [Note] WSREP: New cluster view: global state: fd641b3d-0d07-11e4-bb59-bfa063c9be5f:6558, view# 33: Primary, number of nodes: 2, my index: 1, protocol version 2
2014-08-11 16:26:40 3587 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
2014-08-11 16:26:40 3587 [Note] WSREP: REPL Protocols: 5 (3, 1)
2014-08-11 16:26:40 3587 [Note] WSREP: Service thread queue flushed.
2014-08-11 16:26:40 3587 [Note] WSREP: Assign initial position for certification: 6558, protocol version: 3
2014-08-11 16:26:40 3587 [Note] WSREP: Service thread queue flushed.
2014-08-11 16:26:40 3587 [Warning] WSREP: Releasing seqno 6558 before 6559 was assigned.
2014-08-11 16:26:45 3587 [Note] WSREP: cleaning up 5eb5d970-2163-11e4-ad72-7bd016470afd (tcp://10.64.218.42:4567)
yum downgrade to 5.6.15. and node is up again.
I tried:
rm -rf in the datadir to force an SST, didn’t help.
Have I done it wrong? Is there a need to upgrade galera first on all node or something like this, or is 5.6.19 broken?