So I decided to downgrade the packages to the previous versions (we’re installing from a local repo) and I can confirm that it SST’s correctly and synchronises with the rest of the cluster successfully. We’re going to try and spin up a Debian 7 VM with the same packages as our existing Debian physical nodes to see if that has the same issue as the RHEL box.
Here’s the log from the older version as it successfully SST’s. Do note that to downgrade back to this version I had to do:
rpm -e --nodeps mysql-libs-5.1.73-5.el6_6.x86_64
…and then install the compatibility libs as there’s a conflict between the PXC RPMs and the RHEL6 ones in this version - it’s not necessary for the latest 5.5 and in fact can’t be done as you’ll end up with missing mysql libraries with version 16 which things like Postfix rely on.
Jun 9 15:33:43 hostname mysqld_safe: Starting mysqld daemon with databases from /var/lib/mysql
Jun 9 15:33:43 hostname mysqld_safe: Skipping wsrep-recover for empty datadir: /var/lib/mysql
Jun 9 15:33:43 hostname mysqld_safe: Assigning 00000000-0000-0000-0000-000000000000:-1 to wsrep_start_position
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: wsrep_start_position var submitted: '00000000-0000-0000-0000-000000000000:-1'
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: Read nil XID from storage engines, skipping position init
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib64/libgalera_smm.so'
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: wsrep_load(): Galera 2.8(r165) by Codership Oy <info@codership.com> loaded successfully.
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Warning] WSREP: Could not open saved state file for reading: /var/lib/mysql//grastate.dat
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: Found saved state: 00000000-0000-0000-0000-000000000000:-1
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: Preallocating 134219048/134219048 bytes in '/var/lib/mysql//galera.cache'...
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: Passing config to GCS: base_host = 10.3.0.60; base_port = 4567; cert.log_conflicts = no; gcache.dir = /var/lib/mysql/; gcache.keep_pages_size = 0; gcache.mem_size = 0; gcache.name = /var/lib/mysql//galera.cache; gcache.page_size = 128M; gcache.size = 128M; gcs.fc_debug = 0; gcs.fc_factor = 1; gcs.fc_limit = 16; gcs.fc_master_slave = NO; gcs.max_packet_size = 64500; gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807; gcs.recv_q_soft_limit = 0.25; gcs.sync_donor = NO; replicator.causal_read_timeout = PT30S; replicator.commit_order = 3
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: Assign initial position for certification: -1, protocol version: -1
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: wsrep_sst_grab()
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: Start replication
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: Setting initial position to 00000000-0000-0000-0000-000000000000:-1
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: protonet asio version 0
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: backend: asio
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: GMCast version 0
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: (bb0789d6-2e03-11e6-85f8-9e0977225df2, 'tcp://0.0.0.0:4567') listening at tcp://0.0.0.0:4567
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: (bb0789d6-2e03-11e6-85f8-9e0977225df2, 'tcp://0.0.0.0:4567') multicast: , ttl: 1
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: EVS version 0
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: PC version 0
Jun 9 15:33:43 hostname mysqld: 160609 15:33:43 [Note] WSREP: gcomm: connecting to group 'my_wsrep_cluster', peer '10.11.0.40:,10.11.0.41:'
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: declaring 45d51314-2b98-11e6-b605-d2ac634302f6 stable
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: declaring d41e23c0-2b9b-11e6-80ca-7bb6c4e547d6 stable
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Node 45d51314-2b98-11e6-b605-d2ac634302f6 state prim
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: view(view_id(PRIM,45d51314-2b98-11e6-b605-d2ac634302f6,122) memb {
Jun 9 15:33:44 hostname mysqld: #01145d51314-2b98-11e6-b605-d2ac634302f6,
Jun 9 15:33:44 hostname mysqld: #011bb0789d6-2e03-11e6-85f8-9e0977225df2,
Jun 9 15:33:44 hostname mysqld: #011d41e23c0-2b9b-11e6-80ca-7bb6c4e547d6,
Jun 9 15:33:44 hostname mysqld: } joined {
Jun 9 15:33:44 hostname mysqld: } left {
Jun 9 15:33:44 hostname mysqld: } partitioned {
Jun 9 15:33:44 hostname mysqld: })
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: gcomm: connected
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Changing maximum packet size to 64500, resulting msg size: 32636
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Opened channel 'my_wsrep_cluster'
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 1, memb_num = 3
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Waiting for SST to complete.
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: STATE EXCHANGE: sent state msg: bb547472-2e03-11e6-a890-52d118c26d50
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: STATE EXCHANGE: got state msg: bb547472-2e03-11e6-a890-52d118c26d50 from 0 (db3)
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: STATE EXCHANGE: got state msg: bb547472-2e03-11e6-a890-52d118c26d50 from 2 (db4)
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: STATE EXCHANGE: got state msg: bb547472-2e03-11e6-a890-52d118c26d50 from 1 (hostname)
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Quorum results:
Jun 9 15:33:44 hostname mysqld: #011version = 2,
Jun 9 15:33:44 hostname mysqld: #011component = PRIMARY,
Jun 9 15:33:44 hostname mysqld: #011conf_id = 120,
Jun 9 15:33:44 hostname mysqld: #011members = 2/3 (joined/total),
Jun 9 15:33:44 hostname mysqld: #011act_id = 126813177,
Jun 9 15:33:44 hostname mysqld: #011last_appl. = -1,
Jun 9 15:33:44 hostname mysqld: #011protocols = 0/4/2 (gcs/repl/appl),
Jun 9 15:33:44 hostname mysqld: #011group UUID = 874d7052-1e62-11e3-aa8d-97e571a97d4a
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Flow-control interval: [28, 28]
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: Shifting OPEN -> PRIMARY (TO: 126813177)
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: State transfer required:
Jun 9 15:33:44 hostname mysqld: #011Group state: 874d7052-1e62-11e3-aa8d-97e571a97d4a:126813177
Jun 9 15:33:44 hostname mysqld: #011Local state: 00000000-0000-0000-0000-000000000000:-1
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Note] WSREP: New cluster view: global state: 874d7052-1e62-11e3-aa8d-97e571a97d4a:126813177, view# 121: Primary, number of nodes: 3, my index: 1, protocol version 2
Jun 9 15:33:44 hostname mysqld: 160609 15:33:44 [Warning] WSREP: Gap in state sequence. Need state transfer.
Jun 9 15:33:46 hostname mysqld: 160609 15:33:46 [Note] WSREP: Running: 'wsrep_sst_xtrabackup-v2 --role 'joiner' --address '10.3.0.60' --auth 'vlscisst:TooF3agh' --datadir '/var/lib/mysql/' --defaults-file '/etc/my.cnf' --parent '4736''
Jun 9 15:33:47 hostname mysqld: WSREP_SST: [INFO] Streaming with xbstream (20160609 15:33:47.042)
Jun 9 15:33:47 hostname mysqld: WSREP_SST: [INFO] Using socat as streamer (20160609 15:33:47.045)
Jun 9 15:33:47 hostname mysqld: WSREP_SST: [INFO] Evaluating socat -u TCP-LISTEN:4444,reuseaddr stdio | xbstream -x; RC=( ${PIPESTATUS[@]} ) (20160609 15:33:47.065)
Jun 9 15:33:47 hostname mysqld: 160609 15:33:47 [Note] WSREP: Prepared SST request: xtrabackup-v2|10.3.0.60:4444/xtrabackup_sst
Jun 9 15:33:47 hostname mysqld: 160609 15:33:47 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
Jun 9 15:33:47 hostname mysqld: 160609 15:33:47 [Note] WSREP: Assign initial position for certification: 126813177, protocol version: 2
Jun 9 15:33:47 hostname mysqld: 160609 15:33:47 [Warning] WSREP: Failed to prepare for incremental state transfer: Local state UUID (00000000-0000-0000-0000-000000000000) does not match group state UUID (874d7052-1e62-11e3-aa8d-97e571a97d4a): 1 (Operation not permitted)
Jun 9 15:33:47 hostname mysqld: #011 at galera/src/replicator_str.cpp:prepare_for_IST():445. IST will be unavailable.
Jun 9 15:33:47 hostname mysqld: 160609 15:33:47 [Note] WSREP: Node 1 (hostname) requested state transfer from '*any*'. Selected 0 (db3)(SYNCED) as donor.
Jun 9 15:33:47 hostname mysqld: 160609 15:33:47 [Note] WSREP: Shifting PRIMARY -> JOINER (TO: 126813177)
Jun 9 15:33:47 hostname mysqld: 160609 15:33:47 [Note] WSREP: Requesting state transfer: success, donor: 0
Jun 9 15:33:47 hostname mysqld: WSREP_SST: [INFO] Proceeding with SST (20160609 15:33:47.996)
Jun 9 15:33:48 hostname mysqld: WSREP_SST: [INFO] Cleaning the existing datadir (20160609 15:33:47.999)
Jun 9 15:33:48 hostname mysqld: WSREP_SST: [INFO] Evaluating socat -u TCP-LISTEN:4444,reuseaddr stdio | xbstream -x; RC=( ${PIPESTATUS[@]} ) (20160609 15:33:48.006)