Hello,
I recently tried to change from rsync to xtrabackup-v2 on an existing xtradb cluster on Ubuntu 14.04 but second and third nodes can’t join the custer.
From the logs it seem i’m having an issue with some sort of authentication or permissions.
The first node starts without any sort of error but i’m not able to join none of the nodes to it. When I change the sst to rsync, it works
Error log
(...)
2014-12-30 03:55:30 20657 [Note] WSREP: Found saved state: 00000000-0000-0000-0000-000000000000:-1
2014-12-30 03:55:30 20657 [Note] WSREP: Passing config to GCS: base_host = 192.168.11.103; base_port = 4567; cert.log_conflicts = no; debug = no; evs.auto_evict = 0; evs.delay_margin = PT1S; evs.delayed_keep_period = PT30S; evs.inactive_check_period = PT0.5S; evs.inactive_timeout = PT15S; evs.join_retrans_period = PT1S; evs.max_install_timeouts = 3; evs.send_window = 4; evs.stats_report_period = PT1M; evs.suspect_timeout = PT5S; evs.user_send_window = 2; evs.view_forget_timeout = PT24H; gcache.dir = /bd/mysql/; gcache.keep_pages_size = 0; gcache.mem_size = 0; gcache.name = /bd/mysql//galera.cache; gcache.page_size = 128M; gcache.size = 128M; gcs.fc_debug = 0; gcs.fc_factor = 1.0; gcs.fc_limit = 16; gcs.fc_master_slave = no; gcs.max_packet_size = 64500; gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807; gcs.recv_q_soft_limit = 0.25; gcs.sync_donor = no; gmcast.segment = 0; gmcast.version = 0; pc.announce_timeout = PT3S; pc.checksum = false; pc.ignore_quorum = false; pc.ignore_sb = false; pc.npvo = false; pc.recovery = 1; p
2014-12-30 03:55:30 20657 [Note] WSREP: Service thread queue flushed.
2014-12-30 03:55:30 20657 [Note] WSREP: Assign initial position for certification: -1, protocol version: -1
2014-12-30 03:55:30 20657 [Note] WSREP: wsrep_sst_grab()
2014-12-30 03:55:30 20657 [Note] WSREP: Start replication
2014-12-30 03:55:30 20657 [Note] WSREP: Setting initial position to 00000000-0000-0000-0000-000000000000:-1
2014-12-30 03:55:30 20657 [Note] WSREP: protonet asio version 0
2014-12-30 03:55:30 20657 [Note] WSREP: Using CRC-32C for message checksums.
2014-12-30 03:55:30 20657 [Note] WSREP: backend: asio
2014-12-30 03:55:30 20657 [Warning] WSREP: access file(gvwstate.dat) failed(No such file or directory)
2014-12-30 03:55:30 20657 [Note] WSREP: restore pc from disk failed
2014-12-30 03:55:30 20657 [Note] WSREP: GMCast version 0
2014-12-30 03:55:30 20657 [Note] WSREP: (b2bdfec8, 'tcp://0.0.0.0:4567') listening at tcp://0.0.0.0:4567
2014-12-30 03:55:30 20657 [Note] WSREP: (b2bdfec8, 'tcp://0.0.0.0:4567') multicast: , ttl: 1
2014-12-30 03:55:30 20657 [Note] WSREP: EVS version 0
2014-12-30 03:55:30 20657 [Note] WSREP: gcomm: connecting to group 'nvcluster', peer '192.168.11.101:,192.168.11.102:'
2014-12-30 03:55:30 20657 [Note] WSREP: (b2bdfec8, 'tcp://0.0.0.0:4567') turning message relay requesting on, nonlive peers:
2014-12-30 03:55:31 20657 [Note] WSREP: declaring b826dbf4 at tcp://192.168.11.101:4567 stable
2014-12-30 03:55:31 20657 [Note] WSREP: Node b826dbf4 state prim
2014-12-30 03:55:31 20657 [Note] WSREP: view(view_id(PRIM,b2bdfec8,30) memb {
b2bdfec8,0
b826dbf4,0
} joined {
} left {
} partitioned {
})
2014-12-30 03:55:31 20657 [Note] WSREP: save pc into disk
2014-12-30 03:55:31 20657 [Note] WSREP: discarding pending addr without UUID: tcp://192.168.11.102:4567
2014-12-30 03:55:31 20657 [Note] WSREP: gcomm: connected
2014-12-30 03:55:31 20657 [Note] WSREP: Changing maximum packet size to 64500, resulting msg size: 32636
2014-12-30 03:55:31 20657 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
2014-12-30 03:55:31 20657 [Note] WSREP: Opened channel 'nvcluster'
2014-12-30 03:55:31 20657 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 0, memb_num = 2
2014-12-30 03:55:31 20657 [Note] WSREP: Waiting for SST to complete.
2014-12-30 03:55:31 20657 [Note] WSREP: STATE_EXCHANGE: sent state UUID: b30afe4c-8fd7-11e4-8df5-4f19a22bc0aa
2014-12-30 03:55:31 20657 [Note] WSREP: STATE EXCHANGE: sent state msg: b30afe4c-8fd7-11e4-8df5-4f19a22bc0aa
2014-12-30 03:55:31 20657 [Note] WSREP: STATE EXCHANGE: got state msg: b30afe4c-8fd7-11e4-8df5-4f19a22bc0aa from 0 (node03)
2014-12-30 03:55:31 20657 [Note] WSREP: STATE EXCHANGE: got state msg: b30afe4c-8fd7-11e4-8df5-4f19a22bc0aa from 1 (node01)
2014-12-30 03:55:31 20657 [Note] WSREP: Quorum results:
version = 3,
component = PRIMARY,
conf_id = 29,
members = 1/2 (joined/total),
act_id = 161486336,
last_appl. = -1,
protocols = 0/5/3 (gcs/repl/appl),
group UUID = 51143ec4-6c99-11e4-ab3f-87c51f793d2d
2014-12-30 03:55:31 20657 [Note] WSREP: Flow-control interval: [23, 23]
2014-12-30 03:55:31 20657 [Note] WSREP: Shifting OPEN -> PRIMARY (TO: 161486336)
2014-12-30 03:55:31 20657 [Note] WSREP: State transfer required:
Group state: 51143ec4-6c99-11e4-ab3f-87c51f793d2d:161486336
Local state: 00000000-0000-0000-0000-000000000000:-1
2014-12-30 03:55:31 20657 [Note] WSREP: New cluster view: global state: 51143ec4-6c99-11e4-ab3f-87c51f793d2d:161486336, view# 30: Primary, number of nodes: 2, my index: 0, protocol version 3
2014-12-30 03:55:31 20657 [Warning] WSREP: Gap in state sequence. Need state transfer.
2014-12-30 03:55:31 20657 [Note] WSREP: Running: 'wsrep_sst_xtrabackup-v2 --role 'joiner' --address '192.168.11.103' --auth 'sstuser:ssts3cret' --datadir '/bd/mysql/' --defaults-file '/etc/mysql/my.cnf' --parent '20657' '' '
WSREP_SST: [INFO] Streaming with xbstream (20141230 03:55:31.456)
WSREP_SST: [INFO] Using socat as streamer (20141230 03:55:31.459)
WSREP_SST: [INFO] Stale sst_in_progress file: /bd/mysql//sst_in_progress (20141230 03:55:31.929)
WSREP_SST: [INFO] Evaluating timeout -k 110 100 socat -u TCP-LISTEN:4444,reuseaddr stdio | xbstream -x; RC=( ${PIPESTATUS[@]} ) (20141230 03:55:31.973)
2014-12-30 03:55:32 20657 [Note] WSREP: Prepared SST request: xtrabackup-v2|192.168.11.103:4444/xtrabackup_sst//1
2014-12-30 03:55:32 20657 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
2014-12-30 03:55:32 20657 [Note] WSREP: REPL Protocols: 5 (3, 1)
2014-12-30 03:55:32 20657 [Note] WSREP: Service thread queue flushed.
2014-12-30 03:55:32 20657 [Note] WSREP: Assign initial position for certification: 161486336, protocol version: 3
2014-12-30 03:55:32 20657 [Note] WSREP: Service thread queue flushed.
2014-12-30 03:55:32 20657 [Warning] WSREP: Failed to prepare for incremental state transfer: Local state UUID (00000000-0000-0000-0000-000000000000) does not match group state UUID (51143ec4-6c99-11e4-ab3f-87c51f793d2d): 1 (Operation not permitted)
at galera/src/replicator_str.cpp:prepare_for_IST():456. IST will be unavailable.
2014-12-30 03:55:32 20657 [Note] WSREP: Member 0.0 (node03) requested state transfer from '*any*'. Selected 1.0 (node01)(SYNCED) as donor.
2014-12-30 03:55:32 20657 [Note] WSREP: Shifting PRIMARY -> JOINER (TO: 161486336)
2014-12-30 03:55:32 20657 [Note] WSREP: Requesting state transfer: success, donor: 1
WSREP_SST: [INFO] Proceeding with SST (20141230 03:55:32.824)
WSREP_SST: [INFO] Cleaning the existing datadir (20141230 03:55:32.828)
removed ‘/bd/mysql/ib_logfile0’
removed ‘/bd/mysql/ib_logfile1’
removed ‘/bd/mysql/backup-my.cnf’
removed ‘/bd/mysql/ibdata1’
WSREP_SST: [INFO] Cleaning the binlog directory /bd as well (20141230 03:55:33.264)
[COLOR=#FF0000]WSREP_SST: [INFO] Evaluating socat -u TCP-LISTEN:4444,reuseaddr stdio | xbstream -x; RC=( ${PIPESTATUS[@]} ) (20141230 03:55:33.273)
2014-12-30 03:55:34 20657 [Note] WSREP: (b2bdfec8, 'tcp://0.0.0.0:4567') turning message relay requesting off
grep: /bd/mysql//xtrabackup_checkpoints: No such file or directory
2014-12-30 03:55:43 20657 [Warning] WSREP: 1.0 (node01): State transfer to 0.0 (node03) failed: -22 (Invalid argument)
2014-12-30 03:55:43 20657 [ERROR] WSREP: gcs/src/gcs_group.cpp:int gcs_group_handle_join_msg(gcs_group_t*, const gcs_recv_msg_t*)():722: Will never receive state. Need to abort.
2014-12-30 03:55:43 20657 [Note] WSREP: gcomm: terminating thread
2014-12-30 03:55:43 20657 [Note] WSREP: gcomm: joining thread
2014-12-30 03:55:43 20657 [Note] WSREP: gcomm: closing backend
WSREP_SST: [INFO] Preparing the backup at /bd/mysql/ (20141230 03:55:43.733)
WSREP_SST: [INFO] Evaluating innobackupex --no-version-check --apply-log $rebuildcmd ${DATA} &>${DATA}/innobackup.prepare.log (20141230 03:55:43.737)
WSREP_SST: [ERROR] Cleanup after exit with status:1 (20141230 03:55:44.093)
[COLOR=#FF0000]2014-12-30 03:55:44 20657 [ERROR] WSREP: Process completed with error: wsrep_sst_xtrabackup-v2 --role 'joiner' --address '192.168.11.103' --auth 'sstuser:ssts3cret' --datadir '/bd/mysql/' --defaults-file '/etc/mysql/my.cnf' --parent '20657' '' : 1 (Operation not permitted)
2014-12-30 03:55:44 20657 [ERROR] WSREP: Failed to read uuid:seqno from joiner script.
2014-12-30 03:55:44 20657 [ERROR] WSREP: SST failed: 1 (Operation not permitted)
2014-12-30 03:55:44 20657 [ERROR] Aborting
2014-12-30 03:55:44 20657 [Note] WSREP: view(view_id(NON_PRIM,b2bdfec8,30) memb {
b2bdfec8,0
} joined {
} left {
} partitioned {
b826dbf4,0
})
2014-12-30 03:55:44 20657 [Note] WSREP: view((empty))
2014-12-30 03:55:44 20657 [Note] WSREP: gcomm: closed
2014-12-30 03:55:44 20657 [Note] WSREP: /usr/sbin/mysqld: Terminated.
Aborted (core dumped)
141230 03:55:45 mysqld_safe mysqld from pid file /var/run/mysqld/mysqld.pid ended
They are running xtradb version 5.6.
Can anyone help me, please.
Thank you.