I have Percona Xtradb Cluster in Azure VM with Three node (2 were MySQL Node NodeA and NodeB, 1 Garb Arbitrator NodeC)
Node A and C in Vnet and connected to NodeB with single VNET Peering
I have joined these cluster with Vnet Peering and private IP address. Issue is MySQL Node is getting connection lost and mysql stopped following were the logs found. Is there is any fix for this.
In Stopped Node (NodeA)
WSREP: Failed to report last committed 20712650, -110 (Connection timed out)
WSREP: last inactive check more than PT1.5S (3*evs.inactive_check_period) ago (PT2.35911S), skipping check
Log of wsrep recovery (–wsrep-recover)
In Running Node (NodeB) - Bootstrap
2020-02-25T10:50:58.405580Z 0 [Note] WSREP: (9288914c, ‘tcp://0.0.0.0:4567’) connection to peer 807f459c with addr tcp://NodeB:4567 timed out, no messages seen in PT3S (gmcast.peer_timeout)
2020-02-25T10:50:58.406359Z 0 [Note] WSREP: (9288914c, ‘tcp://0.0.0.0:4567’) turning message relay requesting on, nonlive peers: tcp://NodeB:4567
2020-02-25T10:50:59.630488Z 0 [Note] WSREP: (9288914c, ‘tcp://0.0.0.0:4567’) reconnecting to 807f459c (tcp://NodeB:4567), attempt 0
2020-02-25T10:51:00.906739Z 0 [Note] WSREP: declaring node with index 0 suspected, timeout PT5S (evs.suspect_timeout)
2020-02-25T10:51:00.906800Z 0 [Note] WSREP: evs:: proto(9288914c, GATHER, view_id(REG,807f459c,3)) suspecting node: 807f459c
2020-02-25T10:51:00.906815Z 0 [Note] WSREP: evs:: proto(9288914c, GATHER, view_id(REG,807f459c,3)) suspected node without join message, declaring inactive
2020-02-25T10:51:01.406965Z 0 [Note] WSREP: declaring node with index 0 inactive (evs.inactive_timeout)
2020-02-25T10:51:01.527169Z 0 [Note] WSREP: declaring a4893046 at tcp://NodeC:4444 stable
2020-02-25T10:51:01.647292Z 0 [Note] WSREP: Node 9288914c state primary
2020-02-25T10:51:01.767513Z 0 [Note] WSREP: Current view of cluster as seen by this node
In Running Arbitrator Node (Node C)
2020-02-25 10:50:58.355 INFO: (a4893046, ‘tcp://0.0.0.0:4444’) connection to peer 807f459c with addr tcp://NodeB:4567 timed out, no messages seen in PT3S (gmcast.peer_timeout)
2020-02-25 10:50:58.355 INFO: (a4893046, ‘tcp://0.0.0.0:4444’) turning message relay requesting on, nonlive peers: tcp://NodeB:4567
2020-02-25 10:50:59.356 INFO: (a4893046, ‘tcp://0.0.0.0:4444’) reconnecting to 807f459c (tcp://NodeB:4567), attempt 0
2020-02-25 10:51:00.356 INFO: declaring node with index 0 suspected, timeout PT5S (evs.suspect_timeout)
2020-02-25 10:51:00.356 INFO: evs:: proto(a4893046, OPERATIONAL, view_id(REG,807f459c,3)) suspecting node: 807f459c
2020-02-25 10:51:00.356 INFO: evs:: proto(a4893046, OPERATIONAL, view_id(REG,807f459c,3)) suspected node without join message, declaring inactive
2020-02-25 10:51:00.856 INFO: declaring node with index 0 inactive (evs.inactive_timeout)