Master-Master set-up with Replication slaves and binlog woes

Imran-UK · September 24, 2009, 7:54am

I want to fix a replication issue with a 2-node cluster (one active, one passive) that is using Heartbeat and a single virtual IP for failover. The nodes are in Master-Master configuration (that is, each is the slave and master of the other).

I have several other hosts that are replication slaves from the active node. They connect to MySQL via TCP over an SSH tunnels.

When failover occurs, the passive node becomes the active node.
However the replication slaves stop replicating. The error from a log on one of the slaves is:

Jul 15 07:43:32 mysqld[1339]: 090715 7:43:32 [Note] Slave I/O
thread: conn
ected to master ‘@127.0.0.1:3307’, replication started in log
‘mysql-bin.00
0978’ at position 23923243
Jul 15 07:43:32 mysqld[1339]: 090715 7:43:32 [ERROR] Error
reading packet
from server: Could not find first log file name in binary log index file ( serve
r_errno=1236)
Jul 15 07:43:32 mysqld[1339]: 090715 7:43:32 [ERROR] Got fatal
error 1236:
‘Could not find first log file name in binary log index file’ from master when
reading data from binary log
Jul 15 07:43:32 mysqld[1339]: 090715 7:43:32 [Note] Slave I/O thread
exiting, read up to log ‘mysql-bin.000978’, position 23923243

I do not think this is an SSH tunnel issue. I believe this is because of inconsistent binary log file names and positions between the two nodes. Probably because one of the nodes had been in operation a lot longer than the other.

At the moment I have to get replication going by dumping the master databases again, re-import to the slave hosts and bootstrap the slaves.

What is the best way to make this consistent and ensure that replication continues smoothly after a failover (and failback) event?

I have asked this question on the mysql-general list and was pointed to the mmm-mysql project. While that certainly looks of great interest I want to explore the possibility of a less-complex solution.

Thanks,
Im

Imran-UK · October 23, 2009, 1:56pm

Wow, 5800+ views and no reply )

Someone at linbit.com proposed DRBD as a solution so I’m now going to evaluate that.

Any comments from production users of DRBD with Master-Master are most welcome.

Thanks

dizhenxiong · October 24, 2009, 1:28am

)

Topic		Replies	Views
replication problem after master crash Other MySQL® Questions	2	763	May 3, 2007
percona replication error after shutting down the slave Other MySQL® Questions	2	700	July 14, 2015
our slave fell behind due to restart and now getting error Other MySQL® Questions	2	693	October 18, 2010
Last_IO_Error: Got fatal error 1236 Other MySQL® Questions	10	27080	March 17, 2023
Last_IO_Error: Got fatal error 1236 from source when reading data from binary log: 'Could not find first log file name in binary log index file' MySQL & MariaDB mysql , percona	1	406	July 25, 2024

Master-Master set-up with Replication slaves and binlog woes

Related topics