Replication generates huge disk write IO

Serge_Shakhov · June 11, 2013, 3:53pm

I noticed that MySQL replication is generating enormous disk write activity on the slave:

iotop: 10-20 MB/s

AWS volume monitor: 700-800 IOPS (versus only 150 IOPS on the master)

It lasts for hours. Historical graphs show that this repeats every day for 8-10 hours (I think until the slave has finished catching up).
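
One quick sanity check on the numbers above is the average size of each write, which is just throughput divided by IOPS. The sketch below is only back-of-the-envelope arithmetic; it assumes the iotop figure and the AWS IOPS figure cover the same period, and treats iotop's MB as MiB. For reference, the default InnoDB page size is 16 KB, so writes in this range might point to page flushing rather than large sequential writes, but that is a guess, not a diagnosis.

```python
def avg_write_kib(throughput_mib_per_s, iops):
    """Average size of one write in KiB, given throughput (MiB/s) and IOPS."""
    return throughput_mib_per_s * 1024 / iops


# Corner cases of the reported ranges: 10-20 MB/s at 700-800 IOPS.
for mib_per_s, iops in [(10, 800), (10, 700), (20, 800), (20, 700)]:
    size = avg_write_kib(mib_per_s, iops)
    print(f"{mib_per_s} MiB/s at {iops} IOPS: about {size:.1f} KiB per write")
```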

The replication threads are in the states “Reading event from the relay log” and “Waiting for master to send event”.
Replication is lagging behind the master; it is running, but VERY slowly.

There are no other active processes in MySQL that could generate this activity, and no backups run at this time. Replication activity is constant, but not high enough to explain this much I/O.

If I stop the slave SQL thread, the activity drops to zero immediately. After I start the slave again, it goes back up. Restarting mysqld doesn’t help.

What parameters/configs/metrics should I check?

Any help would be appreciated.

Percona Server 5.5 on CentOS.
The replicated database is about 500 GB, with about 300 tables.

niljoshi · June 12, 2013, 2:58am

Hi, I would suggest using pt-stalk (a Percona Toolkit utility that collects data about MySQL when a problem occurs, including traces and tcpdump output). You can find more information on how to use it here:

http://www.percona.com/doc/percona-toolkit/2.2/pt-stalk.html
http://www.mysqlperformanceblog.com/2013/01/03/percona-toolkit-by-example-pt-stalk/

Generally, you have to specify a trigger condition so that it knows when to start collecting data, e.g. a function, a variable, and a threshold. If you want to collect information right now, without waiting for a trigger to fire, you can simply run pt-stalk --no-stalk and check the result files to figure out where the problem is.
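
For anyone who wants to script the immediate collection described above, here is a minimal sketch that wraps the pt-stalk --no-stalk call. The function names, the destination directory, and the chosen values are hypothetical; --run-time, --iterations, and --dest are taken from the pt-stalk documentation, so check them against your installed version before relying on this.

```python
import shutil
import subprocess


def build_pt_stalk_command(dest="/tmp/pt-stalk-sample", run_time=30, iterations=1):
    """Build a pt-stalk command line that collects immediately, with no trigger.

    dest, run_time and iterations are illustrative defaults, not recommendations.
    """
    return [
        "pt-stalk",
        "--no-stalk",                    # do not wait for a trigger condition
        f"--run-time={run_time}",        # seconds of data to collect per run
        f"--iterations={iterations}",    # number of collection runs
        f"--dest={dest}",                # directory for the result files
    ]


def collect_once(**kwargs):
    """Run one collection if pt-stalk is installed; return its exit code or None."""
    cmd = build_pt_stalk_command(**kwargs)
    if shutil.which("pt-stalk") is None:
        print("pt-stalk not found in PATH; would have run:", " ".join(cmd))
        return None
    return subprocess.run(cmd, check=False).returncode
```

Calling collect_once() on the slave while the write activity is high should leave the result files in the destination directory for inspection.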

Serge_Shakhov · June 13, 2013, 7:15pm

Thank you.
What impact will it have on the system? Any additional workload could crash it.

niljoshi · June 19, 2013, 3:20am

Hi,

It should not have much impact on the system, but I would suggest reading the documentation carefully, including all the options, and testing it on a staging server before running it in production.

467675761 · January 10, 2016, 8:17am

I ran into the same situation, and more: once the slave process starts, the slave eats up all the memory and all the swap space, until finally the mysqld process runs out of memory, the kernel kills it, and MySQL restarts.

carakod · January 11, 2016, 3:11pm

Could it be this case: https://www.percona.com/blog/2014/01/21/beware-mysql-5-6-server-uuid-cloning-slaves/ ?

467675761 · February 3, 2016, 10:04am

No, it’s not the same case.

mirfan · February 9, 2016, 4:15am

If you are using row-based replication and some tables are missing a primary key/unique key, then you are probably hitting this bug: MySQL Bugs: #53375: RBR + no PK => High load on slave (table scan/cpu) => slave failure
Try to find out whether any of the database tables are missing a PK/UK (see http://datacharmer.blogspot.com/2011/09/finding-tables-without-primary-keys.html) and add an auto-increment PK if required.
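
As a rough sketch of that check (not the query from the linked post), the snippet below holds one possible information_schema query as a string, plus a small pure-Python helper that applies the same logic to rows you have already fetched. The names FIND_TABLES_WITHOUT_KEYS_SQL and tables_without_keys are made up for illustration, and the list of excluded system schemas is an assumption you may need to adjust.

```python
# One way to list base tables that have neither a PRIMARY KEY nor a UNIQUE
# constraint. Run it with any MySQL client; the excluded schemas are an assumption.
FIND_TABLES_WITHOUT_KEYS_SQL = """
SELECT t.table_schema, t.table_name
FROM information_schema.tables AS t
LEFT JOIN information_schema.table_constraints AS c
       ON c.table_schema = t.table_schema
      AND c.table_name = t.table_name
      AND c.constraint_type IN ('PRIMARY KEY', 'UNIQUE')
WHERE t.table_type = 'BASE TABLE'
  AND t.table_schema NOT IN ('mysql', 'information_schema', 'performance_schema')
  AND c.constraint_name IS NULL
ORDER BY t.table_schema, t.table_name
"""


def tables_without_keys(tables, constraints):
    """Same logic in Python, for rows already fetched from information_schema.

    tables:      iterable of (schema, table) pairs
    constraints: iterable of (schema, table, constraint_type) triples
    Returns a sorted list of (schema, table) pairs lacking a PK/UNIQUE constraint.
    """
    keyed = {
        (schema, table)
        for schema, table, ctype in constraints
        if ctype in ("PRIMARY KEY", "UNIQUE")
    }
    return sorted(set(tables) - keyed)
```

Any table this reports that also receives many row-based UPDATE or DELETE events is a candidate for the table-scan behaviour described in the bug report.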
