Lagging replication: Slave_SQL_Running_State: Waiting for dependent transaction to commit

mag1kan1n · July 11, 2023, 10:36am

Good afternoon.

We have a cluster MySQL 8.0.32-24 for Linux on x86_64 (Percona Server, Release 24, Revision e5c6e9d2) Master/Slave. Database size is 916GB. Relation is lagging, in show slave Status\G shows Slave_SQL_Running_State: Waiting for dependent transaction to commit.

The lag floating for the last 24 hours rose to 16k and dropped to Seconds_Behind_Master: 8021.

We tried to change the parameter replica_parallel_workers=1, but did not help.

Could you help with any direction or hints for improvements.

Here is the config

[mysqld]

datadir=/var/lib/mysql

socket=/var/lib/mysql/mysql.sock

bind-address = 0.0.0.0

user=mysql

symbolic-links=0

log_timestamps = SYSTEM

log_error = /var/log/mysql/error.log

log_bin_trust_function_creators = 1

sql_mode = "ERROR_FOR_DIVISION_BY_ZERO,NO_ZERO_IN_DATE,NO_ENGINE_SUBSTITUTION"

log_bin = binlog

log_bin_index = binlog_list

max_binlog_size = 1G

expire_logs_days = 5

binlog_row_image = MINIMAL

binlog-do-db = dbname

server-id = 2

relay-log = /var/lib/mysql/slave-relay-bin

relay-log-index = /var/lib/mysql/slave-relay-bin.index

max_connections = 100

innodb_doublewrite = 0

sync_binlog = 1

wait_timeout=150

interactive_timeout=7200

back_log = 4096

slave-skip-errors = 1062,1396

key_buffer_size = 512M

max_allowed_packet = 1024M

table_open_cache = 20000

table_definition_cache = 10000

sort_buffer_size = 2M

read_buffer_size = 1M

read_rnd_buffer_size = 1M

net_buffer_length = 32K

group_concat_max_len = 4K

join_buffer_size = 320M

max_join_size = 1024M

tmp_table_size = 4G

max_heap_table_size = 8G

thread_cache_size = 80

thread_stack = 4M

innodb_buffer_pool_size = 72G

innodb_buffer_pool_instances = 16

innodb_log_file_size = 1G

innodb_log_buffer_size = 64M

innodb_lock_wait_timeout = 60

innodb_file_per_table = 1

innodb_page_cleaners = 8

innodb_read_io_threads = 32

innodb_write_io_threads = 8

innodb_flush_method = O_DSYNC

innodb_flush_log_at_trx_commit = 2

innodb_io_capacity = 100

skip-external-locking

#innodb_adaptive_hash_index=NO

#replica_parallel_workers=4

slave_sql_verify_checksum=OFF

Denis_Subbota · July 11, 2023, 11:34am

Hello mag1kan1n,
It’s expected to see in process list Waiting for dependent transaction to commit when using multithreaded replication, as not all transactions may be applied in parallel ( it’s consistency cost )

The first thing I would like you to suggest is to relax ACID settings to reduce load on IO system and speedup the replication process:

SET GLOBAL innodb_flush_log_at_trx_commit=2,sync_binlog=0;

But it would help if you considered the risk of the losing all transactions for the last 1 minute in case of crash of mysql, so it’s not good option to set on writer node in case if you are using this node as replica and the writer as well. However if it’s pure replica, it’s one of the best way to increase replication speed.

Regards,
Denis Subbota.
Managed Services, Percona.

Denis_Subbota · July 11, 2023, 11:36am

also you may want to review

innodb_log_file_size

mag1kan1n · July 11, 2023, 11:43am

Thanks, let’s try it. We also get in Slave_SQL_Running_State: Waiting for replica workers to process their queues

CTutte · July 11, 2023, 12:13pm

Hi mag1kan1n,

There are multiple causes that can make a replica fall behind. You should check if there is CPU starvation, memory swapping , disk saturation else replication lagging behind might be due to contention. Make sure no subsystem is saturated because I see some unusually large buffer configuration and it’s possible there is swapping or high memory pressure (I am specially looking at you join_buffer_size = 320M ).

Since you mentioned parallel replication, if you want to tune replica_parallel_workers please check Estimating potential for MySQL 5.7 parallel replication .

Note that even if you set multiple parallel workers, at times only 1 (or a few) transactions can be concurrently executing and might make a replica fall behind. For example DDLs and very large transactions will make the replica be single threaded for a while. You can find out parallel replication efficiency with the above blogpost

If replication is mostly single threaded then you should tune your workload rather than tune MySQL configuration.
For running DDLs on the primary you should use pt-online-schema-change pt-online-schema-change — Percona Toolkit Documentation
if you have very large transactions then you should split them into smaller chunks. I.e rather than deleting 1M rows in 1 transaction, commit every 10k rows

You can also try setting innodb_flush_method = O_DIRECT rather than O_DSYNC

Last, remember that every parameter that you configure in the database will be a tradeoff between using more resources or relaxing consistency at the cost of performance. Relaxing consistency (disabling innodb_doublewrite like you did) can cause corruption in the event of a crash so you should not change parameters lightly as they can cause more harm than good

Regards

mag1kan1n · July 11, 2023, 2:26pm

Thanks, made innodb_flush_method = O_DIRECT, will monitor the changes. So far this is the situation for an hour -

matthewb · July 12, 2023, 12:54am

You are throttling yourself:

innodb_io_capacity = 100   <-- change to default 200

tmp_table_size = 4G        <-- The actual value used will be the smaller
max_heap_table_size = 8G   <-- of these two. You should make them the same.

sync_binlog = 1  <-- causes LOTS of disk IO. You should set to 1000 so it syncs less often

mag1kan1n · July 12, 2023, 7:10am

Hi all, we were helped by including innodb_flush_method = O_DIRECT instead of O_DSYNC. Now there is no lag on the replica. But as far as I understand from the description, this is a tradeoff between fault tolerance and speed? Here’s a 24-hour graph -

mag1kan1n · July 12, 2023, 8:08am

Thanks for the advice, we’ll give it a try

matthewb · July 12, 2023, 2:00pm

The different between O_DSYNC and O_DIRECT is that one uses filesystem buffers, and the other does not. Technically, yes, this can be less fault tolerance because you are skipping the filesystem’s journal, but InnoDB has its own protection against this which is the doublewrite buffer. Ideally, you would turn the dblwb on and use O_DIRECT for maximum performance with data reliability.

mag1kan1n · July 12, 2023, 7:47pm

Thank you so much for the advice. Now with parameters

slave_sql_verify_checksum = ON
innodb_flush_method = O_DIRECT

There is no replication lag.

Topic		Replies	Views
A Replication lag on Percona Server 8.0.29-21 Percona Distribution for MySQL percona , new-release	4	1330	January 10, 2023
Percona Master-Master lag issues Other MySQL® Questions	3	995	March 23, 2015
MySQL replication slave parallel workers only work 1 worker MySQL & MariaDB	19	3820	June 22, 2024
Replication lag after upgrade to Percona Server 8.0	9	2823	January 25, 2023
Laggy master-master gtid replication in 5.7.32 Percona Server for MySQL 5.7	2	891	November 3, 2021

Lagging replication: Slave_SQL_Running_State: Waiting for dependent transaction to commit

Related topics