sudden transaction rate slowdown during sysbench test

split-brain · August 13, 2020, 5:06am

Hi there!

I’m testing PXC 8 with Sysbench and ran into some weird problem, where after about and hour of testing, the QPS rate went down from approx. 10000 qps to approx. 2000 qps. From what I’ve observed so far, the only suspicious value is TCP retransmission rate, which (at the time of slowdown) went from approx. 0 ops for all nodes to approx. 3 ops for node1 and node2, to approx. 13 ops for node3 (the writer node). At 8:47 UTC node1 sent couple of FC messages and at 8:49 UTC the cluster latency went up from around 1.5ms to around 20ms and the qps rate went down from approx. 10000 qps to approx. 2000 qps. From 8:49 UTC there were no additional FC messages being sent and the cluster just almost stalled.
This actually happened several times, but now I can document it. Can you please point me to the cause of this problem?

There are 3 PXC nodes in one AWS region, each residing in separate AZ. The fourth node is running ProxySQL and I’m running Sysbench on that node locally. ProxySQL has no query rules defined (no R/W split), only simple mysql_galera_hostgroups set with max_writers=1 (node3 is the writer here, node1 and node2 is only applying wsrep). PMM is running on fifth node. All of them are t3.medium type with 4GB of RAM.

sysbench /usr/share/sysbench/oltp_read_write.lua --db-driver=mysql --mysql-host=127.0.0.1 --mysql-user='sbuser' --mysql-password='sbpass' --mysql-port=6033 --mysql-db=sbtest --tables=1 --table_size=1000000 --db-ps-mode=disable --threads=16 --report-interval=1 --time=3600 --skip-trx=off --mysql-ignore-errors=all run

sysbench_report_from_the_time_of_slowndown.txt (13.1 KB)

node1-myq_status_from_the_time_of_slowndown…txt (48.7 KB)

split-brain · August 17, 2020, 6:08am

I’m testing it again and now I have only 2 nodes in the cluster. After some time there is surge at receive/send queue and cluster hangs due to FC. mysqld.log shows some gcache pages being created/deleted. shall I increase the gcache size?

2020-08-17T11:04:04.333437Z 0 [Note] [MY-000000] [Galera] Created page /var/lib/mysql/gcache.page.000036 of size 134217728 bytes
2020-08-17T11:04:04.534960Z 0 [Note] [MY-000000] [Galera] Deleted page /var/lib/mysql/gcache.page.000035
2020-08-17T11:06:25.791539Z 0 [Note] [MY-000000] [Galera] Created page /var/lib/mysql/gcache.page.000037 of size 134217728 bytes
2020-08-17T11:06:26.007878Z 0 [Note] [MY-000000] [Galera] Deleted page /var/lib/mysql/gcache.page.000036
2020-08-17T11:08:47.387363Z 0 [Note] [MY-000000] [Galera] Created page /var/lib/mysql/gcache.page.000038 of size 134217728 bytes
2020-08-17T11:08:47.544381Z 0 [Note] [MY-000000] [Galera] Deleted page /var/lib/mysql/gcache.page.000037
2020-08-17T11:11:09.973613Z 0 [Note] [MY-000000] [Galera] Created page /var/lib/mysql/gcache.page.000039 of size 134217728 bytes
2020-08-17T11:11:10.269839Z 0 [Note] [MY-000000] [Galera] Deleted page /var/lib/mysql/gcache.page.000038
2020-08-17T11:13:31.655732Z 0 [Note] [MY-000000] [Galera] Created page /var/lib/mysql/gcache.page.000040 of size 134217728 bytes
2020-08-17T11:13:31.753867Z 0 [Note] [MY-000000] [Galera] Deleted page /var/lib/mysql/gcache.page.000039
2020-08-17T11:15:53.698146Z 0 [Note] [MY-000000] [Galera] Created page /var/lib/mysql/gcache.page.000041 of size 134217728 bytes
2020-08-17T11:15:53.805761Z 0 [Note] [MY-000000] [Galera] Deleted page /var/lib/mysql/gcache.page.000040
2020-08-17T11:18:16.027883Z 0 [Note] [MY-000000] [Galera] Created page /var/lib/mysql/gcache.page.000042 of size 134217728 bytes
2020-08-17T11:18:16.380256Z 0 [Note] [MY-000000] [Galera] Deleted page /var/lib/mysql/gcache.page.000041

Topic		Replies	Views
Poor sysbench performance - 3 node pxc versus single node community version Percona XtraDB Cluster 5.x	1	811	April 24, 2017
PXC - sysbench resutls to compare Percona XtraDB Cluster 5.x	3	1086	April 24, 2017
Strange performance numbers Percona XtraDB Cluster 5.x	1	511	June 22, 2013
too many FC msg sent from node1 Percona XtraDB Cluster 8.x	2	577	August 12, 2020
Slow Performance - PXC not utilizing CPU Percona XtraDB Cluster 8.x	6	818	August 11, 2023

sudden transaction rate slowdown during sysbench test

Related topics