Hi,
I have a Percona XtraDB Cluster with 3 nodes and this configuration:
Server version: 5.6.34-79.1-56 Percona XtraDB Cluster (GPL)
Release rel79.1
Revision 7c38350,
WSREP version 26.19,
I have several databases containing both small and huge tables. With the small tables there are no problems, but with the huge tables I see a very strange problem when I run a full-scan SELECT on them.
For example, if I execute on node1 and node2 the query "SELECT COUNT(*) FROM HugeTable WHERE time > '2020-01-01'" (I have no index on the time field), it takes a while but in the end it returns the total row count.
If I execute the same query on node3, the cluster hangs and other queries get stuck in "wsrep in pre-commit stage".
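As a possible workaround I could add an index on the time column so the range predicate no longer forces a full scan. A sketch only, using the table and column names from the query above; the index name is made up:

```sql
-- Add an index on the range-scanned column (name idx_time is arbitrary).
ALTER TABLE HugeTable ADD INDEX idx_time (`time`);

-- Check that the optimizer would now use it instead of a full scan.
EXPLAIN SELECT COUNT(*) FROM HugeTable WHERE `time` > '2020-01-01';
```

Note that in PXC this ALTER runs under total order isolation by default, so on a huge table it would block the whole cluster for its duration; an online tool such as pt-online-schema-change from Percona Toolkit may be preferable.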
During the execution of the query on node3 I checked the wsrep queue variables, with this result:
mysql> show status like '%queue%';
+----------------------------+------------+
| Variable_name | Value |
+----------------------------+------------+
| wsrep_local_send_queue | 0 |
| wsrep_local_send_queue_max | 63 |
| wsrep_local_send_queue_min | 0 |
| wsrep_local_send_queue_avg | 0.046914 |
| wsrep_local_recv_queue | 48 |
| wsrep_local_recv_queue_max | 18108 |
| wsrep_local_recv_queue_min | 0 |
| wsrep_local_recv_queue_avg | 119.696628 |
+----------------------------+------------+
So my question is: why does the local_recv_queue on that node grow during a SELECT statement? I ran the same check on the other nodes and this does not happen. What could be the reason, and what checks should I do?
These are the variables on the second node:
+----------------------------+----------+
| Variable_name | Value |
+----------------------------+----------+
| wsrep_local_send_queue | 0 |
| wsrep_local_send_queue_max | 73 |
| wsrep_local_send_queue_min | 0 |
| wsrep_local_send_queue_avg | 0.375133 |
| wsrep_local_recv_queue | 0 |
| wsrep_local_recv_queue_max | 1959 |
| wsrep_local_recv_queue_min | 0 |
| wsrep_local_recv_queue_avg | 0.264582 |
+----------------------------+----------+
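In case it helps, I can also collect the flow-control counters on each node while the SELECT runs; these are standard Galera status variables, so a sketch of the checks would be:

```sql
-- Whether (and how long) this node paused replication via flow control:
SHOW STATUS LIKE 'wsrep_flow_control%';

-- Current node state (should stay "Synced") and applier parallelism hint:
SHOW STATUS LIKE 'wsrep_local_state_comment';
SHOW STATUS LIKE 'wsrep_cert_deps_distance';
```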
Thank you, Salvo.