OK, so we finally got our cluster replicated.
However, we are now suffering from constant application hangs because connections stay in the "wsrep: in pre-commit stage" state for up to 2 minutes.
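A minimal sketch of how the stuck connections can be picked out, assuming the rows of SHOW FULL PROCESSLIST have already been fetched into dicts keyed by the usual column names (Id, Time, State). The function name find_stuck_commits, the 5-second threshold and the sample rows are made up for illustration; they are not part of PXC or Galera.

```python
from typing import Iterable, List, Mapping

# State string reported by the hanging connections.
STUCK_STATE = "wsrep: in pre-commit stage"


def find_stuck_commits(rows: Iterable[Mapping], min_seconds: int = 5) -> List[Mapping]:
    """Return processlist rows stuck in the pre-commit state, longest first.

    rows: dicts shaped like SHOW FULL PROCESSLIST output (assumed keys:
          'Id', 'Time', 'State'). min_seconds is an arbitrary cut-off.
    """
    stuck = []
    for row in rows:
        state = row.get("State") or ""        # State can be NULL/empty
        seconds = int(row.get("Time") or 0)   # Time may arrive as a string
        if state.startswith(STUCK_STATE) and seconds >= min_seconds:
            stuck.append(row)
    return sorted(stuck, key=lambda r: int(r.get("Time") or 0), reverse=True)


if __name__ == "__main__":
    # Invented sample rows, only to show the shape of the input.
    sample_rows = [
        {"Id": 11, "Time": 97, "State": "wsrep: in pre-commit stage"},
        {"Id": 12, "Time": 3, "State": "wsrep: in pre-commit stage"},
        {"Id": 13, "Time": 120, "State": "wsrep: in pre-commit stage"},
        {"Id": 14, "Time": 400, "State": ""},
    ]
    for row in find_stuck_commits(sample_rows):
        print(row["Id"], row["Time"])
```

Running this against a snapshot taken during a hang would show how many connections are blocked and for how long, which is useful detail to include when asking for help.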
Our hardware is all brand new and carries no other load: 24GB RAM, quad-core Xeon, SSD (ZFS mirror), 10Gbit network. Latency between the 3 nodes in our cluster is extremely low; we performed a 330GB SST in 20 minutes (roughly 275MB/s).
I’ve tried tweaking flow control settings, and file limits are all as recommended (or higher).
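For reference, a rough sketch of how the flow control counters could be sanity-checked, assuming the output of SHOW GLOBAL STATUS LIKE 'wsrep_%' has been loaded into a dict. The status variable names are standard Galera ones; the function name flow_control_hints and both thresholds are arbitrary assumptions, not recommended values.

```python
from typing import List, Mapping


def flow_control_hints(
    status: Mapping[str, str],
    paused_limit: float = 0.1,   # assumed threshold: fraction of time paused
    queue_limit: float = 1.0,    # assumed threshold: average queue length
) -> List[str]:
    """Return the names of wsrep status variables that exceed the given limits.

    status: mapping of variable name to value, as returned by
            SHOW GLOBAL STATUS (values usually arrive as strings).
    """
    limits = {
        "wsrep_flow_control_paused": paused_limit,
        "wsrep_local_recv_queue_avg": queue_limit,
        "wsrep_local_send_queue_avg": queue_limit,
    }
    flagged = []
    for name, limit in limits.items():
        value = float(status.get(name, 0.0))  # missing variables count as 0
        if value > limit:
            flagged.append(name)
    return flagged


if __name__ == "__main__":
    # Invented values, only to show the call.
    example = {
        "wsrep_flow_control_paused": "0.45",
        "wsrep_local_recv_queue_avg": "0.0",
        "wsrep_local_send_queue_avg": "0.2",
    }
    print(flow_control_hints(example))
```

If nothing is flagged on any node while the hangs are happening, flow control is probably not the cause and the settings tweaks would not be expected to help.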
I'm running out of options to try.
If I tear down 2 nodes, leaving a single standalone node, performance is fine. As soon as I add a 2nd or 3rd node, the hangs described above return.
Any help would be much appreciated.
PXC version 5.7.14-8-57
Galera 3.17 (r447d194)
We’re currently only writing to a single master. We were originally writing to multiple masters but were experiencing transaction locks and hangs, so we reverted to writing to a single master to try to debug the issue.