Hi, we setup 3 nodes cluster to handle max connection of 500 application, when point the traffic to cluster, all the query hang in at wsrep pre-commit stage, most query take more than 3 seconds to finish, we run same query in a single db, it only take 0.063 s in average, so it has something to do this commit process, it only happened when multiple connections happened,
machines:3x ubuntu with 4 core 15G memory in SSD 500G max IO 2300 latest version of cluster, cpu memory usage looks fine
any suggestions?
here are wsrep status
[TABLE]
[TR]
[TD]wsrep_local_state_uuid[/TD]
[TD]7afb8427-cb98-11e5-986d-33dae198bfb5[/TD]
[/TR]
[TR]
[TD]wsrep_protocol_version[/TD]
[TD]4[/TD]
[/TR]
[TR]
[TD]wsrep_last_committed[/TD]
[TD]1854809[/TD]
[/TR]
[TR]
[TD]wsrep_replicated[/TD]
[TD]15954[/TD]
[/TR]
[TR]
[TD]wsrep_replicated_bytes[/TD]
[TD]6972263[/TD]
[/TR]
[TR]
[TD]wsrep_received[/TD]
[TD]14761[/TD]
[/TR]
[TR]
[TD]wsrep_received_bytes[/TD]
[TD]6405341[/TD]
[/TR]
[TR]
[TD]wsrep_local_commits[/TD]
[TD]15413[/TD]
[/TR]
[TR]
[TD]wsrep_local_cert_failures[/TD]
[TD]541[/TD]
[/TR]
[TR]
[TD]wsrep_local_replays[/TD]
[TD]1[/TD]
[/TR]
[TR]
[TD]wsrep_local_send_queue[/TD]
[TD]0[/TD]
[/TR]
[TR]
[TD]wsrep_local_send_queue_avg[/TD]
[TD]0.074181[/TD]
[/TR]
[TR]
[TD]wsrep_local_recv_queue[/TD]
[TD]0[/TD]
[/TR]
[TR]
[TD]wsrep_local_recv_queue_avg[/TD]
[TD]57.142034[/TD]
[/TR]
[TR]
[TD]wsrep_flow_control_paused[/TD]
[TD]0.000000[/TD]
[/TR]
[TR]
[TD]wsrep_flow_control_sent[/TD]
[TD]0[/TD]
[/TR]
[TR]
[TD]wsrep_flow_control_recv[/TD]
[TD]0[/TD]
[/TR]
[TR]
[TD]wsrep_cert_deps_distance[/TD]
[TD]5.171123[/TD]
[/TR]
[TR]
[TD]wsrep_apply_oooe[/TD]
[TD]0.696407[/TD]
[/TR]
[TR]
[TD]wsrep_apply_oool[/TD]
[TD]0.000319[/TD]
[/TR]
[TR]
[TD]wsrep_apply_window[/TD]
[TD]7.473739[/TD]
[/TR]
[TR]
[TD]wsrep_commit_oooe[/TD]
[TD]0.000000[/TD]
[/TR]
[TR]
[TD]wsrep_commit_oool[/TD]
[TD]0.000319[/TD]
[/TR]
[TR]
[TD]wsrep_commit_window[/TD]
[TD]6.108963[/TD]
[/TR]
[TR]
[TD]wsrep_local_state[/TD]
[TD]4[/TD]
[/TR]
[TR]
[TD]wsrep_local_state_comment[/TD]
[TD]Synced[/TD]
[/TR]
[TR]
[TD]wsrep_cert_index_size[/TD]
[TD]907[/TD]
[/TR]
[TR]
[TD]wsrep_causal_reads[/TD]
[TD]0[/TD]
[/TR]
[TR]
[TD]wsrep_incoming_addresses[/TD]
[TD]10.0.2.8:3306,10.0.2.5:3306,10.0.2.4:3306[/TD]
[/TR]
[TR]
[TD]wsrep_cluster_conf_id[/TD]
[TD]3[/TD]
[/TR]
[TR]
[TD]wsrep_cluster_size[/TD]
[TD]3[/TD]
[/TR]
[TR]
[TD]wsrep_cluster_state_uuid[/TD]
[TD]7afb8427-cb98-11e5-986d-33dae198bfb5[/TD]
[/TR]
[TR]
[TD]wsrep_cluster_status[/TD]
[TD]Primary[/TD]
[/TR]
[TR]
[TD]wsrep_connected[/TD]
[TD]ON[/TD]
[/TR]
[TR]
[TD]wsrep_local_bf_aborts[/TD]
[TD]0[/TD]
[/TR]
[TR]
[TD]wsrep_local_index[/TD]
[TD]2[/TD]
[/TR]
[TR]
[TD]wsrep_provider_name[/TD]
[TD]Galera[/TD]
[/TR]
[TR]
[TD]wsrep_provider_vendor[/TD]
[TD]Codership Oy <info@codership.com>[/TD]
[/TR]
[TR]
[TD]wsrep_provider_version[/TD]
[TD]2.12(r318911d)[/TD]
[/TR]
[TR]
[TD]wsrep_ready[/TD]
[TD]ON[/TD]
[/TR]
[TR]
[TD]wsrep_thread_count[/TD]
[TD]9[/TD]
[/TR]
[/TABLE]