Hi!
After starting “pmm-admin start --all” everything works fine, for a few time. After 2-3 hours approximately, Query Analytics stop to show data. We have 2 servers for now, and they both have the same problem.
In /var/log/pmm-mysql-queries-0.log we can see:
[mysql] 2017/01/30 20:29:09 packets.go:59: unexpected EOF
[mysql] 2017/01/30 20:29:09 packets.go:386: busy buffer
2017/01/30 20:29:09.538585 ERROR qan-analyzer-1342087d driver: bad connection
2017/01/30 20:30:00.001315 WARNING qan-analyzer-1342087d-worker Interval out of sequence: got 186, expected 182
2017/01/30 20:30:08.733680 ERROR qan-analyzer-1342087d-worker Got class twice: registry_dev2 e1fecd37897b7375d76acae411ed0f5b
2017/01/30 20:31:00.011321 WARNING qan-analyzer-1342087d Skipping interval ‘187 2017-01-30 18:30:00 UTC to 2017-01-30 18:31:00 UTC (0-0)’ because interval ‘186 2017-01-30 18:29:00 UTC to 2017-01-30 18:30:00 UTC (0-0)’ is still being parsed
2017/01/30 20:32:00.005720 WARNING qan-analyzer-1342087d Skipping interval ‘188 2017-01-30 18:31:00 UTC to 2017-01-30 18:32:00 UTC (0-0)’ because interval ‘186 2017-01-30 18:29:00 UTC to 2017-01-30 18:30:00 UTC (0-0)’ is still being parsed
2017/01/30 20:33:00.001729 WARNING qan-analyzer-1342087d Skipping interval ‘189 2017-01-30 18:32:00 UTC to 2017-01-30 18:33:00 UTC (0-0)’ because interval ‘186 2017-01-30 18:29:00 UTC to 2017-01-30 18:30:00 UTC (0-0)’ is still being parsed
[mysql] 2017/01/30 20:33:28 packets.go:59: unexpected EOF
[mysql] 2017/01/30 20:33:28 packets.go:386: busy buffer
[mysql] 2017/01/30 20:33:28 connection.go:307: invalid connection
2017/01/30 20:33:28.778027 ERROR qan-analyzer-1342087d driver: bad connection
2017/01/30 20:34:00.003532 WARNING qan-analyzer-1342087d-worker Interval out of sequence: got 190, expected 187
2017/01/30 20:34:04.971836 ERROR qan-analyzer-1342087d-worker Got class twice: registry_dev2 e1fecd37897b7375d76acae411ed0f5b
After “pmm-admin restart --all” there is no errors for some time, Query Analytics works fine. But not for long. Same errors.
Server and clients was reinstalled by documentation 3 times. No effect.
How can i fix this problem?
So, with “–query-source slowlog” everything fine. No problems since yesterday morning, it is about 20 hours.
Another problem - slowquery.log grow very fast, 890M from time of switching from performance_schema. Can i rotate it without loss of collected data in PMM?
after some research
please don’t use slowlog_rotation in PS, it is BETA and looks like it has some strange handling inside PMM.
please write immediately if you have any issues with log rotate.