extremely high cpu cost

romber · August 27, 2017, 9:22pm

Hi, after installing PMM and running it for some days, Prometheus CPU usage is extremely high: it always uses nearly 100% CPU,
and displaying the CPU information of a database server is very slow…

The Prometheus log is full of:
level=warning msg="Error on ingesting samples with different value but same timestamp" numDropped=1 source="scrape.go:537"
level=warning msg="Scrape duration sample discarded" error="sample with repeated timestamp but different value" sample=scrape_duration_seconds{instance="", job="mysql"} => 0.546582099 @[1503888448.092] source="scrape.go:590"

I’m using PMM 1.2.0 now. Please kindly help me solve this problem.
Thanks in advance.
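
To gauge how often the two warnings quoted above occur, and for which scrape jobs, the log can be summarised with standard tools. This is only a minimal sketch: the file name prometheus.log and the function names are hypothetical, and it assumes the Prometheus log has already been saved to a plain text file with one message per line.

```shell
#!/bin/sh
# Sketch only: prometheus.log is a hypothetical file holding the saved log.

# Count lines that contain either duplicate-timestamp warning.
count_dup_ts_warnings() {
    grep -c \
        -e 'different value but same timestamp' \
        -e 'repeated timestamp but different value' \
        "$1"
}

# Tally the same warnings per job label, most frequent first.
# Only messages that carry a job="..." label are counted here.
count_dup_ts_by_job() {
    grep \
        -e 'different value but same timestamp' \
        -e 'repeated timestamp but different value' \
        "$1" |
        grep -o 'job="[^"]*"' | sort | uniq -c | sort -rn
}

if [ -f prometheus.log ]; then
    count_dup_ts_warnings prometheus.log
    count_dup_ts_by_job prometheus.log
fi
```

If one job accounts for most of the warnings, that narrows down which targets to look at first.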

romber · August 27, 2017, 10:01pm

I also uploaded a screenshot of iotop.

Michael_Coburn · August 28, 2017, 7:45am

Hi romber

Your server seems adequately provisioned for hardware (16 cores, 32GB RAM), but a couple of observations:

1. Do you have all servers in sync with time? Check with pmm-admin check-network to verify that your clocks are synchronised.
2. Did you do the docker-based installation? You have quite a bit of unused RAM that could be allocated to Prometheus - see Percona Monitoring and Management.

If you continue to have issues with high CPU, please send us the output of our pmm-info.sh debugging tool, which you can read about here:

https://www.percona.com/blog/2016/11/01/percona-monitoring-and-management-pmm-information-script/

The essential steps are:

wget https://raw.githubusercontent.com/Percona-QA/percona-qa/master/pmm-info.sh
vi pmm-info.sh
sh ./pmm-info.sh

romber · August 29, 2017, 6:04am

Hi Michael Coburn,
thanks for your answer.

"Do you have all servers in sync with time? Check with pmm-admin check-network to verify that your clocks are synchronised"
Yes, I’ll upload the output of the pmm-admin check-network command. I ran the command on one of the MySQL servers.

"Did you do the docker-based installation?"
Yes, I used the docker-based installation, and I have already allocated 8GB of RAM to PMM.
As this server is also used for recovery tests, although that happens rarely, I have to leave some RAM for recovering MySQL databases.
Is 8GB of RAM not enough for pmm-server? I’m monitoring about 20~30 MySQL servers and the number is still growing…
How much RAM does it need?
I also found that Prometheus generates a lot of I/O. Is that normal?

About pmm-info.sh:
I’ll upload the output of the pmm-info.sh script. I ran it on pmm-server.

romber · September 1, 2017, 8:25pm

Is anyone still keeping an eye on this?

Michael_Coburn · September 6, 2017, 6:08am

Hi romber, my apologies for the delay.

I’d like to ask that you upgrade to the latest PMM Server (1.2.2), because in 1.2.1 we introduced a memory fix that may be related to your high CPU utilisation. See Percona Monitoring and Management:

    PMM-1232: Update the default value of the METRICS_MEMORY configuration setting

    Previous versions of PMM Server used a different value for the METRICS_MEMORY configuration setting which allowed Prometheus to use up to 768MB of memory.

    PMM Server 1.2.0 used the storage.local.target-heap-size setting, its default value being 256MB. Unintentionally, this value reduced the amount of memory that Prometheus could use. As a result, the performance of Prometheus was affected.

    To improve the performance of Prometheus, the default setting of storage.local.target-heap-size has been set to 768 MB.

Can you let us know if this reduces the CPU utilisation? We await your feedback!

romber · September 10, 2017, 8:57pm

Hi Michael Coburn, thanks for your reply.
I’ve upgraded both the PMM server and the client to 1.2.2, increased METRICS_MEMORY to 16GB, and decreased METRICS_RETENTION to 360 hours,
but the high CPU usage on the PMM server still exists…
Here is what I run:
docker run -d \
  -p 80:80 \
  --volumes-from pmm-data \
  --name pmm-server \
  --restart always \
  -e SERVER_USER=pmm \
  -e SERVER_PASSWORD=xxx \
  -e METRICS_MEMORY=17179869184 \
  -e METRICS_RETENTION=360h \
  percona/pmm-server:1.2.2
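
As a sanity check on the numbers in the command above, the following minimal sketch shows how 16GB and 360 hours convert into other units. The helper names are hypothetical and the script only does arithmetic. It is worth confirming in the PMM documentation for your version which unit METRICS_MEMORY expects: 17179869184 is 16 GiB written in bytes, and if the setting is read in kilobytes instead, the same 16 GiB would be written as 16777216.

```shell
#!/bin/sh
# Sketch only: unit conversions for the values used in the docker run command.
# Which unit METRICS_MEMORY expects is an assumption to verify in the PMM docs.

# GiB -> bytes
gib_to_bytes() { echo $(( $1 * 1024 * 1024 * 1024 )); }

# GiB -> kilobytes (KiB)
gib_to_kib() { echo $(( $1 * 1024 * 1024 )); }

# hours -> whole days
hours_to_days() { echo $(( $1 / 24 )); }

gib_to_bytes 16     # 17179869184
gib_to_kib 16       # 16777216
hours_to_days 360   # 15
```

So METRICS_RETENTION=360h keeps 15 days of metrics, and the memory value matches 16 GiB only if it is interpreted as bytes.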
