Heartbeat failed after max retries

jamoser · May 5, 2022, 8:47pm

The MongoDB Cluster has been running not too badly but since a few days I get the above error. Version in use :

crVersion: 1.7.0
image: percona/percona-server-mongodb:4.4.3-5

Since this version has “close most files” after 27h hard coded, we have to restart it every 24h. Each restart of a pod takes about 30min. But right after the restart I see in the log the above message.

Heartbeat failed after max retries : what is this about and how can I control this
Is there any “limit” or why does the Pod restart ?

Topic		Replies	Views
Plans for better handling of pod restarts affect on in progress backup? Percona Backup for MongoDB closed-no-reply , kubernetes	0	459	November 7, 2023
Primary replicaset constantly restarts Percona Operator for MongoDB percona , mongodb	4	1108	July 12, 2021
Percona Mongodb operator restarts with error "fatal error: concurrent map read and map write" Percona Operator for MongoDB percona , bugs , mongodb	2	54	December 23, 2024
Mongodb node loops on restart with an OOM Percona Operator for MongoDB	2	219	July 8, 2024
Kubernetes PSMDB shutdown signal 15 Percona Operator for MongoDB percona , mongodb , kubernetes	11	2315	September 7, 2021

Heartbeat failed after max retries

Related topics