Restart of mongod pods - WiredTiger Cache, Percona Memory Engine

jamoser · July 14, 2021, 8:43pm

I was wondering what happens if a mongod pod gets restarted (for ex. simple liveness probe failure or moved to another node … both viable reasons) and 100% of the memory is used for WT Cache and/or Percona Memory Engine.

Is it ensured that no data is lost ?

Further when the pod starts up then it takes very long until these checkpoints are reached. We had to set like

livenessProbe:
  initialDelaySeconds: 300
  failureThreshold: 10

until mongod was able to start. Now the disk is like 5% full - I just wonder what happens if the disk is full - most likely it will take like 1h.

Is there a possibility to make the (re)start much faster ?

jamoser · July 14, 2021, 11:24pm

It looks like mongod always gets shutdown the “hard” way …

{“t”:{“$date”:“2021-07-14T23:15:06.750+00:00”},“s”:“W”, “c”:“STORAGE”, “id”:22271, “ctx”:“initandlisten”,“msg”:“Detected unclean shutdown - Lock file is not empty”,“attr”:{“lockFile”:“/data/db/mongod.lock”}}

jamoser · July 21, 2021, 1:05pm

https://jira.mongodb.org/browse/SERVER-43664

Topic		Replies	Views
Slow startup of mongod Percona Operator for MongoDB	7	2811	July 21, 2021
Percona Cluster crashed and does not want to startup Percona Operator for MongoDB	4	805	July 14, 2021
Pods occasionally fail readiness check, can't find out why, but cluster otherwise works? Percona Operator for MongoDB percona , mongodb	4	174	March 14, 2025
How does Percona mongodb operator handles corrupted disk pods? Percona Operator for MongoDB	5	976	April 22, 2022
Mongodb node loops on restart with an OOM Percona Operator for MongoDB	2	297	July 8, 2024

Restart of mongod pods - WiredTiger Cache, Percona Memory Engine

Related topics