.dropCollection() potential bug

There appears to be a bug where calling .dropCollection() and then immediately recreating the collection does not release space back to the OS. We hit this when a collection was dropped and immediately recreated with the same indexes in order to free space, but a server restart was then required before the space was released back to the OS.

The MongoDB server was in a replica set.

Is this a known bug?

It might be a bug. Please open a report at jira.percona.com with the details so the dev team can take a look.

Hi @rkamath, have you been able to create a bug report for this?

https://perconadev.atlassian.net/browse/PSMDB-1480
I created it here, although I’m not sure if this was the right place or not.

Perfect! It’s in the right project - thank you - I was not able to find it. I’ll provide you an update as soon as I can.


@rkamath - this is expected behavior in MongoDB. MongoDB seldom returns disk space to the operating system after deletes; the freed disk pages are marked as reusable and reserved for future writes. The compact command lets WiredTiger return this space (similar to PostgreSQL’s VACUUM). Note that the compact command needs to be executed on every node in the cluster. Alternatively, you can resync a replica set member (which is effectively what you did by restarting the node).
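As a rough sketch, compact can be run from mongosh like this. The database name mydb, collection name mycoll, and host node1.example.com are placeholders; run it against each replica set member in turn (on recent MongoDB versions secondaries accept compact directly, but check the behavior for your version before running it in production):

```shell
# Repeat against every replica set member.
# "node1.example.com", "mydb", and "mycoll" are placeholder names.
mongosh --host node1.example.com --eval '
  db = db.getSiblingDB("mydb");
  // compact defragments the collection and lets WiredTiger
  // release reusable space back to the operating system
  printjson(db.runCommand({ compact: "mycoll" }));
'
```

Compact holds resources on the node while it runs, so it is usually done one member at a time rather than cluster-wide at once.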

See also MongoDB docs:

How do I reclaim disk space in WiredTiger?

The WiredTiger storage engine maintains lists of empty records in data files as it deletes documents. This space can be reused by WiredTiger, but will not be returned to the operating system unless under very specific circumstances.

The amount of empty space available for reuse by WiredTiger is reflected in the output of db.collection.stats() under the heading wiredTiger.block-manager.file bytes available for reuse.

To allow the WiredTiger storage engine to release this empty space to the operating system, you can de-fragment your data file. This can be achieved by resyncing a replica set member or using the compact command.
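Before deciding whether compact is worth running, you can check how much reusable space WiredTiger is holding. A minimal sketch (again, mydb and mycoll are placeholder names, and this assumes a reachable mongod on the default port):

```shell
mongosh --eval '
  db = db.getSiblingDB("mydb");      // placeholder database name
  const s = db.mycoll.stats();       // placeholder collection name
  // bytes WiredTiger can reuse internally but has not yet
  // returned to the operating system
  print(s.wiredTiger["block-manager"]["file bytes available for reuse"]);
'
```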

Hello @rkamath

If I understand correctly the issue is about filesystem space not freeing immediately after drop.

This is probably because MongoDB has a two-phase drop implementation. In the first phase, the collection and its indexes are removed from the metadata (but the files are still preserved). In the second phase, the files are removed from the filesystem. In the most common case the second phase is deferred until the next checkpoint. By default, MongoDB takes a checkpoint every 60 seconds.

That means it does not matter whether you recreate a collection with the same name or not; in either case the old collection’s files will be removed from the filesystem after a delay of up to 60 seconds.
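If you don’t want to wait for the next scheduled checkpoint, a sketch like the following should shorten the window, assuming the documented behavior that the fsync command forces a checkpoint on WiredTiger (mydb and mycoll are placeholder names):

```shell
mongosh --eval '
  // drop the collection (placeholder names)
  db.getSiblingDB("mydb").mycoll.drop();
  // fsync forces a checkpoint on WiredTiger, after which the
  // dropped collection files should be deleted from disk
  // instead of waiting up to 60 seconds for the next checkpoint
  printjson(db.adminCommand({ fsync: 1 }));
'
```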

Please let us know if this corresponds to your observations.

I wasn’t able to test the exact delay, but I’ll try allowing a longer delay next time to see if that works. Thanks.