Hi team, we are exploring PBM for a 100TB database that will be sharded in PSMDB. The backup would go to S3.
Our POC on a 10TB database with different uploadPartSize and maxUploadParts settings on these EC2 instance types took:
m7i.16xlarge = 3 hours
m7i.2xlarge = 14 hours
With the above numbers, backup duration for 100TB would not be ideal. We are looking for options to make the PBM backup faster.
Would disabling PBM compression make backup faster? Or any other compression method faster than default s2?
Are there any other tuning parameters for PBM to make the backups faster?
Also, we have our backup script that splits largest files into multiple parts and copies those parts to S3 parallely.
What will be the consistent way to backup oplog in shards for point-in-time recovery with the above file copy script?
Is Hot Backup faster than PBM? Can we use with PBM’s pitr.oplogOnly option to achieve PITR?
For physical backups, PBM already handles the splitting and parallel copy of each shard. You can tune the upload part size, but we use numCPU/2 for multipart upload (feel free to vote here Jira if you think is worth having)
Also see
Btw we typically suggest having each shard not bigger than 1 or 2 TB as best practice.