Hey all,
Hoping someone can help me as I have exhausted all ideas I can thing of. I have percona for mongodb running in k8s. Taking nightly backups. We want to create a reporting DB and restore our backups into that DB. I have the new reporting DB up using all the same passwords, aws creds etc. and a logical restore kubectl apply -f deploy/backup/restore-logical.yaml seems to work with backup-agent 2.5.0.
However after a short while, the mongos instances start failing because of a tripwire error. Upon research this looked to be a known issue and resolved in 2.9.x or 2.10.0. However when I try running the restore after upgrading to 2.10.0 I keep getting the following errors;
Error: waiting for start: cluster failed: waiting for start: cluster failed: failed to ensure snapshot file 2025-02-11T00:00:21Z/amfam/metadata.json: get S3 object header: Forbidden: Forbidden
status code: 403, request id: 5WHY4MSZWQCM4PAH, host id: GVS2sxMgU2VsabSwVdydbzYyhQbWPMbzvthYBSdcnvk245G3nkrC7KZDOLItQ7AWz9+u5AwXJxI=
I have tried updating both the user policy and the bucket policy to acomodate. they both already had getobject and getobjectacl etc but even moved to s3:* just to see if it resolved the issue but still no luck. I have the KMS key id, the aws user info etc in secrets and they match the cluster I am trying to restore from. In fact i can just downgrade the pbm-agent to 2.5.0 and everything works fine from the restore perspective but then I have the tripwire error issue.
Example restore yaml;
apiVersion: psmdb.percona.com/v1
kind: PerconaServerMongoDBRestore
metadata:
name: logical-restore-from-main-1
namespace: psmdb-dev-reports
spec:
clusterName: psmdb-dev-reports-psm
storageName: s3-us-east-logical
backupSource:
destination: s3://<my_bucket>/logical/2025-02-11T00:00:21Z
pbm status
Cluster:
========
amfam:
- psmdb-dev-reports-psm-amfam-0.psmdb-dev-reports-psm-amfam.psmdb-dev-reports.svc.cluster.local:27017 [P]: pbm-agent [v2.9.1] OK
- psmdb-dev-reports-psm-amfam-1.psmdb-dev-reports-psm-amfam.psmdb-dev-reports.svc.cluster.local:27017 [S]: pbm-agent [v2.9.1] OK
- psmdb-dev-reports-psm-amfam-2.psmdb-dev-reports-psm-amfam.psmdb-dev-reports.svc.cluster.local:27017 [S]: pbm-agent [v2.9.1] OK
rs0:
- psmdb-dev-reports-psm-rs0-0.psmdb-dev-reports-psm-rs0.psmdb-dev-reports.svc.cluster.local:27017 [P]: pbm-agent [v2.9.1] OK
- psmdb-dev-reports-psm-rs0-1.psmdb-dev-reports-psm-rs0.psmdb-dev-reports.svc.cluster.local:27017 [S]: pbm-agent [v2.9.1] OK
- psmdb-dev-reports-psm-rs0-2.psmdb-dev-reports-psm-rs0.psmdb-dev-reports.svc.cluster.local:27017 [S]: pbm-agent [v2.9.1] OK
cfg:
- psmdb-dev-reports-psm-cfg-0.psmdb-dev-reports-psm-cfg.psmdb-dev-reports.svc.cluster.local:27017 [P]: pbm-agent [v2.9.1] OK
- psmdb-dev-reports-psm-cfg-1.psmdb-dev-reports-psm-cfg.psmdb-dev-reports.svc.cluster.local:27017 [S]: pbm-agent [v2.9.1] OK
- psmdb-dev-reports-psm-cfg-2.psmdb-dev-reports-psm-cfg.psmdb-dev-reports.svc.cluster.local:27017 [S]: pbm-agent [v2.9.1] OK
PITR incremental backup:
========================
Status [OFF]
Currently running:
==================
(none)
Backups:
========
S3 us-east-2 s3:///<my_bucket>/logical
Snapshots:
2025-02-13T00:00:21Z 9.69MB <logical> [restore_to_time: 2025-02-13T00:01:47Z]
2025-02-11T00:00:21Z 9.65MB <logical> [restore_to_time: 2025-02-11T00:01:37Z]
pbm logs
2025-07-09T13:39:28Z E [amfam/psmdb-dev-reports-psm-amfam-0.psmdb-dev-reports-psm-amfam.psmdb-dev-reports.svc.cluster.local:27017] [restore/2025-07-09T13:39:26.191458116Z] restore: failed to ensure snapshot file 2025-02-11T00:00:21Z/amfam/metadata.json: get S3 object header: Forbidden: Forbidden
status code: 403, request id: 5WHY4MSZWQCM4PAH, host id: GVS2sxMgU2VsabSwVdydbzYyhQbWPMbzvthYBSdcnvk245G3nkrC7KZDOLItQ7AWz9+u5AwXJxI=
2025-07-09T13:39:28Z E [rs0/psmdb-dev-reports-psm-rs0-0.psmdb-dev-reports-psm-rs0.psmdb-dev-reports.svc.cluster.local:27017] [restore/2025-07-09T13:39:26.191458116Z] restore: waiting for start: cluster failed: failed to ensure snapshot file 2025-02-11T00:00:21Z/amfam/metadata.json: get S3 object header: Forbidden: Forbidden
status code: 403, request id: 5WHY4MSZWQCM4PAH, host id: GVS2sxMgU2VsabSwVdydbzYyhQbWPMbzvthYBSdcnvk245G3nkrC7KZDOLItQ7AWz9+u5AwXJxI=
2025-07-09T13:39:28Z E [cfg/psmdb-dev-reports-psm-cfg-0.psmdb-dev-reports-psm-cfg.psmdb-dev-reports.svc.cluster.local:27017] [restore/2025-07-09T13:39:26.191458116Z] restore: waiting for start: cluster failed: waiting for start: cluster failed: failed to ensure snapshot file 2025-02-11T00:00:21Z/amfam/metadata.json: get S3 object header: Forbidden: Forbidden
status code: 403, request id: 5WHY4MSZWQCM4PAH, host id: GVS2sxMgU2VsabSwVdydbzYyhQbWPMbzvthYBSdcnvk245G3nkrC7KZDOLItQ7AWz9+u5AwXJxI=
Any thoughts or help would be greatly appreciated.
Thank You