Hi,
I struggling with a number of related backup issues, I wondered if anyone had any advice that might help.
I’m carrying out some testing on a 3 node cluster formed from Hetzner cloud nodes, in a namespace called mongodb. Percona Mongodb has been installed using Helm . Changes to the setup are applied as follows
helm upgrade mongodb-clu1 percona/psmdb-db -n mongodb --reuse-values -f storage.yaml
Where different yaml files adjust the various parts of the S3 backup.
Connectivity to S3 has been successfully tested -
kubectl run -i --rm aws-cli --image=perconalab/awscli --restart=Never – bash -c ‘AWS_ACCESS_KEY_ID=aws_access_key
AWS_SECRET_ACCESS_KEY=aws_secret_access_key
AWS_DEFAULT_REGION=eu-west-2
/usr/bin/aws
s3 ls s3://our.fqdn.com/archive/server20/’
If you don’t see a command prompt, try pressing enter.
2021-07-14 12:09:35 0
2022-04-25 02:01:37 1184854844 backup_server20_22-04-25.tgz
2022-05-30 02:01:33 1192485332 backup_server20_22-05-30.tgz
2022-06-27 02:01:34 1192498841 backup_server20_22-06-27.tgz
2022-07-25 02:01:36 1192498442 backup_server20_22-07-25.tgz
2022-08-29 02:01:32 1192491757 backup_server20_22-08-29.tgz
2022-09-26 02:01:36 1192491772 backup_server20_22-09-26.tgz
but attempts to schedule the backup fail with the following error
2023-06-02T10:58:11.000+0000 E [agentCheckup] check storage connection: storage check failed with: get S3 object header: RequestError: send request failed
caused by: Head “https://s3.eu-west-2.amazonaws.com/our.fqdn.com/archive/server20/.pbm.init”: net/http: invalid header field value for “Authorization”
As the backup hasn’t completed, the log is full of these messages.
One final point, I can’t be sure at present if these messages are from the original cron job, which didn’t reference the namespace, and which I can’t delete.
root@kube-1:~# kubectl get psmdb-backup -A
NAMESPACE NAME CLUSTER STORAGE DESTINATION TYPE STATUS COMPLETED AGE
default backup1 mongodb-clu1 s3-eu-west 31d
mongodb backup2 mongodb-clu1-psmdb-db s3-eu-west 2023-05-26T14:20:53Z logical error 8m8s
attempts to remove backup1 claim to have succeeded, but always hang. Is there a way to remove it ?
oot@kube-1:~# kubectl get psmdb-backup -A
NAMESPACE NAME CLUSTER STORAGE DESTINATION TYPE STATUS COMPLETED AGE
default backup1 mongodb-clu1 s3-eu-west 38d
root@kube-1:~# kubectl -n default delete psmdb-backup/backup1
perconaservermongodbbackup.psmdb.percona.com “backup1” deleted
^Croot@kube-1:~# kubectget psmdb-backup -Ap1
NAMESPACE NAME CLUSTER STORAGE DESTINATION TYPE STATUS COMPLETED AGE
default backup1 mongodb-clu1 s3-eu-west 38d
mongodb cron-mongodb-clu1-psm-20230602113000-4wp57 mongodb-clu1-psmdb-db s3-eu-west 2023-06-02T11:30:21Z logical error 4m53s
root@kube-1:~# kubectl -n mongodb delete psmdb-backup/cron-mongodb-clu1-psm-20230602113000-4wp57
perconaservermongodbbackup.psmdb.percona.com “cron-mongodb-clu1-psm-20230602113000-4wp57” deleted
root@kube-1:~# kubectl get psmdb-backup -A
NAMESPACE NAME CLUSTER STORAGE DESTINATION TYPE STATUS COMPLETED AGE
default backup1 mongodb-clu1 s3-eu-west 38d
There is more config info available if required.
Thanks,
Mike