I’ve started a sample restore, and it simply hung.
Details:
According to the pbm status
, the restore started on today, at 09:51, local time.
[feng.sian@DSAGIMOGDBSP01 ~]$ pbm status
...
Currently running:
==================
Snapshot restore "2021-02-25T18:43:59Z", started at 2021-02-26T12:51:27Z. Status: snapshot restore. [op id: 6038eecfe36b8a6ceed1e538]
Backups:
========
FS /data/backup/DEV
Snapshots:
2021-02-26T12:50:05Z 35.42KB [complete: 2021-02-26T12:50:25]
2021-02-25T19:28:44Z 22.48KB [complete: 2021-02-25T19:29:04]
2021-02-25T18:43:59Z 18.19KB [complete: 2021-02-25T18:44:20]
2021-02-25T12:34:44Z 14.48KB [complete: 2021-02-25T12:35:05]
2021-02-25T11:04:20Z 11.98KB [complete: 2021-02-25T11:04:40]
2021-02-24T23:23:03Z 8.92KB [complete: 2021-02-24T23:23:23]
PITR chunks:
2021-02-25T19:29:04 - 2021-02-26T01:01:04 320.95KB
2021-02-25T18:44:20 - 2021-02-25T19:29:03 51.45KB
2021-02-25T12:35:05 - 2021-02-25T18:44:19 403.16KB
2021-02-25T11:04:40 - 2021-02-25T12:35:03 47.64KB
It is already 19:33 and the pbm status
still says it is currently running the restore.
Checking out the pbm logs
:
...
2021-02-26T12:50:05Z I [rs-dev-00/DSAGIMOGDBSP01.axabr.ad:27001] got epoch {1614301247 1}
2021-02-26T12:50:22Z I [rs-dev-00/DSAGIMOGDBSP01.axabr.ad:27001] [backup/2021-02-26T12:50:05Z] backup started
2021-02-26T12:50:25Z I [rs-dev-00/DSAGIMOGDBSP01.axabr.ad:27001] [backup/2021-02-26T12:50:05Z] mongodump finished, waiting for the oplog
2021-02-26T12:50:30Z I [rs-dev-00/DSAGIMOGDBSP01.axabr.ad:27001] [backup/2021-02-26T12:50:05Z] backup finished
2021-02-26T12:51:27Z I [rs-dev-00/DSAGIMOGDBSP01.axabr.ad:27001] got command restore [name: 2021-02-26T12:51:27.166184056Z, backup name: 2021-02-25T18:43:59Z] <ts: 1614343887>
2021-02-26T12:51:27Z I [rs-dev-00/DSAGIMOGDBSP01.axabr.ad:27001] got epoch {1614343830 1}
2021-02-26T12:51:27Z I [rs-dev-00/DSAGIMOGDBSP01.axabr.ad:27001] [restore/2021-02-25T18:43:59Z] restore started
[feng.sian@DSAGIMOGDBSP01 ~]$
Nothing came out after the last line.
Checking the journalctl -u pbm-agent
:
...
Feb 26 09:51:27 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:27.000-0300 I [restore/2021-02-25T18:43:59Z] restore started
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.585-0300 connected to node type: replset
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.595-0300 archive prelude admin.pbmLog
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.595-0300 archive prelude admin.pbmOpLog
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.595-0300 archive prelude admin.pbmBackups
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.595-0300 archive prelude config.system.indexBuilds
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude wmISMonitoring.isServerStats
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.pbmConfig
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.pbmCmd
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.pbmRestores
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.pbmPITRChunks
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.pbmLock
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.pbmAgents
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.pbmLockOp
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.system.users
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.system.roles
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive prelude admin.system.version
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive format version "0.1"
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive server version "4.4.1-3"
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 archive tool version "0.0.1"
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.596-0300 setting number of parallel collections to number of parallel collections in archive (2)
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 preparing collections to restore from
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmLog, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmLog metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection admin.pbmOpLog bson to restore to admin.pbmOpLog
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection metadata from admin.pbmOpLog to restore to admin.pbmOpLog
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmBackups, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmBackups metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmConfig, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmConfig metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmCmd, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmCmd metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmRestores, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmRestores metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmPITRChunks, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmPITRChunks metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmLock, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.pbmLock metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection admin.pbmAgents bson to restore to admin.pbmAgents
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection metadata from admin.pbmAgents to restore to admin.pbmAgents
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection admin.pbmLockOp bson to restore to admin.pbmLockOp
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection metadata from admin.pbmLockOp to restore to admin.pbmLockOp
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection admin.system.users bson to restore to admin.pbmRUsers
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection metadata from admin.system.users to restore to admin.pbmRUsers
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection admin.system.roles bson to restore to admin.pbmRRoles
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection metadata from admin.system.roles to restore to admin.pbmRRoles
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.system.version, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 skipping restoring admin.system.version metadata, it is excluded
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection config.system.indexBuilds bson to restore to config.system.indexBuilds
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection metadata from config.system.indexBuilds to restore to config.system.indexBuilds
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection wmISMonitoring.isServerStats bson to restore to wmISMonitoring.isServerStats
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.599-0300 found collection metadata from wmISMonitoring.isServerStats to restore to wmISMonitoring.isServerStats
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.608-0300 first non special collection admin.system.users found. The demultiplexer will handle it and the remainder
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.608-0300 restoring up to 2 collections in parallel
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.612-0300 collection admin.pbmRUsers doesn't exist, skipping drop command
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.612-0300 reading metadata for admin.pbmRUsers from archive on stdin
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.612-0300 creating collection admin.pbmRUsers using options from metadata
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.629-0300 restoring admin.pbmRUsers from archive on stdin
Feb 26 09:51:30 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:30.631-0300 using 10 insertion workers
Feb 26 09:51:31 DSAGIMOGDBSP01 pbm-agent[107868]: 2021-02-26T09:51:31.083-0300 restoring indexes for collection admin.pbmRUsers from metadata
[feng.sian@DSAGIMOGDBSP01 ~]$
So… what is going on?