Solved

The restore job is suspended after some time

2 years ago
August 30, 2022
5 replies
836 views

JuanPa
Bit
6 replies

The job is a VM restore, I share the cvd.log output

9540 21e4 08/29 21:35:28 174330 stat- ID [DMBaseClose], Samples [59], Time [0.011945] Sec(s), Average [0.000202] Sec/Sample
9540 21e4 08/29 21:35:28 174330 stat- ID [DMBaseChunkClose], Samples [64], Time [0.009216] Sec(s), Average [0.000144] Sec/Sample
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] SendCtrlBuf: Sending Ctrl Msg 0x85
9540 21e4 08/29 21:35:28 ###### CPipelayer::SendPipelineBuffer() - Tail has reported error [98][Services on the tail side of the SDT pipe are going down.]. Cannot continue.
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Error in flushing the current buffer.
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] FinishRestore: Restore completed. StopCode [0x1]-[Destroy recvd from client], StopCodeClnt [1]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: AppTy 106, Total AF 59, Seeks 12/14483/14483, WrkItms 216602, MxRdr (Bsy-Tot: 1-4, Pnd 0), MsgsRcvd 735, QLen 0, LstMsg FSR_MSG_DESTROY
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [CtrlCh Idle Time] Exp Avg [0.08], Total Avg [0.08], Total Time [9.99], Total Count [121]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [Seek Time] Exp Avg [0.00], Total Avg [0.00], Total Time [2.99], Total Count [14483]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [AllocBuf Time] Exp Avg [0.00], Total Avg [0.00], Total Time [54.40], Total Count [374718]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [AllocBuf Speed] Curr Avg [430.23] MB/Sec, Bytes [24542530128]; Total Avg [430.23] MB/Sec, Bytes [24542530128]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [MediaRead Time] Exp Avg [0.00], Total Avg [0.00], Total Time [33.17], Total Count [374718]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [MediaRead Speed] Curr Avg [261.07] MB/Sec, Bytes [9079855410]; Total Avg [261.07] MB/Sec, Bytes [9079855410]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: Total links [0], Blks Size Avg/Total [0/0] bytes; TagData [345672], TagDataSize [9040869698] bytes
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] SendCtrlBuf: Sending Ctrl Msg 0x14C
9540 21e4 08/29 21:35:28 ###### SdtNetLink::recvMsgPacket() - Received SDT_LINK_FIN packet.
9540 21e4 08/29 21:35:28 174330 Closing all sockets. RCId [3]
9540 21e4 08/29 21:35:28 ###### SdtNetLink::recvMsgPacket() - The other side has closed the network connection gracefully
9540 21e4 08/29 21:35:28 174330 SdtBase::setLastErr: Setting last err [92][The other side has closed the network connection gracefully] RCId [3]
9540 21e4 08/29 21:35:28 ###### CPipelayer::SendPipelineBuffer() - Cannot peek for message. Error [92][The other side has closed the network connection gracefully]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Error in flushing the current buffer.
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] SendCtrlBuf: Could not allocate pipeline buf
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Removing all SEEK msgs from queue. QLen = 0
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Removed 0 SEEK msgs from queue. QLen = 0
9540 21e4 08/29 21:35:28 ###### [FSRESTHEAD ] SCtrlSess::RelRef: Deleting CtrlSess obj [000000B2933F0510] as ref. count is 0. Listener [0], JobId [174330], SockFd: (6408/-1)
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] RelRef: Deleting restore head as ref. count is 0.
9540 21e4 08/29 21:35:28 174330 SdtBase::relRef: Going to delete SdtBase as ref count is down to 0. RCId [3]
9540 21e4 08/29 21:35:28 174330 SdtBase is being destroyed. RCId [3]
9540 21e4 08/29 21:35:28 174330 Name [Head], Start Delay [2672.76 s], Avg [7.13 ms], Procsng Time [37.38 s], Avg [99.70 us], Bytes [8.47 GB], Speed [232.09 MB/s], Samples [374956] RCId [3]
9540 21e4 08/29 21:35:28 174330 Name [Head CRC32], Start Delay [7.19 s], Avg [19.17 us], Procsng Time [19.46 s], Avg [51.89 us], Bytes [8.47 GB], Speed [445.99 MB/s], Samples [374956] RCId [3]
9540 21e4 08/29 21:35:28 174330 Name [Head Network], Start Delay [7652.27 s], Avg [20.41 ms], Procsng Time [93.83 s], Avg [250.25 us], Bytes [8.47 GB], Speed [92.47 MB/s], Samples [374956] RCId [3]
9540 16b4 08/29 21:35:37 174319 1-# [DM_BASE ] LAStateInfo: Waiting for Datamover Cmd. LinksRead=2/0/2, FreeQ=256, TempQ=0, BusyQ=0, RCID=16/1, RID=-1, TID=5812, State=0/0, Line=684
9540 16fc 08/29 21:35:52 174319 11-# [FSRESTHEAD ] OnIncomingData: Recvd FSR_MSG_KEEPALIVE
9540 2610 08/29 21:36:08 ###### cccAuthServer() - Only built-in certificate is loaded. Client name requested by remote end: (commvault-drp)(). Physical node in the registry: (commvault-drp_new).
9540 24f0 08/29 21:36:08 ###### cccAuthServer() - Only built-in certificate is loaded. Client name requested by remote end: (commvault-drp)(). Physical node in the registry: (commvault-drp_new).
9540 2324 08/29 21:36:09 174319 16-# [FSRESTHEAD ] OnIncomingData: Recvd FSR_MSG_KEEPALIVE9540 21e4 08/29 21:35:28 174330 stat- ID [DMBaseClose], Samples [59], Time [0.011945] Sec(s), Average [0.000202] Sec/Sample
9540 21e4 08/29 21:35:28 174330 stat- ID [DMBaseChunkClose], Samples [64], Time [0.009216] Sec(s), Average [0.000144] Sec/Sample
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] SendCtrlBuf: Sending Ctrl Msg 0x85
9540 21e4 08/29 21:35:28 ###### CPipelayer::SendPipelineBuffer() - Tail has reported error [98][Services on the tail side of the SDT pipe are going down.]. Cannot continue.
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Error in flushing the current buffer.
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] FinishRestore: Restore completed. StopCode [0x1]-[Destroy recvd from client], StopCodeClnt [1]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: AppTy 106, Total AF 59, Seeks 12/14483/14483, WrkItms 216602, MxRdr (Bsy-Tot: 1-4, Pnd 0), MsgsRcvd 735, QLen 0, LstMsg FSR_MSG_DESTROY
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [CtrlCh Idle Time] Exp Avg [0.08], Total Avg [0.08], Total Time [9.99], Total Count [121]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [Seek Time] Exp Avg [0.00], Total Avg [0.00], Total Time [2.99], Total Count [14483]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [AllocBuf Time] Exp Avg [0.00], Total Avg [0.00], Total Time [54.40], Total Count [374718]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [AllocBuf Speed] Curr Avg [430.23] MB/Sec, Bytes [24542530128]; Total Avg [430.23] MB/Sec, Bytes [24542530128]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [MediaRead Time] Exp Avg [0.00], Total Avg [0.00], Total Time [33.17], Total Count [374718]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: [MediaRead Speed] Curr Avg [261.07] MB/Sec, Bytes [9079855410]; Total Avg [261.07] MB/Sec, Bytes [9079855410]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Stats: Total links [0], Blks Size Avg/Total [0/0] bytes; TagData [345672], TagDataSize [9040869698] bytes
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] SendCtrlBuf: Sending Ctrl Msg 0x14C
9540 21e4 08/29 21:35:28 ###### SdtNetLink::recvMsgPacket() - Received SDT_LINK_FIN packet.
9540 21e4 08/29 21:35:28 174330 Closing all sockets. RCId [3]
9540 21e4 08/29 21:35:28 ###### SdtNetLink::recvMsgPacket() - The other side has closed the network connection gracefully
9540 21e4 08/29 21:35:28 174330 SdtBase::setLastErr: Setting last err [92][The other side has closed the network connection gracefully] RCId [3]
9540 21e4 08/29 21:35:28 ###### CPipelayer::SendPipelineBuffer() - Cannot peek for message. Error [92][The other side has closed the network connection gracefully]
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Error in flushing the current buffer.
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] SendCtrlBuf: Could not allocate pipeline buf
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Removing all SEEK msgs from queue. QLen = 0
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] Removed 0 SEEK msgs from queue. QLen = 0
9540 21e4 08/29 21:35:28 ###### [FSRESTHEAD ] SCtrlSess::RelRef: Deleting CtrlSess obj [000000B2933F0510] as ref. count is 0. Listener [0], JobId [174330], SockFd: (6408/-1)
9540 21e4 08/29 21:35:28 174330 28-# [FSRESTHEAD ] RelRef: Deleting restore head as ref. count is 0.
9540 21e4 08/29 21:35:28 174330 SdtBase::relRef: Going to delete SdtBase as ref count is down to 0. RCId [3]
9540 21e4 08/29 21:35:28 174330 SdtBase is being destroyed. RCId [3]
9540 21e4 08/29 21:35:28 174330 Name [Head], Start Delay [2672.76 s], Avg [7.13 ms], Procsng Time [37.38 s], Avg [99.70 us], Bytes [8.47 GB], Speed [232.09 MB/s], Samples [374956] RCId [3]
9540 21e4 08/29 21:35:28 174330 Name [Head CRC32], Start Delay [7.19 s], Avg [19.17 us], Procsng Time [19.46 s], Avg [51.89 us], Bytes [8.47 GB], Speed [445.99 MB/s], Samples [374956] RCId [3]
9540 21e4 08/29 21:35:28 174330 Name [Head Network], Start Delay [7652.27 s], Avg [20.41 ms], Procsng Time [93.83 s], Avg [250.25 us], Bytes [8.47 GB], Speed [92.47 MB/s], Samples [374956] RCId [3]
9540 16b4 08/29 21:35:37 174319 1-# [DM_BASE ] LAStateInfo: Waiting for Datamover Cmd. LinksRead=2/0/2, FreeQ=256, TempQ=0, BusyQ=0, RCID=16/1, RID=-1, TID=5812, State=0/0, Line=684
9540 16fc 08/29 21:35:52 174319 11-# [FSRESTHEAD ] OnIncomingData: Recvd FSR_MSG_KEEPALIVE
9540 2610 08/29 21:36:08 ###### cccAuthServer() - Only built-in certificate is loaded. Client name requested by remote end: (commvault-drp)(). Physical node in the registry: (commvault-drp_new).
9540 24f0 08/29 21:36:08 ###### cccAuthServer() - Only built-in certificate is loaded. Client name requested by remote end: (commvault-drp)(). Physical node in the registry: (commvault-drp_new).
9540 2324 08/29 21:36:09 174319 16-# [FSRESTHEAD ] OnIncomingData: Recvd FSR_MSG_KEEPALIVE

Best answer by Damian Andre

Ok so this looks like a livesync replication job? is that correct?

The error I see is

5820 2034 08/30 18:55:46 176487 VSRstArchiveReader::OpenArchiveFile() - Opening Archive File <2:43:373670> with CopyPrecedence:[3]
5820 2034 08/30 18:55:46 176487 Archive Manager Reported error: Archive file for the Storage Policy Copy is not valid

It seems like its trying to replicate (restore) a VM from storage policy copy #3 - but the job may not be fully copied there yet?

View original

Did this answer your question?

+17

Jos Meijer
Commvault Certified Expert
638 replies
2 years ago
August 30, 2022

Hi @JuanPa

Can you manually assign the commvault-drp_new when initiating the restore?

Seems that a media agent is automatically selected.

If this works disable the old media agent commvault-drp in the storage - media agent section. Not sure but seems I forgot to disable the old ma config in the GUI.

You might need to check the vsa assignment as well for the vcenter instance/subclient config, if the old vsa is configured then please change it to commvault-drp_new

JuanPa
Author
Bit
6 replies
2 years ago
August 30, 2022

Hi Jos,

I have made the allocations using commvault-drp_new and the problem still persists.The test has been carried out by manually selecting the ma and after some time it fails.

+23

Damian Andre
Vaulter
1286 replies
2 years ago
August 30, 2022

Hey @JuanPa - the CVD log isn't telling the story here. We may need to look at vsvrst.og or vsrst.log on the proxy being used for the restore to figure out what is going on.

If you open the job in the job controller, does it give a pending or failure reason at all?

JuanPa
Author
Bit
6 replies
2 years ago
August 31, 2022

Hi Damian,

Looking at the job it sends the following:

The jobmanager.log

the vsrst.log

5820 2560 08/30 18:55:46 176487 CVMDiskInfo::OpenHostDiskHandle() - Opened to [sni-app-pro-w01_DRP] [sni-app-pro-w01_1.vmdk] using Transport mode: [nbd]
5820 2560 08/30 18:55:46 176487 vsvRstObj::writeMetadata() - Extent Size [1048576] bytes [2048] sectors [512] bytes per sector
5820 2560 08/30 18:55:46 176487 vsvRstObj::writeData() - Memory allocated for data: [1048576] bytes
5820 2560 08/30 18:55:46 176487 CVArchive::ReadBuffer() - PL_PERF_COUNTERS (2, 43, 373279)
5820 2560 08/30 18:55:46 176487 CVArchive::ReadBuffer() - PL_FS_CLOSE_AFILE(2, 43, 373279)
5820 2560 08/30 18:55:46 176487 VSRstArchiveReader::ReadPipelineThread() - Received PL_FS_CLOSE_AFILE buffer.
5820 2560 08/30 18:55:46 176487 stat- ID [readmedia], Bytes [33561498], Time [1.542644] Sec(s), Average Speed [20.747974] MB/Sec
5820 2034 08/30 18:55:46 176487 VSRstArchiveReader::OpenArchiveFile() - Opening Archive File <2:43:373670> with CopyPrecedence:[3]
5820 2034 08/30 18:55:46 176487 Archive Manager Reported error: Archive file for the Storage Policy Copy is not valid
5820 2034 08/30 18:55:46 176487 Trying to attach to JM without initializing the object
5820 2034 08/30 18:55:46 176487 JM Client Failed to connect to JM
5820 2034 08/30 18:55:46 176487 VSRstArchiveReader::OpenArchiveFile() - Failed to open Archive
5820 2034 08/30 18:55:46 176487 VSRstSyncWorkerIdxBrowseClientResp::HandleSeekToOffset() - VSRstWorkerIdxBrowseClientResp::HandleSeekToOffset returned error.
5820 2034 08/30 18:55:46 176487 IDXBROWSECL Callback failed while processing browse error
5820 2034 08/30 18:55:46 176487 VSRstWorker::PerformBrowse() - Failed to browse VM Data, -1
5820 2034 08/30 18:55:46 176487 stat- ID [ProcessSeekMsgs], Samples [1], Time [3.972513] Sec(s), Average [3.972513] Sec/Sample
5820 2034 08/30 18:55:46 176487 stat- ID [ReceiveSeekMsgs], Samples [2], Time [2.240605] Sec(s), Average [1.120303] Sec/Sample
5820 2560 08/30 18:55:46 176487 VSRstArchiveReader::ReadPipelineThread() - ReadPipeline Thread [0-0] exited
5820 2560 08/30 18:55:46 176487 stat- ID [readmedia], Bytes [33561498], Time [1.542644] Sec(s), Average Speed [20.747974] MB/Sec
5820 2034 08/30 18:55:46 176487 CVArchive::ClosePipeline() - Shutting down head session
5820 2034 08/30 18:55:46 176487 CCVAPipelayer::ClosePipeline() - About to shut down RESTORE pipeline
5820 2034 08/30 18:55:46 176487 CPipelayer::ShutdownPipeline() - stat- SDT [000000E3DA3C9EF0] [duration - 4 seconds]
5820 2034 08/30 18:55:46 176487 CPipelayer::ShutdownPipeline() - SDT [000000E3DA3C9EF0] Shutdown completed.
5820 2034 08/30 18:55:46 176487 VSRstArchive::UpdateJobProgress() - Sending job stream update message for stream [1]
5820 2034 08/30 18:55:46 176487 ~CVArchive() - Destroying CVArchive. This=000000E3DA3C7750
5820 2034 08/30 18:55:46 ###### SdtTailSrvPool::Rel: Resetting SrvPool as ref. count is 0.
5820 278 08/30 18:55:46 ###### SdtTailSrv::MainLoop: Quit flag found set. Will get out ASAP.
5820 278 08/30 18:55:46 176487 SdtBase::setLastErr: Setting last err [98][Services on the tail side of the SDT pipe are going down.] RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtTailSrv::DropClientWorker: Removing Client [SDTPipe_commvault-drp_new_commvault-drp_new_176487_1661903742_5820_8244_000000E3DA3C9EF0], Id [2880] from the list. RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtTail::onDisconnect: Sending error [98][Services on the tail side of the SDT pipe are going down.] to the head. RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtTail::onDisconnect: Sent error code to the head RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtTailSrv::DropClientWorker: Size of selected list [0] RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtTailSrv::DropClientWorker: Draining away incoming data... RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtTailSrv::DropClientWorker: Have drained away incoming data... Cnt [0] RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtBase::relRef: Going to delete SdtBase as ref count is down to 0. RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtTail::logStats:Final Clnt [commvault-drp_new], PId [5820], Cnt - BsyProcsg/Allocs [0/843]; Time - Procsg/NwRecv/Total [2/2/4] secs, Bytes recvd [10940418] RCId [14]
5820 1e88 08/30 18:55:46 176487 SdtBase is being destroyed. RCId [14]
5820 1e88 08/30 18:55:46 176487 Name [Tail Network], Start Delay [101.87 s], Avg [120.99 ms], Procsng Time [2.31 s], Avg [2.74 ms], Bytes [10.43 MB], Speed [4.52 MB/s], Samples [842] RCId [14]
5820 1e88 08/30 18:55:46 176487 Name [Tail Decryption], Start Delay [48.54 ms], Avg [57.64 us], Procsng Time [9.85 ms], Avg [11.70 us], Bytes [10.43 MB], Speed [1.03 GB/s], Samples [842] RCId [14]
5820 1e88 08/30 18:55:46 176487 Name [Tail Decompression], Start Delay [239.00 us], Avg [283.85 ns], Procsng Time [125.56 ms], Avg [149.12 us], Bytes [10.44 MB], Speed [83.12 MB/s], Samples [842] RCId [14]
5820 1e88 08/30 18:55:46 176487 Name [Tail End], Start Delay [0.00 s], Avg [0.00 s], Procsng Time [0.00 s], Avg [0.00 s], Bytes [0.00 B], Speed [0.00 B/s], Samples [0] RCId [14]
5820 1e88 08/30 18:55:46 176487 Performing graceful connection termination
5820 1e88 08/30 18:55:46 176487 Closing all sockets.
5820 2034 08/30 18:55:47 ###### SdtTailSrv::Rel: Destroying SDT server [0], since ref count has gone to 0.
5820 2034 08/30 18:55:47 176487 VSRstWorker::FlushWriteBehindCache() - Waiting for writes to complete
5820 2034 08/30 18:55:47 176487 VSRstWorker::StopWriteBehindThread() - Waiting for write behind thread to exit
4920 9a0 08/30 18:55:51 176490 VSRstWorker::UpdateRestoreProgress() - VM [SEV-APP-PRO-W02_DRP] Restored 86243 of 1467165 extents 5%
4920 21fc 08/30 18:55:51 176490 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMRestoreCtx] received from Agent [commvault-drp_new]
4920 21fc 08/30 18:55:51 176490 VSRstCoordinator::OnMsgVMRestoreCtx() - Received Update for VM [SEV-APP-PRO-W02][5010f794-6bd2-92c3-9e3b-a0883fe6bf55] from [commvault-drp_new]
4920 21fc 08/30 18:55:51 176490 VSRstCoordinator::UpdateReplicationDB() - Updating replication status for VM [SEV-APP-PRO-W02] to [3]
5820 1f08 08/30 18:55:52 176487 VSRstWorker::WriteBehindThread() - Write behind thread exiting
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::CloseHostDiskHandle() - Closing disk handle [0x000000E3BF822B70] on list [0x000000E3D9E4C3A0] CVMDiskInfo object [0x000000E3BF04FCC0]
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::OpenHostDiskHandle() - Opening disk [sni-app-pro-w01.vmdk] on VM [sni-app-pro-w01_DRP] path [[TrueNAS02] sni-app-pro-w01_DRP_5/sni-app-pro-w01_DRP.vmdk] flags [ ]
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::OpenHostDiskHandle() - Opened to [sni-app-pro-w01_DRP] [sni-app-pro-w01.vmdk] using Transport mode: [nbd]
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::CloseHostDiskHandle() - Closing disk handle [0x000000E3BF822B70] on list [0x000000E3D9E4C3A0] CVMDiskInfo object [0x000000E3BF04FCC0]
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::CloseHostDiskHandle() - Closing disk handle [0x000000E3D9C9FA60] on list [0x000000E3D9E4C3A0] CVMDiskInfo object [0x000000E3BF04FCC0]
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::OpenHostDiskHandle() - Opening disk [sni-app-pro-w01_1.vmdk] on VM [sni-app-pro-w01_DRP] path [[TrueNAS02] sni-app-pro-w01_DRP_5/sni-app-pro-w01_DRP_1.vmdk] flags [ ]
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::OpenHostDiskHandle() - Opened to [sni-app-pro-w01_DRP] [sni-app-pro-w01_1.vmdk] using Transport mode: [nbd]
5820 2034 08/30 18:55:52 176487 CVMDiskInfo::CloseHostDiskHandle() - Closing disk handle [0x000000E3D9C9FA60] on list [0x000000E3D9E4C3A0] CVMDiskInfo object [0x000000E3BF04FCC0]
5820 2034 08/30 18:55:52 176487 VSRstWorker::UpdateRestoreProgress() - VM [sni-app-pro-w01_DRP] Restored 52405 of 79295 extents 66%
5820 2034 08/30 18:55:53 176487 stat- ID [writedisk], Bytes [33554432], Time [1.592427] Sec(s), Average Speed [20.095110] MB/Sec
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMRestoreCtx] received from Agent [commvault-drp_new]
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::OnMsgVMRestoreCtx() - Received Update for VM [sni-app-pro-w01][502df4d7-8d0e-683e-1fb6-cec7e0dc9b13] from [commvault-drp_new]
5820 2034 08/30 18:55:53 176487 VSRstWorker::RestoreVMData() - RestoreVMData failed. CloseArchiveRet: [1] Seek Error [1] Write Error [0] Read Error [0] Browse Error [0]
5820 2034 08/30 18:55:53 176487 VSRstSyncWorker::HandleProcessVMFailure() - ProcessVM failed for VM: [sni-app-pro-w01]->[sni-app-pro-w01_DRP]
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::UpdateReplicationDB() - Updating replication status for VM [sni-app-pro-w01] to [3]
5820 2034 08/30 18:55:53 176487 VSRstSyncWorker::HandleProcessVMFailure() - Setting live sync as paused for VM: [sni-app-pro-w01]->[sni-app-pro-w01_DRP]
5820 2034 08/30 18:55:53 176487 VSRstWorker::VMInfo_CloseRestoreVM() - CloseRestoreVM for [sni-app-pro-w01_DRP]
5820 2034 08/30 18:55:53 176487 CVMDiskInfo::OpenHostDiskHandle() - Opening disk [sni-app-pro-w01_1.vmdk] on VM [sni-app-pro-w01_DRP] path [[TrueNAS02] sni-app-pro-w01_DRP_5/sni-app-pro-w01_DRP_1.vmdk] flags [ ]
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::OnMsgReceived() - Message [JobManager_VMUpdateInfo] received from Agent [commvault-drp_new]
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::OnMsgVMUpdateInfo() - Received JM Update for VM [sni-app-pro-w01][502df4d7-8d0e-683e-1fb6-cec7e0dc9b13] from [commvault-drp_new]
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMRestoreCtx] received from Agent [commvault-drp_new]
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::OnMsgVMRestoreCtx() - Received Update for VM [sni-app-pro-w01][502df4d7-8d0e-683e-1fb6-cec7e0dc9b13] from [commvault-drp_new]
5820 1e58 08/30 18:55:53 176487 VSRstCoordinator::UpdateReplicationDB() - Updating replication status for VM [sni-app-pro-w01] to [4]
5820 2034 08/30 18:55:53 176487 CVMDiskInfo::OpenHostDiskHandle() - Opened to [sni-app-pro-w01_DRP] [sni-app-pro-w01_1.vmdk] using Transport mode: [nbd]
5820 2034 08/30 18:55:53 176487 CVMDiskInfo::OpenHostDiskHandle() - Opening disk [sni-app-pro-w01.vmdk] on VM [sni-app-pro-w01_DRP] path [[TrueNAS02] sni-app-pro-w01_DRP_5/sni-app-pro-w01_DRP.vmdk] flags [ ]
5820 2034 08/30 18:55:53 176487 CVMDiskInfo::OpenHostDiskHandle() - Opened to [sni-app-pro-w01_DRP] [sni-app-pro-w01.vmdk] using Transport mode: [nbd]
5820 2034 08/30 18:55:53 176487 CVMDiskInfo::Cleanup() - Cleaned up [0] VMs [3] remaining
5820 2034 08/30 18:55:54 176487 CVMWareInfo::ResetMaxSnapshots() - Resetting maximum snapshots on VM: [sni-app-pro-w01_DRP] [5009272f-abb6-a7b1-140b-314040deb900]
5820 2034 08/30 18:55:55 176487 CVMWareInfo::AllowActivitiesOnVM() - Allowed for: [sni-app-pro-w01_DRP] [5009272f-abb6-a7b1-140b-314040deb900]
5820 2034 08/30 18:55:55 176487 CVMWareInfo::CloseRestoreVM_Replication() - Creating replication snapshot of VM [sni-app-pro-w01_DRP] Guid [5009272f-abb6-a7b1-140b-314040deb900] for integrity purposes
5820 2034 08/30 18:55:56 176487 CVMWareInfo::CloseRestoreVM_Replication() - Successfully created snapshot of VM [sni-app-pro-w01_DRP] Guid [5009272f-abb6-a7b1-140b-314040deb900], snapshot [snapshot-84465]
5820 2034 08/30 18:55:56 176487 SeekDecisionVMWare::printStats() - Number of restored items skipped by SEEK optimization: [0] out of [33] (0%)
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMRestoreCtx] received from Agent [commvault-drp_new]
5820 2034 08/30 18:55:56 176487 SeekDecisionVMWare::DeleteCBTFiles() - Deleting CBT files folder [C:\Program Files\Commvault\ContentStore2\iDataAgent\JobResults\CV_JobResults\iDataAgent\VirtualServerAgent\Restore\176487\502df4d7-8d0e-683e-1fb6-cec7e0dc9b13\CBTFiles]
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgVMRestoreCtx() - Received Update for VM [sni-app-pro-w01][502df4d7-8d0e-683e-1fb6-cec7e0dc9b13] from [commvault-drp_new]
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::UpdateReplicationDB() - Updating replication status for VM [sni-app-pro-w01] to [4]
5820 2034 08/30 18:55:56 176487 VSRstController::SendGetNextVM() - Requesting Next VM from coordinator
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMRestoreCtx] received from Agent [commvault-drp_new]
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgVMRestoreCtx() - Received Update for VM [sni-app-pro-w01][502df4d7-8d0e-683e-1fb6-cec7e0dc9b13] from [commvault-drp_new]
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::UpdateReplicationDB() - Updating replication status for VM [sni-app-pro-w01] to [4]
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMRestoreResult] received from Agent [commvault-drp_new]
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgVMRestoreResult() - Received Result for VM [502df4d7-8d0e-683e-1fb6-cec7e0dc9b13] from [commvault-drp_new]
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgVMRestoreResult() - Marking VM [sni-app-pro-w01_DRP] as pending
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnIdle_Running() - All VMs have been processed, stopping agents
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMBackupGetNextVM] received from Agent [commvault-drp_new]
5820 2588 08/30 18:55:56 176487 VSRstController::OnMsgReceived() - Message [VirtualServer_VMAgentQuit] received from Server
5820 2588 08/30 18:55:56 176487 VSRstController::OnMsgVMAgentQuit() - Received Quit message from coordinator
5820 1e58 08/30 18:55:56 176487 VSRstCoordinator::OnMsgVMBackupGetNextVM() - GetNextVM Request from [commvault-drp_new]

5820 2034 08/30 18:56:06 176487 VSRstWorker::WorkerThread() - Worker [0] exiting
5820 2588 08/30 18:56:06 176487 VSRstController::StopAllWorkers() - All workers have stopped
5820 2588 08/30 18:56:07 176487 VSRstController::StopAllWorkers() - Finished stopping all workers
5820 2588 08/30 18:56:07 176487 VSRstController::OuputDatastoreMedaiSpeedCounters() - --- Datastore write speeds ---
5820 2588 08/30 18:56:07 176487 stat- ID [Datastore Write [TrueNAS02]], Bytes [33554432], Time [1.592413] Sec(s), Average Speed [20.095292] MB/Sec
5820 2588 08/30 18:56:07 176487 VSRstController::OnAgentStopped() - Controller is stopping
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::OnMsgReceived() - Message [VirtualServer_VMAgentShutdown] received from Agent [commvault-drp_new]
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::OnMsgReceived() - Message [CNSession_Disconnect] received from Agent [commvault-drp_new]
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::OnAgentStopped() - Agent [commvault-drp_new] has stopped
5820 1e58 08/30 18:56:07 176487 CVRemoteAgentServer::MonitorAgents() - There are no running agents
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::SummarizeVMList() - VM sni-app-pro-w01_DRP --> PENDING
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::Run() - ========== JOB SUMMARY REPORT ==========
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::Run() - Total VMs to Restore [1]
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::Run() - VMs pending [1]
5820 1e58 08/30 18:56:07 176487 VSRstCoordinator::Run() - =========================================
5820 1e58 08/30 18:56:07 176487 VSRstController::StopAllWorkers() - All workers have stopped
5820 1e58 08/30 18:56:07 176487 VSRstController::StopAllWorkers() - Finished stopping all workers
5820 1e58 08/30 18:56:07 176487 Sending FAILED complete message to JM, 176487
5820 150c 08/30 18:56:07 176487 CVJobCtrlLog::unregisterProcess(): successfully removed file [C:\Program Files\Commvault\ContentStore2\Base\JobControl\5.820]

5820 1e10 08/30 18:56:08 176487 CVMDiskInfo::_VixDiskLibThread() - VixDiskLib thread exiting
[commvaultdrp@vsphere.local]
5820 1e58 08/30 18:56:08 176487 stat- ID [Disk Open], Samples [2], Time [0.848106] Sec(s), Average [0.424053] Sec/Sample
5820 1e58 08/30 18:56:08 176487 stat- ID [Disk Close], Samples [2], Time [0.016326] Sec(s), Average [0.008163] Sec/Sample
5820 1e58 08/30 18:56:08 176487 CVMWareInfo::ShutdownVMInfo() - VMDiskInfo::Shutdown complete
5820 1e58 08/30 18:56:08 176487 CVMWareInfo::ShutdownVMInfo() - CVISDKCppBridge::Cleanup complete
5820 a54 08/30 18:56:08 176487 CVDTerminationHandler::_WaitForCVDAliveMutex() - CVD Alive Check Quit Event Signaled
5820 1e58 08/30 18:56:08 176487 = VSRst has exited
5820 1e58 08/30 18:56:08 176487 ================================================================

+23

Damian Andre
Vaulter
1286 replies
Answer
2 years ago
August 31, 2022

Ok so this looks like a livesync replication job? is that correct?

The error I see is

5820 2034 08/30 18:55:46 176487 VSRstArchiveReader::OpenArchiveFile() - Opening Archive File <2:43:373670> with CopyPrecedence:[3]
5820 2034 08/30 18:55:46 176487 Archive Manager Reported error: Archive file for the Storage Policy Copy is not valid

It seems like its trying to replicate (restore) a VM from storage policy copy #3 - but the job may not be fully copied there yet?

Reply

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos

Reply

Sign up

Login to the community

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings