Hi @Ken_H
Do keep an eye on the utilization of the disk. Once its utilization matches that of the other disks, it should show as Online.
Also, you can check the pending heals for the disk by running gluster v heal $(gluster v list) info.
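For example, something like this refreshes just the per-brick pending counts every minute (a rough sketch, assuming the node hosts a single volume and watch is available):
# Show only the Brick and entry-count lines, refreshed every 60 seconds
watch -n 60 'gluster v heal $(gluster v list) info | grep -E "^(Brick|Number of entries)"'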
Regards,
I’ve seen replacement drives appear as Online even while they were 1.3 TB smaller than the other drives on the server, so matching space used against the other drives doesn’t seem to be a reliable way to estimate when the rebuild will complete.
Running “gluster v heal HyperScale info” lists every single entry (segment? file?) that needs to be healed, and on a newly replaced drive these run into the hundreds of thousands. I tried to filter them out using:
gluster v heal HyperScale info | grep -v gfid | grep -v Folder_
And this gives:
Brick inf-srvp110sds.apacorp.net:/ws/disk1/ws_brick
Status: Connected
Number of entries: 0
Brick inf-srvp111sds.apacorp.net:/ws/disk1/ws_brick
Status: Connected
Number of entries: 0
Brick inf-srvp112sds.apacorp.net:/ws/disk1/ws_brick
Status: Connected
Number of entries: 0
Brick inf-srvp110sds.apacorp.net:/ws/disk2/ws_brick
Status: Connected
Number of entries: 1
Brick inf-srvp111sds.apacorp.net:/ws/disk2/ws_brick
Status: Connected
Number of entries: 371113
Brick inf-srvp112sds.apacorp.net:/ws/disk2/ws_brick
Status: Connected
Number of entries: 357442
Unfortunately, the output never completed even after being left to run for 20 hours. Long story short, there doesn’t appear to be a way to monitor the heal process.
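The closest thing to a count-only view I’ve found is the statistics variant, which is supposed to report just the per-brick totals without enumerating each entry (assuming the installed release supports it; I haven’t verified that it completes any faster on a brick this far behind):
gluster v heal HyperScale statistics heal-count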
Ken
Hello @Ken_H,
How did you solve the situation?
There does not appear to be an answer to this problem. If you have multiple drives each reporting multiple bad sectors, the only option is to replace one of the drives and then wait days for it to rebuild. Depending on when the rebuild completes, it could be quite a while before you notice and get the next failing drive replaced.
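If it helps, one crude mitigation is to poll the heal backlog and alert when it drains, so at least the rebuild finishing doesn’t go unnoticed; a rough sketch, assuming statistics heal-count is supported on your release and that local mail delivery works (the address below is a placeholder):
# Sum the per-brick pending-heal counts hourly; mail once when the backlog hits zero
while true; do
  pending=$(gluster v heal HyperScale statistics heal-count \
            | awk -F: '/Number of entries/ {sum += $2} END {print sum + 0}')
  if [ "$pending" -eq 0 ]; then
    echo "HyperScale heal backlog drained" | mail -s "Gluster heal complete" admin@apacorp.net
    break
  fi
  sleep 3600
done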
@Pavan Bedadala can you shed some light on this post?