DDB reconstruction

Question

Hello,During a DDB reconstruction process, how does it reconstruct the missing data?  In our case, it first performed a restore of the last backup which was from sometime early morning.  Does it then access the library storage directly to reconstruct the missing data or does it access the Commserve?  I’m asking because I’ve been told conflicting information from different Commvault technicians.  I just want to make sure that I understand this process clearly.  Thank you. Bill

Mike Struening · Accepted Answer

Hi @BillK, thanks for the post and welcome!this is definitely an interesting topic. the question is answered (high level) here:https://documentation.commvault.com/2022e/expert/12514_deduplication_database_recovery.htmlWhen the deduplication database (DDB) or partition of the DDB is detected as offline, unreadable, or unavailable, the DDB is automatically recovered from the latest DDB backup. This is the default recovery setting.Note: While running the DDB reconstruction job, if the restored DDB is found invalid or corrupted, then both theLast DDB Backup Job Idand theLast DDB Backup Timestamp are reset. You must run a full reconstruction process that deletes the existing DDB content, reads the entire data on the disk, and re-creates a new DDB from the deduplicated data read on the disk.DDB recovery process performs the following activities:Adds new records in the restored DDB for all the backups that were added to the DDB between the backup and when they went offline.Ensures that the data in the DDB and the CommServe database are in sync by cross-verifying and updating the metadata with the CommServe database with the DDB.Ensures that the DDB has been successfully reconstructed.The first two items of importance are that the first step is to restore the latest DDB backup. In almost certain likelihood, there will be jobs that have run and completed since this DDB backup happened, so how do we restore the DDB to its TRUE most recent state?The recon process looks at the data on the disk and reads the meta data files (aka the files about the files), then compares that to the Commserve database to see what is not in sync.For that to make sense, it’s best to think about what information lives where:The Commserve database knows about each job and which archive files each job ‘needs’The Dedupe Database knows how many references exist per archive file/blockNow, if the Recon process sees that there are a different number of references in the currently-in-recon-ddb than itshouldhave, then it adjusts the DDB accordingly. Same for anynewchunks/archive files.This does take a decent amount of time, though if you don’t have a valid DDB backup and have to do a Full Reconthen you can see how it would take even longer!Let me know if that helps explain everything.

Mike Struening · Answer

That’s correct. The metadata utilized is within the Chunk folders on the actual storage.

DDB reconstruction

4 replies

Readiverse Academy Certifications

Readiverse Academy Certifications

Sign up

Login to the community