Question

Content indexing - high CPU utilization

  • December 23, 2024
  • 3 replies
  • 62 views

Nikos.Kyrm
Byte

Hello Commvault colleagues,

After enabling content indexing for Exchange Online / OneDrive backup jobs, I noticed that it consumes extremely high CPU resources, especially during the initial backup jobs!

The content indexing jobs start right after the backup jobs.

So, apart from adding more CPU cores to the server, is there any way or recommendation to distribute this high CPU utilization, perhaps to another server?

I already tried adding a secondary node to the associated O365 index, but the high CPU utilization still occurs only on the first MediaAgent.


Thanks in advance for your feedback,
Nikos

3 replies

  • Vaulter
  • 90 replies
  • December 24, 2024

Hi ​@Nikos.Kyrm,

 

I would first like to share the document below so that you can validate the configuration:

https://documentation.commvault.com/2024e/essential/guidelines_for_exchange_online_access_nodes.html

Guidelines for Exchange Online Access Nodes

With Content Indexing
If you use content indexing, use the following guidelines.

Environment                        Medium                        Large
Mailboxes                          5,000                         10,000
Messages per day                   500,000                       1 million

Guidelines
Access nodes                       Normal availability: 1        Normal availability: 1
                                   High availability: 2 or more  High availability: 2 or more
CPU or vCPU for the access nodes   8 cores                       16 cores
RAM for the access nodes           16 GB                         32 GB
Streams per access node            10                            20
Azure apps                         5                             10

Assumptions

The guidelines are based on the following assumptions:

  • The average mailbox size is 5 GB.

  • The average number of messages in each mailbox is 50,000.

  • The average message size is 100 KB.
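As a rough illustration only (this is not part of the Commvault documentation), the figures above can be encoded in a short Python sketch that compares an access node's CPU and RAM against the guideline for its tier. The names `Guideline`, `GUIDELINES`, `pick_tier`, and `shortfalls` are hypothetical, and treating anything above 10,000 mailboxes as "large" is an assumption, since the table does not cover bigger environments.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Guideline:
    """One column of the 'With Content Indexing' table quoted above."""
    mailboxes: int
    messages_per_day: int
    cpu_cores: int
    ram_gb: int
    streams_per_node: int
    azure_apps: int


# Values copied from the quoted table; the dictionary itself is illustrative.
GUIDELINES = {
    "medium": Guideline(5_000, 500_000, 8, 16, 10, 5),
    "large": Guideline(10_000, 1_000_000, 16, 32, 20, 10),
}


def pick_tier(mailboxes):
    """Return the smallest tier whose mailbox count covers the environment."""
    for name in ("medium", "large"):
        if mailboxes <= GUIDELINES[name].mailboxes:
            return name
    # Assumption: beyond the table, use "large" as the minimum baseline.
    return "large"


def shortfalls(tier, cpu_cores, ram_gb):
    """List the resources of one access node that fall below the guideline."""
    guide = GUIDELINES[tier]
    issues = []
    if cpu_cores < guide.cpu_cores:
        issues.append(f"CPU: {cpu_cores} cores, guideline is {guide.cpu_cores}")
    if ram_gb < guide.ram_gb:
        issues.append(f"RAM: {ram_gb} GB, guideline is {guide.ram_gb} GB")
    return issues


# Sanity check of the stated assumptions, using decimal units:
# 50,000 messages x 100 KB = 5,000,000 KB = 5 GB per mailbox.
avg_mailbox_gb = 50_000 * 100 / 1_000_000

if __name__ == "__main__":
    tier = pick_tier(10_000)
    print(tier, shortfalls(tier, cpu_cores=8, ram_gb=16), avg_mailbox_gb)
```

For example, a node with 8 cores and 16 GB of RAM in a 10,000-mailbox environment would be reported as short on both CPU and RAM relative to the "Large" column.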

 

Ensure that the Exchange server is running a supported OS version:

https://documentation.commvault.com/2024e/essential/system_requirements_for_exchange_online.html

 

Ensure that all Commvault folders are excluded from antivirus scanning:

https://documentation.commvault.com/v11/essential/antivirus_exclusions_for_windows.html

 

Regards,

Wasim


Nikos.Kyrm
  • Author
  • Byte
  • 207 replies
  • December 26, 2024

Hello ​@Wasim and thanks for your reply!

We are in a “Large” environment with around 10,000 mailboxes.

Because content indexing is also enabled in this case, I noticed high CPU utilization in the content indexing jobs (which run after the backup jobs) during the initial backup jobs!

So, I'm looking for a way to spread this high CPU utilization to another MediaAgent.
To achieve that, must the new MediaAgent also be configured as an access node for Exchange Online? I already tried adding a secondary node to the associated O365 index, but the high CPU utilization still occurs only on the first MediaAgent.
 

Merry Christmas & Happy New Year 🎉🎉🎉


  • Byte
  • 29 replies
  • December 30, 2024

@Nikos.Kyrm FWIW, when I was having this problem, I found that AV was scanning each file. Once the exclusions were reapplied, CPU utilization went back to normal and content indexing was much faster.

