Many SharePoint environments contain hundreds or thousands of sites. In these cases, it is usually not practical to create one ShArc policy per site.
However, policy planning should not start with the number of sites alone. It should start with the SharePoint storage level you want to reach and maintain.
For example, assume your tenant currently uses 80 TB of SharePoint storage. Your included SharePoint storage capacity is 30 TB, so you are currently paying for 50 TB of additional SharePoint storage.
In this case, the first goal is to archive approximately 50 TB of cold data to remove the need for paid additional storage. However, we also recommend keeping a growth buffer within your included SharePoint storage capacity. If your included capacity is 30 TB and you want to keep 10% free, you should reserve 3 TB as buffer. This means your target SharePoint usage is not 30 TB, but approximately 27 TB.
In this example, the initial archive target would be approximately 53 TB:
80 TB current SharePoint usage – 27 TB target SharePoint usage = 53 TB archive target.
After the initial rollout, the goal is to keep this buffer over time. If your SharePoint storage grows by, for example, 1 TB per month, your recurring policies should aim to archive at least that amount of newly cold data over time, so the tenant remains close to the target level of 27 TB used and 3 TB free.
For example, if your tenant contains 8,000 SharePoint sites, you should not create 8,000 individual policies. Instead, we recommend grouping sites by archiving logic, expected archive volume, business relevance, and operational risk.
This article explains how to define an archive target, structure ShArc policies for large SharePoint tenants, and plan a controlled rollout.
Before creating policies, define the SharePoint storage level you want to reach and maintain. This includes calculating how much data must be archived to remove paid additional storage, keeping a buffer within your included SharePoint storage capacity, and considering expected monthly SharePoint growth if this information is available.
After that, the recommended operating model is:
You should avoid starting all sites at once. A controlled rollout helps reduce operational risk, keeps SharePoint load manageable, and gives your IT team time to validate the results before scaling further.
Before creating policies, define the SharePoint storage level you want to reach and maintain.
For example, assume your tenant currently uses 80 TB of SharePoint storage. Your included SharePoint storage capacity is 30 TB, so you are currently paying for 50 TB of additional SharePoint storage.
The first goal is to archive approximately 50 TB of cold data to remove the need for paid additional storage. However, stopping at exactly 30 TB would leave no room for future SharePoint growth. Therefore, we recommend keeping a buffer within the included SharePoint storage capacity.
If your included capacity is 30 TB and you want to keep 10% free, you should reserve 3 TB as buffer. This means your target SharePoint usage is approximately 27 TB.
In this example, the initial archive target is:
80 TB current SharePoint usage – 27 TB target SharePoint usage = 53 TB archive target.
This gives you two targets:
If you know your average monthly SharePoint growth, include it in your operating model. For example, if SharePoint grows by approximately 1 TB per month, your recurring policies should aim to archive at least 1 TB of newly cold data over time. This helps maintain the buffer instead of only reaching it once.
Once this target is clear, you can use the rollout process below to identify which sites and policies should contribute to that target.
Before creating policies in ShArc, start with a site inventory export from the SharePoint Admin Center.
The export should include:
This export becomes the basis for your rollout planning. It helps you identify the largest sites, sensitive areas, and sites that can follow the same archiving rule.
After exporting the site list, prepare a rollout matrix. This can be done in Excel or another planning tool.
The rollout matrix helps you decide which sites should be grouped together and which sites should be treated separately.
Recommended columns:
| Column | Purpose |
|---|---|
| Site URL | SharePoint site to be included in ShArc |
| Storage used | Helps prioritize large sites |
| Department / business unit | Helps identify ownership and risk |
| Proposed archiving rule | Example: older than 1 year, 2 years, 3 years |
| Estimated archive volume | Result from ShArc Estimation |
| Proposed policy batch | Example: Batch 01, Batch 02 |
| Risk level | Low, medium, high |
| Needs simulation | Yes / No |
| Excluded / review required | For sensitive or special sites |
| Notes | Additional comments |
This planning step is especially useful for large tenants because it prevents accidental overloading of a policy with too many large or sensitive sites.
| Policy group | Example rule | Typical use case |
|---|---|---|
| Conservative archive | Not modified since 5–8 years, file size >10 MB | Very large or sensitive sites |
| Standard archive | Not modified since 3 years, file size >5 MB | Regular collaboration sites |
| Aggressive archive | Not modified since 1–2 years, file size >1 MB | Project sites or completed workspaces |
| Excluded / review required | No automatic archiving | HR, Legal, Finance, retention-heavy areas |
For many customers, the best grouping is not only based on age. Department, risk, site type, and business ownership can be equally important.
For example, HR or Legal sites may require a different rollout model than Marketing or project collaboration sites.
ShArc Estimation helps you understand the expected archive potential before creating final policy batches.
Important: Estimation is currently performed from the Offload site and supports one URL at a time. Estimation is not performed directly from the Policy page and does not estimate multiple URLs as one combined group.
For large tenants, we recommend using Estimation as a planning tool:
You usually do not need to estimate every single site individually. For very large tenants, a practical approach is:
| Site group | Recommended approach |
|---|---|
| Largest 20–100 sites | Estimate individually |
| Sites between 100 GB and 1 TB | Estimate individually if they are part of the first rollout waves |
| Sites below 100 GB | Estimate representative examples from each archiving cohort |
| Sensitive sites | Review separately before including them in a policy |
The goal is not to estimate all sites one by one. The goal is to estimate enough representative and high-impact sites to build safe rollout batches.
Very large sites can create high savings, but they can also create more SharePoint load and longer runtimes.
For sites significantly larger than 1 TB, we recommend a conservative first test. For example:
After validating the result, you can gradually expand the scope, for example:
This approach helps you validate the behavior before applying a broader archiving rule.
Once you have estimation results for individual URLs, use your rollout matrix to build policy batches.
A ShArc policy can contain multiple SharePoint URLs. Therefore, you do not need one policy per site.
However, the expected volume of a policy batch should currently be calculated outside of ShArc, for example in Excel. Add the estimated archive volume per URL and group URLs with the same archiving logic into one policy batch.
A practical starting point is to build batches with approximately 5–10 TB of estimated offload volume. However, the exact size depends on:
Policy batches should be built based on operational risk, not only based on the number of sites.
Example:
| Policy batch | Rule | Input | Estimated offload volume |
|---|---|---|---|
| Batch 01 – Standard archive | Not modified since 3 years, file size >5 MB | Sum of individually estimated URLs | 7.5 TB |
| Batch 02 – Conservative archive | Not modified since 5 years, file size >10 MB | Sum of individually estimated URLs | 6.2 TB |
| Batch 03 – Project sites | Not modified since 2 years, file size >1 MB | Based on representative estimations and site inventory | 8.8 TB |
| Batch 04 – Large sites | Individual validation required | Individually estimated large sites | 4.5 TB |
ShArc Simulation provides more detailed results than Estimation, including reporting down to the library level.
However, Simulation can take longer depending on the size of the selected sites, the number of files, and the configured filters.
For large tenants, Simulation should be used selectively rather than for every single site.
We recommend using Simulation for:
You do not need to simulate every small or low-risk site if the archiving logic has already been validated.
After planning, estimation, and selective simulation, start with the first policy batch.
We recommend beginning with a controlled batch instead of starting with the full tenant.
A good first batch should be:
After the first offload run, validate the outcome before continuing.
Recommended validation points:
Once the first batch has been validated, continue with the next batch.
The rollout should continue in waves:
This controlled rollout model helps avoid unnecessary risk and allows your team to improve the configuration as you learn more about the tenant.
For large tenants, use a clear naming convention for policies.
Recommended format:
Batch number – Archiving logic – Scope
Examples:
This makes it easier to understand what each policy does and where it belongs in the rollout.
For large SharePoint tenants, avoid the following approaches:
Instead, use a planning-first approach based on site inventory, individual URL estimations, expected archive volume, risk, and rollout sequence.
ShArc can support large SharePoint environments with several thousand sites. However, large-scale archiving should be rolled out in a structured way.
For large tenants, we recommend:
The goal is not only to archive data once. The better operating model is to reduce SharePoint usage to a defined target level and then keep enough free included capacity for future growth.
If you are working with a large SharePoint tenant and need help creating the right policy structure, we recommend a short technical planning session.
In this session, we can help you:
Contact us if you would like support with planning your ShArc policy rollout.