Deduplication in Windows Server
Data deduplication finds and removes duplicate data without compromising its fidelity or integrity. The goal is to store more data in less space: files are segmented into small variable-sized chunks (32–128 KB), duplicate chunks are identified, and a single copy of each chunk is kept. Redundant copies of a chunk are replaced by a reference to that single copy. The chunks are compressed and then organized into special container files in the System Volume Information folder.
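The chunk-and-reference idea can be illustrated with a small conceptual sketch. This is not how the Windows feature is implemented: it assumes tiny fixed-size chunks (the real feature uses variable-sized chunks of 32–128 KB), keeps everything in memory, skips compression, and all variable names are made up for illustration.

```powershell
# Conceptual sketch only: fixed 4-byte chunks stand in for variable-sized chunking.
$chunkSize = 4
$data  = [System.Text.Encoding]::ASCII.GetBytes('ABCDABCDEFGHABCD')
$sha   = [System.Security.Cryptography.SHA256]::Create()
$store = @{}   # chunk hash -> the single stored copy of that chunk
$refs  = @()   # the "file", expressed as an ordered list of chunk hashes

for ($i = 0; $i -lt $data.Length; $i += $chunkSize) {
    # Copy the next chunk out of the input
    $len   = [Math]::Min($chunkSize, $data.Length - $i)
    $chunk = New-Object byte[] $len
    [Array]::Copy($data, $i, $chunk, 0, $len)

    # Identify the chunk by its hash; store it only the first time it is seen
    $hash = [BitConverter]::ToString($sha.ComputeHash($chunk))
    if (-not $store.ContainsKey($hash)) { $store[$hash] = $chunk }

    # Every occurrence, duplicate or not, becomes a reference
    $refs += $hash
}

"{0} chunk references, {1} unique chunks stored" -f $refs.Count, $store.Count
```

The sample input contains the chunk ABCD three times and EFGH once, so the file is described by four references while only two distinct chunks are kept.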
Enhanced Data Deduplication features in Windows Server 2012 R2
- Data deduplication for remote storage of Virtual Desktop Infrastructure (VDI) workloads
- Expand an optimized file on its original path
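As a hedged illustration of the second feature, the Expand-DedupFile cmdlet from the Deduplication module restores an optimized file in place. The file path below is hypothetical, and the volume must have enough free space to hold the fully expanded file.

```powershell
# Hypothetical path: replace with an optimized file on a deduplicated volume.
# The file is expanded at its original location.
Expand-DedupFile -Path 'E:\Shares\Projects\report.docx'
```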
When using the Data Deduplication feature for the first time or migrating from a previous version of Windows Server, be sure to consider the following related technologies and issues:
- Failover Clusters
- DFS Replication
- FSRM quotas
- Single Instance Storage or NAS Box
Install and Configure Data Deduplication using GUI
1. Open Server Manager. In the Add Roles and Features Wizard, under Server Roles, select File and Storage Services.
2. Select the File Services check box, and then select the Data Deduplication check box.
3. Click Next until the Install button is active, and then click Install.
4. From the Server Manager dashboard, right-click a data volume and choose Configure Data Deduplication. The Deduplication Settings page appears.
5. In the Data deduplication box, select the workload you want to host on the volume. Select General purpose file server for general data files or Virtual Desktop Infrastructure (VDI) server when configuring storage for running virtual machines.
6. Enter the number of days that should elapse from the date of file creation until files are deduplicated, enter the extensions of any file types that should not be deduplicated, and then click Add to browse to any folders with files that should not be deduplicated.
7. Click Apply to apply these settings and return to the Server Manager dashboard, or click the Set Deduplication Schedule button to continue to set up a schedule for deduplication.
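The exclusion and schedule settings from steps 6 and 7 can also be scripted. The following is a minimal sketch, assuming the feature is installed and deduplication is already enabled on volume E:. The extensions, the folder path, and the schedule name are hypothetical values chosen for illustration.

```powershell
# Exclude file types (extensions are given without a leading period)
# and a folder from deduplication on volume E:
Set-DedupVolume -Volume 'E:' -ExcludeFileType 'tmp','bak' -ExcludeFolder 'E:\Scratch'

# Create a weekday optimization schedule that starts at 23:00
# and runs for at most 6 hours (the schedule name is hypothetical)
New-DedupSchedule -Name 'NightlyOptimization' -Type Optimization `
    -Days Monday,Tuesday,Wednesday,Thursday,Friday `
    -Start '23:00' -DurationHours 6

# List the schedules now defined on the server
Get-DedupSchedule
```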
Install and Configure Data Deduplication using Windows PowerShell
Start Windows PowerShell as an administrator: right-click the Windows PowerShell icon on the taskbar, and then click Run as Administrator. Then run the following commands.
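# Install the Data Deduplication feature
Import-Module ServerManager
Add-WindowsFeature -Name FS-Data-Deduplication
# Enable deduplication on E: with ONE usage type: HyperV for VDI storage, or Default for a general purpose file server
Enable-DedupVolume E: -UsageType HyperV
Enable-DedupVolume E: -UsageType Default
# Deduplicate only files that are at least 20 days old
Set-DedupVolume E: -MinimumFileAgeDays 20
# Review the deduplication settings of all enabled volumes
Get-DedupVolume | Format-List
# Start an optimization job on E: and wait for it to finish
Start-DedupJob E: -Type Optimization -Wait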
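After an optimization job has run, the results can be checked from the same session. This is a short sketch that assumes deduplication is enabled on volume E:. The properties selected below are a subset of what Get-DedupStatus returns.

```powershell
# Show any deduplication jobs that are still queued or running
Get-DedupJob

# Report space savings and file counts for volume E:
Get-DedupStatus -Volume 'E:' |
    Format-List Volume, FreeSpace, SavedSpace, OptimizedFilesCount, InPolicyFilesCount
```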