Managing deduplication-enabled storage pools

You can create a storage pool for data deduplication, or you can update an existing storage pool to enable data deduplication. If you are implementing server-side data deduplication, Tivoli® Storage Manager provides the option of running duplicate-identification processes automatically or manually.

Before you begin

Before you set up a storage pool:

About this task

You can create a storage pool for data deduplication or update an existing storage pool for data deduplication. You can store client-side deduplicated data and server-side deduplicated data in the same storage pool.

Procedure

To set up a storage pool for data deduplication, complete the following steps:

Results

As data is stored in the pool, the duplicates are identified. When the reclamation threshold for the storage pool is reached, reclamation begins, and the space that is occupied by duplicate data is reclaimed.

In the storage pool definition, you can specify up to 50 duplicate-identification processes to start automatically. However, the number of duplicate-identification processes must not exceed the number of processor cores available on the Tivoli Storage Manager server. If you do not specify any duplicate-identification processes in the storage pool definition, you must control data deduplication manually. Because duplicate identification requires extra disk I/O and processor resources, you can manually increase or decrease the number of duplicate-identification processes, and change how long they run, to limit the effect on server workload.
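
The two limits on the number of duplicate-identification processes (no more than 50, and no more than the number of processor cores on the server) can be checked before you choose a value. The following Python sketch is an illustration only and is not part of Tivoli Storage Manager. The function name identify_process_count and the constant MAX_IDENTIFY_PROCESSES are hypothetical, and the sketch assumes that you supply the core count of the server; if you omit it, the core count of the machine that runs the script is used, which might not match the server.

```python
import os

# Upper limit for automatically started processes, as stated in the text.
MAX_IDENTIFY_PROCESSES = 50


def identify_process_count(requested, cores=None):
    """Return the requested process count, capped by both documented limits.

    requested: number of duplicate-identification processes you would like.
    cores:     processor cores on the server (assumption: supplied by the
               caller; defaults to the local machine's logical CPU count).
    """
    if requested < 0:
        raise ValueError("requested must be zero or greater")
    if cores is None:
        # os.cpu_count() can return None, so fall back to 1.
        cores = os.cpu_count() or 1
    return min(requested, MAX_IDENTIFY_PROCESSES, cores)


if __name__ == "__main__":
    print(identify_process_count(64, cores=8))    # capped by the core count
    print(identify_process_count(64, cores=128))  # capped by the limit of 50
    print(identify_process_count(0, cores=16))    # 0: control deduplication manually
```

A result of 0 corresponds to the case in which no processes start automatically and you control duplicate identification manually.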

Attention: By default, the Tivoli Storage Manager server requires that you back up deduplication-enabled primary storage pools before volumes in the storage pool are reclaimed and before duplicate data is discarded.