Did I Repeat Myself? Did I Repeat Myself?
By blueprints on Mar 14, 2011
There are many aspects to optimizing storage utilization. We usually think in terms of compression: packing the bits into the minimal space. However, have you ever considered how often we save the same data multiple times? Like that amusing picture that everyone in the office saves a personal copy of. It all adds up.
Deduplication – one of those geeky terms that is efficiently self-descriptive – solves the problem by removing duplicated data. Frequent contributor Jeff Wright gives us the lowdown in Sun ZFS Storage Appliance Deduplication Design and Implementation Guidelines. Approaches to deduplication vary in both when and how: the when can be synchronous or asynchronous, the how can be block or file level.
The data deduplication feature provided in the Sun ZFS Storage Appliance is available with Software Release 2010.Q1. This feature is implemented to provide synchronous block-level deduplication and is designed to be applicable to any data stored on the appliance. Jeff's article provides practical application and performance guidelines, along with a list of known issues and limitations.
The Sun ZFS Storage Appliance is one powerful and nifty device. You can tell from the number of interesting articles we are publishing on it that there is a lot under the hood.