George Crump


Upcoming Events

Cloud Connect
Santa Clara
Feb 13-16, 2012

Cloud Connect brings together the entire cloud eco-system to better understand the transformation we're experiencing and promises to be the defining event of the cloud computing industry. Learn about the latest cloud technologies and platforms from thought leaders in Cloud Connect’s comprehensive conference.

Register Now!

More Events »

Subscribe to Newsletter

  • Keep up with all of the latest news and analysis on the fast-moving IT industry with Network Computing newsletters.
Sign Up

Deduplicating Elsewhere

Deduplication technology discussions usually center on deduplicating the backup target. That makes sense, as this is where the biggest payoff is for the technology. Increasingly, the discussion is moving more to using deduplication as a part of archive disk or primary storage. Deduplication, however, is also branching out beyond standard disk, and there are areas to consider whether applying the technology is worth the investment.

Solid State Disk (SSD) could hold the most promise. It's more expensive than its mechanical drive counterparts, but it is also substantially faster. Clearly, adding a deduplication/compression capability to an SSD system will impact its performance. However, many -- maybe even most --  environments don't need the full performance boost that an SSD can provide. If an SSD system takes even a 25 percent performance hit, it would still be substantially faster than many mechanical systems, and if in doing so you doubled the capacity of the SSD, you effectively cut the cost of the technology in half. It's not an isolated pocket of data centers that this rational applies to. Many need a performance bump beyond what their mechanical drives can deliver but don't need the full performance boost of standard SSD. For these environments, an SSD with deduplication and compression may be the perfect solution.

Tape as a deduplication target may seem a little odd. In essence, it is the opposite of SSD. We move from very high performance, limited capacity but expensive media to medium performance, high capacity and inexpensive media. The justification is simply the sheer quantity of that media. In environments where the number of tapes to manage is in the thousands, reducing that by a factor of 10 or 20 percent could represent a substantial cost savings. This would not be a savings only in the cost of the actual media, but in the cost of storing that media off-site, as well as the reduced cost in returning those tapes to the data center when needed. The concept of deduplicating to tape makes many admins nervous, and I think the jury is still out on whether on not this makes sense. You need to really weigh the potential cost savings vs. any potential risk associated with using the technology on tape.

Cloud storage is another area where deduplication will gain traction. Technically, cloud storage is still storage, but as we discuss in our article Cloud Storage Deduplication, it's storage with a bandwidth cost associated with it, and it is storage that is often billed at per GB used. The more you save with both, the better off you are. The storage savings can be relatively simple and can be done entirely at the destination side, but if that technology can be moved to the source data center and kept in deduplicated format, the value increases.

Many cloud services are using some form of a on-premise cache as a gateway to the cloud. Gaining better storage efficiency with the on-premise gateway allows for more data to be kept in the local cache. If that data can be kept in its compressed and deduplicated form, it is going to use significantly less bandwidth. This is important because some vendors charge for bandwidth used as well as storage used. In either case moving less data across a slow segment is always a good thing. Deduplication technology will continue to be used in more than just the traditional storage tiers that we think of today. In any situation where you have a relatively high cost of capacity or a relatively low availability of that capacity, there may be some justification for the implementation of the technology.

Related Reading


More deduplication Insights



Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Network Computing encourages readers to engage in spirited, healthy debate, including taking us to task. However, Network Computing moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. Network Computing further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
 

Data Deduplication Reports

Research and Reports

Hypervisor Derby
August 2011

Network Computing: August 2011

TechWeb Careers