George Crump


Upcoming Events

Cloud Connect
Santa Clara
Feb 13-16, 2012

Cloud Connect brings together the entire cloud eco-system to better understand the transformation we're experiencing and promises to be the defining event of the cloud computing industry. Learn about the latest cloud technologies and platforms from thought leaders in Cloud Connect’s comprehensive conference.

Register Now!

More Events »

Subscribe to Newsletter

  • Keep up with all of the latest news and analysis on the fast-moving IT industry with Network Computing newsletters.
Sign Up

Deduplication's Replication Mode

According to every deduplication supplier that I talk to, replication has a high attach rate for deduplication products. In most cases over 50 percent of their systems are sold with the replication module or capabilities enabled. Over the next couple of entries I'll review some of the specific vendor's claims and name names as it relates to replication. If your in the dedupe space and I have not spoke to you, please reach out to me so I can include you in the conversation.

While moving backup jobs to a remote site electronically is a key capability for deduplication products, it should not be your sole method of DR. It's important to keep in mind that the data in the remote site is in a backup format and needs to be recovered to DR servers to be of value. The time it takes to move this data from the disk deduplication device to the production server will still take time. That time may push you outside of your recovery service level agreement. For many data centers, having a data set that goes off-site in an inexpensive fashion, a few hours after local backup is complete may be all they can afford and may still represent a huge improvement in recoverability.

There is one exception to the recovery first problem: server virtualization. Since some of the appliance based devices present themselves as disk targets via CIFS or NFS, you could mount server images via NFS at the DR site and be back in production. None of the appliance based systems bill themselves as primary storage, so the intent would be to use a capability like VMware Storage VMotion to move those images quickly to production storage. This concept is worth an article all by itself and something I will dive into later.

While some of the deduplication vendors that I spoke with are relatively new to providing replication capabilities to their solutions, all of them seem to have something. Some of the deduplication providers are delivering replication via a basic file system replication technique. Basically they are leveraging the fact that deduplication only writes unique blocks and they are using file system replication to identify those writes and then replicate them across the wire. While this certainly works from a "point A to point B" perspective, it does cause some problems when you are trying to do a many to one or cascaded type of replication.

Also how the vendor does deduplication, the old inline vs. post processing debate, will affect how the replication mode works. Most vendors will agree that both methods have their strong points and weak points. It's how they take advantage of the strengths and design around the weaknesses that matters. For example, when it comes to replication, an inline system or even an adaptive inline system should be able to replicate data either as data is written to the device or as the specific backup stream to end and provide a file closure. In typical post process data deduplication, the entire backup has to complete before deduplication occurs. Replication then occurs as unique blocks are identified and written to disk.


Page:  1 | 2 |Next Page »

Related Reading


More storage-networking-management Insights



Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Network Computing encourages readers to engage in spirited, healthy debate, including taking us to task. However, Network Computing moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. Network Computing further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
 

Research and Reports

Hypervisor Derby
August 2011

Network Computing: August 2011

TechWeb Careers