IBM ProtecTIER - Plotting Strategy for the Data Deduplication Wars

Data deduplication is one of those rare opportunities where the economic and technological benefits are well-recognized so it should come as no surprise that vendors are moving troops into this market as quickly as they can.

David Hill

August 10, 2009

Network Computing

The acquisition battle over Data Domain made business headlines for a number of weeks. Its culmination in EMC's successful bid signifies that, while this particular skirmish is over, the data deduplication wars are going to heat up even more. In this difficult economic climate, making a powerful economic case for enterprises to actually spend money is challenging at best. Data deduplication is one of those rare opportunities where the economic and technological benefits are well recognized, so it should come as no surprise that vendors are moving troops into this market as quickly as they can.

Note that EMC's acquisition of Data Domain is by no means the first acquisition of a data deduplication company by an information infrastructure vendor, nor is it likely to be the last. Recall that IBM bought Diligent Technologies, one of the leading companies in the data deduplication space, well over a year ago. IBM has now announced new capabilities for its TS7650 ProtecTIER gateway and appliance family, which uses data deduplication to support virtual tape library (VTL) technology. The announcement had been planned for some time, so its timing relative to EMC's Data Domain acquisition news is simply coincidental.

A core use of data deduplication technology has been in conjunction with disk-to-disk backup using a VTL. Without deduplication, storing multiple full backups on disk is not economically feasible, so older copies of backups have to be kept on tape. Although most recoveries are from data stored recently, there are occasions when older data has to be recovered -- and doing that from tape could be very time-consuming. Eliminating redundant data on disk through data deduplication means that older backup data can be stored economically on disk, which also simplifies recovering that older data should that prove to be necessary.
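To make the economics concrete, here is a minimal sketch of why successive full backups shrink so much once redundant data is eliminated. It assumes a generic scheme of fixed-size chunks identified by SHA-256 digests; that scheme, the chunk size, the function names and the sample data are all illustrative assumptions, and this is not a description of how ProtecTIER itself finds duplicates.

```python
import hashlib

CHUNK_SIZE = 4096  # assumed fixed chunk size, chosen only for illustration


def store_backup(data: bytes, chunk_store: dict) -> list:
    """Store one backup; return its 'recipe' (ordered list of chunk digests)."""
    recipe = []
    for offset in range(0, len(data), CHUNK_SIZE):
        chunk = data[offset:offset + CHUNK_SIZE]
        digest = hashlib.sha256(chunk).hexdigest()
        # Keep the chunk only if an identical one is not already on disk.
        chunk_store.setdefault(digest, chunk)
        recipe.append(digest)
    return recipe


def restore_backup(recipe: list, chunk_store: dict) -> bytes:
    """Rebuild a backup by looking up each chunk named in its recipe."""
    return b"".join(chunk_store[digest] for digest in recipe)


def stored_bytes(chunk_store: dict) -> int:
    """Total bytes actually held on disk after deduplication."""
    return sum(len(chunk) for chunk in chunk_store.values())


# Hypothetical data: two nightly full backups that differ in a single chunk.
monday = b"".join(bytes([i]) * CHUNK_SIZE for i in range(10))
tuesday = monday[:-CHUNK_SIZE] + bytes([99]) * CHUNK_SIZE

store = {}
recipes = [store_backup(monday, store), store_backup(tuesday, store)]
print(len(monday) + len(tuesday), "bytes backed up,",
      stored_bytes(store), "bytes stored")
```

In this toy case the second full backup adds only the one chunk that changed, yet either night can still be restored in full from disk. Real backup streams and real products behave far less tidily, so the ratio here should not be read as a prediction.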

However, that benefit has tended to be at the local level. If data is needed at a remote site for disaster recovery (DR) purposes, the backup data on disk is first written to a tape library at the local site. The backup tapes are "exported" (i.e., physically removed) from the tape library and then physically transported (typically by truck) to the DR site. Moving data this way involves a transportation cost, security issues (such as lost or stolen tapes), and time (say 24 hours when all elements of the transportation process are taken into account).

This transportation process is called vaulting. Electronic vaulting replaces the physical transport of tapes by sending the data electronically from disk at the local site to disk at the DR site. This speeds up the process and improves both security and reliability. In addition, recoverability planning is a lot easier. The problem is that the high bandwidth needed to transfer all the data tends to be expensive. Enter data deduplication, which requires significantly less bandwidth to transfer all that backup data, and, lo and behold, electronic vaulting is now economically viable as well as managerially attractive.

With this latest announcement, IBM joins the crowd providing electronic vaulting capabilities with what it calls the ProtecTIER Native Replication solution. This is a functional enhancement that creates an IP-based connection between ProtecTIER servers/clusters (ProtecTIER obviously has to be at both the local and the DR site). All new IBM gateways and appliances will have this feature, but existing products will require a software upgrade, as well as a second NIC, to make them "replication ready." Native replication is an optional feature and must be purchased and licensed before use.
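The bandwidth argument for electronic vaulting can be checked with a little arithmetic. The sketch below estimates how long a nightly backup takes to cross a WAN link with and without deduplication. The backup size, link speed and 20:1 reduction ratio are hypothetical figures picked for illustration, not numbers from IBM's announcement, and the function name is likewise an assumption.

```python
def transfer_hours(data_gb: float, link_mbps: float, dedup_ratio: float = 1.0) -> float:
    """Hours needed to send data_gb (decimal gigabytes) over a link of
    link_mbps (megabits per second) after reducing the data by dedup_ratio.

    Ignores protocol overhead, compression and link contention.
    """
    if data_gb < 0 or link_mbps <= 0 or dedup_ratio < 1:
        raise ValueError("need data_gb >= 0, link_mbps > 0, dedup_ratio >= 1")
    bits_to_send = data_gb * 8e9 / dedup_ratio
    seconds = bits_to_send / (link_mbps * 1e6)
    return seconds / 3600


# Hypothetical site: 1,000 GB of nightly backup data and a 100 Mbps link.
full_copy = transfer_hours(1000, 100)
deduplicated = transfer_hours(1000, 100, dedup_ratio=20)
print(f"without deduplication: {full_copy:.1f} hours")
print(f"with 20:1 reduction:   {deduplicated:.1f} hours")
```

Under these assumptions the full copy needs roughly 22 hours, about as long as the truck, while the deduplicated stream fits in a little over an hour on the same link. The alternative is to keep the transfer window and buy a much smaller link, which is where the cost saving comes from.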

IBM offers a lot of capabilities in ProtecTIER around enabling the DR site to become the primary site during a disaster, so the solution is a lot more than just electronic vaulting. Included are a number of policy-based capabilities, such as managing and monitoring operations during a disaster (i.e., failover and failback).

In addition, frequent testing of DR plans is now feasible (since it is easier to test with ProtecTIER-managed disk at both the local and remote sites than to test with tape alone). This may not seem like a big deal, but it really is. The lack of DR testing due to cost or complexity exposes many an organization to significant risk while it remains under the delusion that it is adequately protected.

Mesabi Musings

An old cliché holds that a rising tide lifts all boats, and the data deduplication boats are all certainly on the rise. That is a good thing for vendors that are starved for a story that can get budget-conscious customers to loosen their purse strings, but it is also a good thing for IT organizations that are under the gun to do a better job despite tight budgets.

All in all, this announcement strengthens IBM's position as one of the leaders in the highly competitive data deduplication space. The company's enhancements to its already well-regarded ProtecTIER should be well received by its customers and channel partners.

Assisting with operational recovery through the standard use of data deduplication to eliminate data redundancy has long been a strong point for ProtecTIER. But significantly improving enterprise customers' disaster recovery capabilities, as well, should allow ProtecTIER to take on a welcome broader role in data protection.
 
