
A Data De-Duplication Survival Guide: Part 1

General-purpose data de-duplication systems typically have (or should have) the ability to perform inline data de-duplication, since that is generally the most efficient approach. Ideally, the system should also use variable-length segment identification, which yields the most aggressive de-duplication effect: it can identify and store only the changed segments within, say, a database, rather than storing the entire file anew on each backup.
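
To make variable-length segment identification concrete, here is a minimal Python sketch of content-defined chunking: a segment boundary is cut wherever a simple rolling hash of the data hits a fixed bit pattern, so an insertion near the start of a file only shifts the segments around the change rather than every fixed-size block that follows. The hash parameters, size limits, and function name below are illustrative assumptions, not any vendor's actual algorithm.

```python
import hashlib

BASE = 257                         # rolling-hash multiplier
MOD = 1 << 32                      # keep the hash in 32 bits
WINDOW = 48                        # bytes covered by the rolling hash
MASK = 0x1FFF                      # boundary when the low 13 bits are zero (~8 KB average)
MIN_SEG, MAX_SEG = 2 * 1024, 64 * 1024
BASE_POW = pow(BASE, WINDOW, MOD)  # lets the oldest byte roll out in O(1)

def variable_length_segments(data: bytes):
    """Yield (sha256_fingerprint, segment_bytes) using content-defined boundaries."""
    start = 0                      # first byte of the current segment
    rolling = 0                    # hash of the most recent WINDOW bytes
    for i, byte in enumerate(data):
        rolling = (rolling * BASE + byte) % MOD
        if i - start >= WINDOW:
            # drop the byte that just slid out of the window
            rolling = (rolling - data[i - WINDOW] * BASE_POW) % MOD
        seg_len = i - start + 1
        at_boundary = seg_len >= MIN_SEG and (rolling & MASK) == 0
        if at_boundary or seg_len >= MAX_SEG or i == len(data) - 1:
            segment = data[start:i + 1]
            yield hashlib.sha256(segment).hexdigest(), segment
            start, rolling = i + 1, 0
```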

Lastly, general-purpose data de-duplication systems that include replication provide the optimal way to replicate backup data to remote sites. Because the data is already de-duplicated, only the net-new segments need to travel across the network.
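
The replication side then reduces to a fingerprint lookup. The rough sketch below assumes the variable_length_segments helper from the previous example and uses an in-memory set to stand in for the remote site's fingerprint catalog; a real system would keep that catalog persistent on both sides and actually transmit the data.

```python
def replicate_backup(segments, remote_index, send):
    """Replicate a backup stream, transmitting only segments whose
    fingerprints the remote site does not already hold."""
    sent_bytes = skipped_bytes = 0
    for fingerprint, segment in segments:
        if fingerprint in remote_index:
            skipped_bytes += len(segment)   # remote rebuilds this from its own store
        else:
            send(fingerprint, segment)      # only net-new data crosses the network
            remote_index.add(fingerprint)
            sent_bytes += len(segment)
    return sent_bytes, skipped_bytes

# A second run of an unchanged backup sends nothing over the wire.
remote_index = set()
payload = b"example backup data " * 10_000   # stand-in backup stream
transmit = lambda fp, seg: None               # placeholder for the network call
replicate_backup(variable_length_segments(payload), remote_index, transmit)
sent, _ = replicate_backup(variable_length_segments(payload), remote_index, transmit)
assert sent == 0
```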

The most efficient systems perform de-duplicated replication inline and across multiple sites; so far, Data Domain fits the bill. Inline de-duplication also lets replication begin the moment the system starts receiving data. VTL systems, by contrast, typically use post-process data de-duplication and therefore incur a delay before replication can start, leaving the disaster recovery data at risk in the meantime.


VTL solutions

Suppliers of VTL solutions, such as FalconStor (which supplies EMC and Sun), NetApp, and Sepaton, typically qualify a range of backup applications, but they are not neutral in terms of data source or target.