Storing Archival Data - Part Deux

Storing Archival Data - Part Deux: Page 4 of 5

Now that you've decided to build a real archive, you need to figure out where, both physically and technically, you're going to keep it. Archives are data Roach Motels -- data goes in but doesn't check out for a long time, which means it will outlast the 5-7 year useful life of most disk systems. Archive systems need to ensure data integrity beyond vendors' end-of-life declarations.

Howard Marks

May 27, 2009

Data integrity assurance goes hand in hand with retention enforcement. Retrieving a document from the archive only to discover that it's corrupted, and that the critical paragraph proving the company followed all the rules and the CEO shouldn't be wearing an orange jumpsuit is now gibberish, would be bad. Data objects should be hashed going into the archival store, and the storage system should check data against these hashes periodically and on retrieval. If the hashes don't match, the system should retrieve another copy.

That, of course, implies the system should store multiple independent copies, preferably in multiple locations. This can be done through data scatter-and-gather technology like Cleversafe's or through simple replication between multiple systems. Policies should allow admins to specify "keep x copies in each of y locations."
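
To make the hash-and-verify idea concrete, here is a minimal sketch in Python. It is only an illustration, not how any particular archive product works: the `Archive` and `Replica` classes, the choice of SHA-256, and the in-memory dictionaries standing in for storage systems at different sites are all assumptions made for the example. It records a hash at ingest, checks it on retrieval, falls back to another copy on a mismatch, and offers a periodic scrub.

```python
import hashlib
from dataclasses import dataclass, field


def digest(data: bytes) -> str:
    """Return the SHA-256 hex digest used as the object's fingerprint."""
    return hashlib.sha256(data).hexdigest()


@dataclass
class Replica:
    """One independent copy store (hypothetical stand-in for a site or system)."""
    location: str
    objects: dict = field(default_factory=dict)


class Archive:
    """Toy archive: hash on ingest, verify on retrieval and on periodic scrub."""

    def __init__(self, replicas):
        self.replicas = list(replicas)
        # object id -> digest recorded at ingest. A real system would have to
        # protect this index as carefully as the data itself.
        self.hashes = {}

    def put(self, object_id: str, data: bytes) -> None:
        """Hash the object going in, then write one copy to every replica."""
        self.hashes[object_id] = digest(data)
        for replica in self.replicas:
            replica.objects[object_id] = data

    def get(self, object_id: str) -> bytes:
        """Return the first copy whose hash matches the one recorded at ingest."""
        expected = self.hashes[object_id]
        for replica in self.replicas:
            data = replica.objects.get(object_id)
            if data is not None and digest(data) == expected:
                return data
        raise IOError(f"no intact copy of {object_id!r} in any replica")

    def scrub(self):
        """Periodic check: list (location, object_id) pairs that fail verification."""
        damaged = []
        for replica in self.replicas:
            for object_id, expected in self.hashes.items():
                data = replica.objects.get(object_id)
                if data is None or digest(data) != expected:
                    damaged.append((replica.location, object_id))
        return damaged
```

With two replicas, corrupting the copy at one site leaves `get` returning the intact copy from the other, while `scrub` reports the damaged one so it can be repaired. The sketch keeps exactly one copy per replica; an "x copies in each of y locations" policy would need a placement layer on top of it.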

Archives are data Roach Motels -- data goes in but it doesn't check out for a long time. While SarbOx and other general business regulations require 5 or so years of data retention, HIPAA and OSHA regulations require data be retained for 30 years or more under some conditions. Since the volume of data in an archive 20 years from now isn't something you can predict, the system has to be extremely scalable. Just supporting 1,000 hard drives in many shelves on a small processor cluster, as most NASes do, isn't enough. This scalability can be provided with removable storage or a RAIN (redundant array of independent nodes) architecture, where many processing and storage nodes combine to create a single storage cloud.

