Special Coverage Series

Network Computing

Special Coverage Series


Virtual SANs: Scale Out's Next Iteration

Shared, block storage SANs have been the exclusive domain of big, centralized storage arrays, but distributed systems are about to crash the party.

Storage system design often seems as if it's stuck in the Big Iron era of mainframes, a consolidated world of enormous, expensive machines managed by a high priesthood of experts. Yet as computing has become virtualized and democratized, with distributed systems knitted into self-service clouds where every developer can create his own dedicated sandbox, mainframes have had to adapt.

Storage systems are following along, as scale-out designs using distributed object and file systems have become a popular technology for providing large, scalable storage pools over an Ethernet backbone. The concept was commercialized over a decade ago by pioneers like EqualLogic, Isilon and Spinnaker, which were acquired, by Dell, EMC and NetApp, respectively.

More Insights

Webcasts

More >>

White Papers

More >>

Reports

More >>

Yet useful as they are, and indeed I have long recommended scale-out systems as alternatives to traditional large, centralized storage frames, as both a more efficient and cost effective means of providing shared storage, they've lacked all-purpose adaptability because they can't provide block-level storage required for databases and DB-backed applications.

That's changing as the concepts underlying distributed, networked file systems collide with the virtual servers and cloud stacks. This fusion could ultimately threaten the hegemony of traditional SANs over mission-critical, transaction-oriented, back-office applications.

One of the first to marry a scale-out design with SAN versatility was Coraid, which emerged from a project in the Linux community to develop a native Ethernet storage protocol, ATA over Ethernet (AoE) that operated at Layer 2, thus eliminating the IP overhead plaguing iSCSI. But unlike FCoE, AoE, which as the name implies encapsulates the SATA command set within Ethernet frames, was made with simplicity in mind. It is much more efficient than FCoE, which shoehorns the Fibre Channel stack into a physical layer it wasn't originally designed for.

[ Join us at Interop Las Vegas for access to 125+ IT sessions and 300+ exhibiting companies. Register today! ]

Coraid, which has been shipping its EtherDrive storage nodes for several years, long struggled to gain much visibility outside niches in academia, government/military and hosting providers, but appears to be catching on as pan-virtualized, cloud infrastructure gradually seeps from early adopters to mainstream enterprise IT. In retrospect, it looks to be a case of a company (and technology) ahead of its time, coupled with plenty of competitor-incited FUD about a scary new storage protocol.

Coraid's scale-out virtual SAN comes in two pieces. The building blocks are its EtherDrive storage nodes. These are typical scale-out appliances featuring 16 to 36 drive bays sporting either mechanical disks or SDDs. They use AoE to present raw storage volumes as LUNs to any system running an AoE stack. (Drivers are available for Linux, Windows, OS X, Solaris, VMware and OpenBSD.) But in Coraid's first instantiation, each host was responsible for attaching to LUNs on different storage bricks; in other words, volumes couldn't natively span nodes. Instead, the host was responsible for setting up RAID stripes across multiple EtherDrives. It was still a nicely distributed system, in that volumes could withstand both multidisk and multinode failures because the storage nodes themselves were already RAID protected, but wasn't a fully virtualized SAN.

Coraid eliminated this shortcoming a couple of years ago with the VSX SAN virtualization appliance, a device that can create so-called macro LUNs that stripe across multiple nodes. It includes features de rigueur for any enterprise SAN, including synchronous mirroring, asynchronous remote replication, cloning, snapshots and thin provisioning. The final piece of the puzzle came when the company introduced management and automation software, EtherCloud, that simplifies the deployment to the point of allowing self-service storage provisioning for virtualized applications and can handle upwards of petabytes of pooled capacity.

Next page: Rethinking SANs

 1 | 2  | Next Page »


Related Reading



Network Computing encourages readers to engage in spirited, healthy debate, including taking us to task. However, Network Computing moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. Network Computing further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | Please read our commenting policy.
 

Editor's Choice

RESEARCH: 2013 Backup Technologies Survey

RESEARCH: 2013 Backup Technologies Survey

Think backups are boring? Not so, say more than 500 IT pros. Most, 60%, use two, three or even more different backup applications, and the percentage encrypting all media has jumped 15 points since 2011.
Get full survey results now! »

Digital Issue: The Standardization Debate

Digital Issue: The Standardization Debate

An IT infrastructure constructed from uniform blocks of hardware and software is easier to manage and secure, and new services can be rolled out fast. But giving business units carte blanche can deliver more flexibility, drive innovation and better meets employee needs. Two IT executives square off in this debate, and almost 400 survey respondents weigh in too.
Get the Digital Issue »

WEBCAST: Avoiding Downtime: How Virtualization Can Help In Times of Trouble

WEBCAST: Avoiding Downtime: How Virtualization Can Help In Times of Trouble

Server and storage virtualization can help keep systems alive even in the face of demand spikes, disasters and other troubles. Attend this webcast to learn how virtualization can maximize application availability, create business continuity options for critical apps, and improve disaster recovery.
Register Today »

Related Content

From Our Sponsor

Implementing Energy Efficient Data Centers

Implementing Energy Efficient Data Centers

Electrical power costs over the life of a data center may exceed the initial cost of the IT equipment. As described in this paper, recognizing the appropriate IT design architecture necessary and being able to quantify the potential electrical savings can significantly increase cost savings over time.

Creating Order from Chaos in Data Centers and Server Rooms

Creating Order from Chaos in Data Centers and Server Rooms

IT Professionals who are challenged with managing a chaotic data center - messy racks, sub-standard floor air distribution and cable sprawl - can now leverage innovative methods for dealing with and eliminating the root causes of disorder. This paper outlines the solutions available to help create an organized data center.

High-Efficiency AC Power Distribution for Green Data Centers

High-Efficiency AC Power Distribution for Green Data Centers

In order to create optimal electrical efficiency and simplified data centers, the use of 240 volt power distribution is highly recommended. This paper describes the various configurations for this distribution architecture as well as the quantified benefits. Note: Applicable to North America only.

Energy Efficient Cooling for Data Centers: A Close-Coupled Row Solution

Energy Efficient Cooling for Data Centers: A Close-Coupled Row Solution

The trend of increased heat densities in data centers has held consistent with advances in computing technology. As power density increased, the degree of difficulty in cooling these higher power loads was also increasing. This article discusses the efficiency benefits of row-based cooling compared to two other common cooling architectures.

Data Center Projects: Standardized Process

Data Center Projects: Standardized Process

As the design and deployment of data centers evolve into more complicated projects, the benefits of a standardized and predictable process are compelling. This paper presents an overview of a standardized, step-by-step process methodology that can be adapted and configured to suit individual requirements, thus reducing costs and eliminating waste.