Howard Marks

Network Computing Blogger


Upcoming Events

Where the Cloud Touches Down: Simplifying Data Center Infrastructure Management

Thursday, July 25, 2013
10:00 AM PT/1:00 PM ET

In most data centers, DCIM rests on a shaky foundation of manual record keeping and scattered documentation. OpManager replaces data center documentation with a single repository for data, QRCodes for asset tracking, accurate 3D mapping of asset locations, and a configuration management database (CMDB). In this webcast, sponsored by ManageEngine, you will see how a real-world datacenter mapping stored in racktables gets imported into OpManager, which then provides a 3D visualization of where assets actually are. You'll also see how the QR Code generator helps you make the link between real assets and the monitoring world, and how the layered CMDB provides a single point of view for all your configuration data.

Register Now!

A Network Computing Webinar:
SDN First Steps

Thursday, August 8, 2013
11:00 AM PT / 2:00 PM ET

This webinar will help attendees understand the overall concept of SDN and its benefits, describe the different conceptual approaches to SDN, and examine the various technologies, both proprietary and open source, that are emerging. It will also help users decide whether SDN makes sense in their environment, and outline the first steps IT can take for testing SDN technologies.

Register Now!

More Events »

Subscribe to Newsletter

  • Keep up with all of the latest news and analysis on the fast-moving IT industry with Network Computing newsletters.
Sign Up

See more from this blogger

Server SSD Isn't Necessarily PCIe

As I talk with both users and vendors researching an upcoming report on server-side SSD use, I've discovered that many folks think a server SSD has to be the kind of ultra-high-performance SLC PCIe flash card made by vendors like Fusion-IO, Micron, Virident and LSI. While people are perfectly aware that the majority of SSDs sold are 2 1/2-inch form-factor devices with SAS or SATA and interfaces, they seem to think these devices are suitable only for laptops and enthusiasts' desktops.

Sure, most of the SATA SSDs on the market aren't up to the rigors of data center use, but there are several steps between the kind of low-cost SSDs they sell at Akbar and Jeff's Computer Hut and the PCIe cards the go-fast-any-cost guys produce. Most folks are just terrified that they'll put an SSD in their disk arrays or servers, and at some point they'll exhaust its endurance and lose all their data.

More Insights

Webcasts

More >>

White Papers

More >>

Reports

More >>

I think part of the problem is it's been a long time since we've had devices in the data center that actually wear out. Today's sealed hard drives don't wear out in any predictable way--they just fail at random--and most storage systems treat disk drives as binary devices that are either working properly or not working at all. So when a drive has a head pre-amplifier failure, the RAID controller it's connected to declares the drive bad and stops using it.

SSD write exhaustion isn't an unpredictable failure, but it is a relatively well-understood process whereby the flash in an SSD has been programmed and erased enough times that the error array from that flash begins to exceed the ability of the flash controller to correct the error. This exhaustion doesn't happen to the whole SSD all at once but flash page by flash page; it has less working flash to use for housekeeping wear leveling and the like. Eventually there's no spare flash at all, and the SSD can no longer accept writes.

Most flash controllers keep careful track of how often they've overwritten each page of flash that they manage and can report back to the storage system how much of their write endurance has been consumed via extensions to the smart diagnostic system. If I can know weeks or months before that this flash exhaustion is going to cause me problems, it should be a simple enough matter to replace an SSD when it's reached 80% or 90% of its life.

RAID controllers could send a message that SSD 14 has reached its endurance threshold, identify a new or spare SSD and rebuild the RAID set to the new storage. If the SSD was being used as a read cache, it wouldn't contain unique data and the replacement could be even easier.

Since MLC SSDs (which may have write endurance of only 5,000 program erase cycles) typically cost about a tenth as much as eMLC or SLC SSDs, it may make sense to treat MLC SSDs as disposable devices for the data center. While we might not be able to predict whether an MLC SSD will last 18 months or five years, it would still be cheaper to buy three MLC SSDs and replace them as needed then to buy one substantially more expensive SLC SSD.

Of course, this is just not the way people buy equipment for the data center--we buy gear based on some projection of future peak need, multiplied by whatever factor we feel may be necessary to prevent a run on the storage bank. Personally, I like to multiply by pi. So if a project may need 10 Tbytes of storage, we make sure the budget for that project includes 30 Tbytes of storage just so we don't get caught short at some point in the future.

Sure, introducing the concept of disposable devices to the data centers may make more work for the poor guy left to swap out a dying server SSD. But until we have the miracle nonvolatile memory of the future, it may be a way to get performance without a huge capital expenditure.

Of course, some of you may be concerned about people recovering sensitive corporate data from your discarded SSDs. While that's a real concern, it can be easily addressed by simply running your old worn-out SSDs through a Blendtec blender. If it can reduce an iPad to dust, it can do the same to a Micron P400 RealSSD.


Related Reading


Network Computing encourages readers to engage in spirited, healthy debate, including taking us to task. However, Network Computing moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. Network Computing further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | Please read our commenting policy.
 
Vendor Comparisons
Network Computing’s Vendor Comparisons provide extensive details on products and services, including downloadable feature matrices. Our categories include:

Research and Reports

Network Computing: April 2013



TechWeb Careers