George Crump


Upcoming Events

Cloud Connect
Santa Clara
Feb 13-16, 2012

Cloud Connect brings together the entire cloud eco-system to better understand the transformation we're experiencing and promises to be the defining event of the cloud computing industry. Learn about the latest cloud technologies and platforms from thought leaders in Cloud Connect’s comprehensive conference.

Register Now!

More Events »

Subscribe to Newsletter

  • Keep up with all of the latest news and analysis on the fast-moving IT industry with Network Computing newsletters.
Sign Up

How To Avoid #fail In Storage

In the odd world that is Twitter, #fail is a tag you put on your tweet when something goes wrong in your life, at your job or when flying your least favorite airline. What do you do to avoid #fail in your storage infrastructure? The most important thing you can do when dealing with storage failure is to make sure you are prepared for something to go wrong before it ever happens.

You're in IT. It is not a matter of if something will fail it is a matter of when it will fail. The number one thing that you can do to make sure you are prepared for a failure is to know what you have in that infrastructure. Whether you try to fix the problem yourself, or if you bring in an expert, the first thing that people are going to ask for is an inventory of what you have so diagnosis can begin.

An inventory is not the latest copy of your data center diagram that you have spent hours on. While a good start, this really does not give the details that someone is going to need to begin diagnosing the problem. What is needed is a detailed configuration of every HBA, switch port, inter-switch link (ISL), how the storage ports are configured and of course how the storage itself is configured.

It is also best if this information is captured frequently, preferably in real time by some sort of analysis tool (in other words, not in a spreadsheet). Spreadsheets are not IT diagnostic tools. We've seen troubleshooting projects where the inventory spreadsheet was more than six months old and not updated since before the server virtualization project was started. Things had changed. Candidly, if your inventory is more than a few weeks old, especially in a virtualized environment, you probably shouldn't bother having one. A re-inventory is going to have to be performed, so you are better off just budgeting for that every time a problem arises in the environment. The value of real time capture is it can provide clues of what was changing in the environment in the time leading up to the failure event. Those changes can often provide a clue to what went wrong. Often these tools can capture physical errors being logged by the system which again can provide some insight into what went wrong. Most importantly though, real time capture can help you prevent a #fail before it ever happens.

The problem with most infrastructure hardware, storage hardware and their software components is not that they don't provide enough diagnostic information, but that they provide too much, and as a result, the important information is lost in the shuffle. What these tools can do is highlight when a message really needs your attention or when a combination of slightly related messages are indicative of a failure. There is plenty more to do beyond developing an accurate inventory to help get through a storage failure, but knowing what you have is a critical first step.

Related Reading


More backup-recovery Insights



Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Network Computing encourages readers to engage in spirited, healthy debate, including taking us to task. However, Network Computing moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. Network Computing further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
 

Research and Reports

Hypervisor Derby
August 2011

Network Computing: August 2011

TechWeb Careers