Special Coverage Series

Network Computing

Commentary

Amy Arnold, Network Computing Blogger

Mastering The Art Of Troubleshooting

From social media to blogs, there's a ton of information engineers can tap to solve technical problems. Here's what I find works best.

In IT, a lack of resources isn't usually the problem when troubleshooting. Instead, the challenge is figuring out an efficient way to comb through a wealth of information--including blogs and social media--to find the one obscure bit that resolves your problem.

Mastering the many sources of information available to you as an engineer will drastically decrease your troubleshooting session times and boost your well-earned network ninja reputation.

My first line of defense when troubleshooting is reading the manual or release notes. It kills me how often engineers choose to forgo this simple exercise, and the results are often spectacular, but not in a good way. I still have scars from some of my reckless attempts to just wing it--a painful education, indeed.

Oftentimes, however, IT professionals are handed a giant stack of network gear and a deadline decreed by the Mayor of Crazy Town. Reading all 1,100-plus pages of the latest deployment guide just isn’t going to happen. This is where I find that blogs, social media and vendor technical assistance work a special kind of magic.

Technical blogs are manna from heaven for so many of the setup and troubleshooting tasks we come across in the daily grind. Very likely, whatever issue you are facing, some other engineer has faced it, too, and has blogged about it. This is one of the primary reasons I blog, and if you are an engineer, you should, too. Knowing which blogs to reference is a balance of strong search engine skills and excellent contacts in my field. Many times these blogs include a blog roll, which leads to more great resources.

Social media plays a huge role in my troubleshooting process. Being extremely active with other network engineers on Twitter means I have a constant stream of relevant content and resources at my fingertips. I get answers to head-scratching questions in minutes, not hours or days. I often make note of bugs and incidents that other engineers discuss or complain about, which can serve as a fabulous source of forewarning on upcoming projects.

A word of caution if you use social media as a troubleshooting resource: These platforms are not designed for lazy types. While convention doesn’t mind if you throw out a couple of easy-to-find-on-Google questions every so often, don’t be that guy who clearly hasn’t taken the time to do any personal research before crowdsourcing a question. Nobody likes that guy.

One resource underutilized by a number of engineers is vendor technical assistance, commonly referred to as TAC. I’m not sure if it is pride that keeps engineers away or bad past experiences with certain vendors, but TAC is one resource that has been bought and paid for, so you might as well use it. Many times, a quick call with TAC can get you that bit of configuration info your feature requires but that the documentation glosses over or leaves out. Other times, vendor technicians can quickly spot the typos that you can’t see because you’ve been staring at the same lines of configuration for days.

Yes, you take a chance that you might draw the short straw and get a crummy engineer, but in my experience you are just as likely to get a solution to your problem from someone who knows the tricks the documentation doesn’t reveal. Some advice, though: Never be afraid to use terms like escalate, re-queue and your-manager-please when you are not getting ideal results from TAC.

Even as you master the many sources of information available for troubleshooting, never forget to build your foundational knowledge. Knowing what tool to utilize and when depends very much on your discernment and skill as an engineer, which comes from understanding the concepts at work. There’s still no shortcut for that, and I highly doubt one is coming anytime soon.

