Upcoming Events

Cloud Connect
Santa Clara
Feb 13-16, 2012

Cloud Connect brings together the entire cloud eco-system to better understand the transformation we're experiencing and promises to be the defining event of the cloud computing industry. Learn about the latest cloud technologies and platforms from thought leaders in Cloud Connect’s comprehensive conference.

Register Now!

More Events »

Subscribe to Newsletter

  • Keep up with all of the latest news and analysis on the fast-moving IT industry with Network Computing newsletters.
Sign Up
Workshop
W O R K S H O P  
Tuning Voice Over the WAN

  January 22, 2001
  By Dave Brown


After a recent demonstration of Voice over IP in our Real World Live Lab at the Networld + Interop show this fall in Atlanta, an attendee approached our test crew. "I'm being bombarded by marketeers who claim that VoIP is the only way to go," he said. "Some have set up demonstrations, but the listening experience isn't good enough to persuade my management to take the plunge. I'm afraid to abandon my good old PBXs and PSTN trunks."



He's not alone. While Cisco AVVID and other vendors are enthusiastically pushing voice over the network, businesses, by and large, are still resisting it. A recent Sage Research survey reports that only 6% of U.S. organizations have implemented VoIP. Another 24% are considering deployment within the next 12 months. But NONE of the rest have plans to even look at VoIP this year, because they view the technology as risky and unproven.

Both vendors and users have valid arguments for their respective positions. Using an enterprise WAN -- either IP, Frame Relay, ATM or DSL -- to bypass the public switched telephone network (PSTN) for voice calls has advantages, especially for international companies, because of the cost savings they can realize on overseas calls. (Toll cost savings may not be as great in the continental U.S. because big PSTN carriers can compete by dropping their prices for bulk-purchased long distance service.) Two other big reasons for gradually introducing packet voice on converged networks are the lower capital costs of equipment and simplified management. LAN PBXs are about 10% to 25% cheaper than a traditional PBX. And with VoIP, companies don't have expensive move-and-change costs when employees relocate offices. An IP phone simply re-registers the user with a gateway from its new location.

Still, companies have concerns about delay and jitter, which are two major characteristics of packet networks that you don't have to deal with on switched circuits. The good news is that if careful analysis and testing is invested throughout your network, you can minimize these effects.

In this workshop, we'll look at the likely sources of delay and jitter, and describe some of the "tuning" choices you can make to improve quality. We'll also look at a recently introduced hardware-based, end-to-end voice quality testing system, which we tested on a set of new IP telephones. A hardware-based approach provides a better, more objective way for you to confirm how well your design and equipment choices will play out on the real network than using human listening panels to evaluate voice quality.

Examine your delay budget

The delay budget encompasses the total time from speaking into an instrument at one end to hearing the reconstructed sound at the other end. Human factor studies show that most listeners can detect end-to-end delays that amount to more than 100 milliseconds (ms). At around 250 ms, the delay budget is annoying; it's hard for each party to murmur in agreement or interrupt the other. Above 450 ms end-to-end delay, only listeners who are desperate to save money will put up with voice over a LAN or WAN.

Jitter induces the stutter, gargling and intermittent breakup that can occur on a bad connection. Called the "cliff effect," the jitter phenomenon is peculiar to digital networks. It is caused by too many dropped, or bunched-up, packets. Audio sent over a network can sound good if packets arrive with roughly the same pacing and spacing they had when they left the voice encoder (vocoder). But if packets arrive erratically, a decoder may have to give up and attempt to resynchronize. The human ear is very sensitive to short breaks like these, which impair the perceived quality of a conversation.

Many voice gateways and IP telephones incorporate jitter buffers to smooth out packet delivery to decoders. They may improve quality to a point, but if a jitter buffer is set for too large a number of milliseconds, unacceptable delay may be added to the overall delay budget.

The trick is establishing a jitter buffer that's big enough to keep voice from sounding bad, ideally without causing delays above that 100 ms touchpoint. You might be able to achieve this end by reducing, for example, the number of hops between routers or switches that can introduce buffering delays in the network, or by reducing the size of queues at network access points.

Table 1: Network Delay Analysis

Simple NetworkComplex Network
Input Buffer24 ms.24 ms.
Compression20 ms.20 ms.
Access QueueN/A24 ms.
Network Latency5 ms.25 ms.
Far End QueueN/A24 ms.
Jitter Buffer40 ms.80 ms.
Decoder4 ms.4 ms.
Totals93 ms.201 ms.

Adapted from "Voice Over Frame Relay", White Paper, www.actnetworks.com/3_6_whitep_voiceo.htm by ACT Networks (now a Clarent Company).


Table 1, is a network delay analysis for two frame relay scenarios-one that shows a tolerable delay budget, and one that approaches the 250 ms annoyance factor. Similar budgets can be developed for IP, ATM, or DSL transports. In Table 1, the Simple Network assumes one switch hop and no queuing at the network access points. The Complex Network assumes the queuing associated with the high traffic of an integrated frame relay access device (FRAD), and five switch hops, as may be encountered in a large public frame relay network. In similar environments, it may be hard to bring the delay budget closer to 100 ms.

Overall delay budgets will vary greatly, depending on the quality of service (QoS) mechanisms that might or might not be available. IP networks have no inherent QoS mechanisms. However, a layer 5 enterprise switch like the VIPswitch 1600, plugged in a 100 Mbps LAN to mediate access to a WAN or the Internet, can provide ersatz QoS by recognizing multimedia packets and automatically placing them at the head of an access queue. By doing so, a device like this can reduce both latency and jitter, while allowing for smaller downstream jitter buffer settings. ATM transports are inherently well suited for packet voice transmissions because they guarantee QoS and can provide smooth streaming if voice traffic priorities are set higher than data priorities.


   Page: 1 | 2 | 3 | Next Page

Research and Reports

Hypervisor Derby
August 2011

Network Computing: August 2011

TechWeb Careers