SpeechSC and MRCPv2
Posted by Mike DeMaria on January 19, 2007
|
|
Anyone who develops, deploys or uses a voice application knows the benefits of speech processing; the technology enables functions such as sending e-mail or instant messages over the corporate PBX with a cell phone using TTS (text-to-speech) technology. However, setting up these capabilities isn't easy, and a standard method of processing and controlling audio streams across network resources has been conspicuously absent.
The IETF's SpeechSC (Speech Services Control) working group is out to fix that problem with MRCP (Media Resource Control Protocol) version 2. The specification will allow any voice application to control network-based media resources, such as speech synthesizers and recognizers. The working group's ultimate goal is to encourage the development of--and lower the financial bar to--new speech-enabled applications.
Speech-processing vendors such as Nuance Communications and Voxpilot are on board, as is Cisco Systems, and MRCPv2 speech engines are already coming out, despite the standard still being in development.
However, the standard is getting the cold shoulder from some significant players. Microsoft hasn't taken an official position on it, and it's one of the few heavyweights with a stake in the speech-processing market not to commit. Most major PBX vendors haven't committed to the standard either, though we suspect that MRCPv2 will quietly gain support from PBX vendors through their relationships with speech-processing vendors.









Add Your Comment: