Title :
300 bps noise robust vocoder
Author :
Obranovich, Charles R. ; Golusky, John M. ; Preuss, Robert D. ; Fabbri, Darren R. ; Cruthirds, Daniel R. ; Aylward, Erin M. ; Freebersyser, James A. ; Kolek, Stephen R.
Author_Institution :
Raytheon BBN Technol., Cambridge, MA, USA
fDate :
Oct. 31 2010-Nov. 3 2010
Abstract :
Within DARPA´s Advanced Speech Encoding (ASE) program [1], BBN developed a series of noise-robust vocoder (NRV) algorithms and had them tested at an independent evaluation facility. With transmitted data rates as low as 300 bps, these NRV algorithms yield superior speech intelligibility, as compared to the 2400 bps enhanced Mixed Excitation Linear Prediction (MELPe) vocoder, in extremely harsh noise environments. NRV algorithms achieve their superior performance using an advanced wideband spectrum analysis procedure, known as spectral hypothesis testing, that compares each noisy multi-frame block of microphone output signal against hierarchically-structured speech and noise spectral trajectory codebooks. While the benefits for NRV speech coding are dramatic, spectral hypothesis testing places significant demand on encoder memory bandwidth. This created a challenge for real-time NRV operation. In 2009, BBN addressed this challenge by creating a hardware prototype with a simple coprocessor design. To assist the DSP, an FPGA supports the high-bandwidth memory access and modest number of operations needed for 300 bps real-time operation.
Keywords :
coprocessors; digital signal processing chips; field programmable gate arrays; linear predictive coding; spectral analysis; speech coding; speech intelligibility; vocoders; ASE program; DSP; FPGA; MELPe vocoder; NRV algorithm; advanced speech encoding; advanced wideband spectrum analysis; coprocessor design; encoder memory bandwidth; hierarchically-structured speech; high-bandwidth memory access; microphone output signal; mixed excitation linear prediction vocoder; noise robust vocoder; noise spectral trajectory codebook; spectral hypothesis testing; speech intelligibility; Noise; Noise robustness; Real time systems; Speech; Testing; Trajectory; Vocoders; 300 bps; ASE; MELPe; NRV; noise robust; speech coding; vocoder;
Conference_Titel :
MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-8178-1
DOI :
10.1109/MILCOM.2010.5680311