DocumentCode
1939373
Title
Introducing COMPACT: An oscillator-based approach to toll-quality speech coding at low bit rates
Author
Yen, Anton Y. ; Gorodnitsky, Irina
Author_Institution
SPAWAR Syst. Center Pacific, San Diego, CA, USA
fYear
2010
fDate
Oct. 31 2010-Nov. 3 2010
Firstpage
293
Lastpage
297
Abstract
In this paper, we introduce an improved oscillator model we term the Complete Oscillator Model (COM). A significant advantage of the COM over classical oscillators such as the Self Excited Vocoder is that it is not restricted to modeling only certain larger-scale patterns in the source sequence. Here, we develop a speech coding system based on the proposed COM. In this system, the COM is used in combination with a linear predictor, the Pulsed Autoregressive CompensaTor (PACT), to develop a novel, oscillator-based approach to toll-quality speech coding at low bit rates. Unlike the linear prediction-based models utilized in modern speech coders, oscillators do not depend on an estimate of the residual error to regenerate the signal. As such, the residual is encoded only for select frames, providing a potential improvement in coding efficiency. An implementation of the hybrid COM/PACT system, which we call COMPACT, is described and is shown to provide both perceptual quality and bit rate that are competitive with mature standards such as G.729 and AMR. The given implementation is demonstrated to produce toll-quality speech, as measured by PESQ-MOS, at 9.77 kbps. Future tuning of this implementation is expected to improve performance to where it could exceed the current state of the art.
Keywords
oscillators; speech coding; vocoders; AMR; G.729; bit rate 9.77 kbit/s; complete oscillator model; hybrid COM/PACT system; linear predictor; low bit rates; pulsed autoregressive compensator; residual error; self excited vocoder; toll-quality speech coding; Bit rate; Delay; Mathematical model; Oscillators; Signal to noise ratio; Speech; Speech coding; Audio oscillators; speech codecs; speech coding; speech processing; speech synthesis;
fLanguage
English
Publisher
IEEE
Conference_Titel
MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010
Conference_Location
San Jose, CA
ISSN
2155-7578
Print_ISBN
978-1-4244-8178-1
Type
conf
DOI
10.1109/MILCOM.2010.5680310
Filename
5680310