• DocumentCode
    1939373
  • Title
    Introducing COMPACT: An oscillator-based approach to toll-quality speech coding at low bit rates
  • Author
    Yen, Anton Y. ; Gorodnitsky, Irina
  • Author_Institution
    SPAWAR Syst. Center Pacific, San Diego, CA, USA
  • fYear
    2010
  • fDate
    Oct. 31 2010-Nov. 3 2010
  • Firstpage
    293
  • Lastpage
    297
  • Abstract
    In this paper, we introduce an improved oscillator model we term the Complete Oscillator Model (COM). A significant advantage of the COM over classical oscillators such as the Self Excited Vocoder is that it is not restricted to modeling only certain larger-scale patterns in the source sequence. Here, we develop a speech coding system based on the proposed COM. In this system, the COM is used in combination with a linear predictor, the Pulsed Autoregressive CompensaTor (PACT), to develop a novel, oscillator-based approach to toll-quality speech coding at low bit rates. Unlike the linear prediction-based models utilized in modern speech coders, oscillators do not depend on an estimate of the residual error to regenerate the signal. As such, the residual is encoded only for select frames, providing a potential improvement in coding efficiency. An implementation of the hybrid COM/PACT system, which we call COMPACT, is described and is shown to provide both perceptual quality and bit rate that are competitive with mature standards such as G.729 and AMR. The given implementation is demonstrated to produce toll-quality speech, as measured by PESQ-MOS, at 9.77 kbps. Future tuning of this implementation is expected to improve performance to where it could exceed the current state of the art.
  • Keywords
    oscillators; speech coding; vocoders; AMR; G.729; bit rate 9.77 kbit/s; complete oscillator model; hybrid COM/PACT system; linear predictor; low bit rates; pulsed autoregressive compensator; residual error; self excited vocoder; toll-quality speech coding; Bit rate; Delay; Mathematical model; Oscillators; Signal to noise ratio; Speech; Speech coding; Audio oscillators; speech codecs; speech coding; speech processing; speech synthesis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010
  • Conference_Location
    San Jose, CA
  • ISSN
    2155-7578
  • Print_ISBN
    978-1-4244-8178-1
  • Type
    conf
  • DOI
    10.1109/MILCOM.2010.5680310
  • Filename
    5680310