• DocumentCode
    1707003
  • Title

    Low bit rate speech compression using hidden Markov modeling

  • Author

    Bardenhagen, Steven T. ; Brown, Kathy L. ; Braun, Robert D.

  • Author_Institution
    Sanders Associates Inc., Nashua, NH, USA
  • Volume
    1
  • fYear
    1997
  • Firstpage
    507
  • Abstract
    This paper presents a new approach for low bit-rate speech compression that exploits the acoustic structure of the speech signal to yield a reduction in bit-rate with negligible loss in speech quality. This new approach encodes the signal using subword units of decomposition that can be extracted through hidden Markov modeling. The subword decomposition yields a segmentation of the signal into variable-length quasi-stationary segments that are encoded using a variable-rate multimode compression scheme. In this paper, we show that quality is dependent on the type and structure of the subword unit chosen for decomposition and compression. We show that fenones, which correspond to acoustic subword units, provide better quality than linguistic-based units such as phonemes. The speech model is based on a harmonic representation similar to the sinusoidal transform coder developed by McAulay and Quatieri (1986, 1992) and the multiband excitation model developed by Griffin and Lim (1987, 1988).
  • Keywords
    data compression; hidden Markov models; speech coding; variable length codes; HMM; acoustic subword units; fenones; harmonic representation; hidden Markov model; low bit rate speech compression; multiband excitation model; segmentation coding; sinusoidal transform coder; speech coding; speech model; speech quality; subword decomposition; variable length quasi-stationary segments; variable rate multimode compression scheme; Acoustic applications; Acoustic signal processing; Bit rate; Hidden Markov models; Parametric statistics; Smoothing methods; Speech processing; Speech synthesis; Steady-state; Vocoders;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    MILCOM 97 Proceedings
  • Conference_Location
    Monterey, CA
  • Print_ISBN
    0-7803-4249-6
  • Type

    conf

  • DOI
    10.1109/MILCOM.1997.648774
  • Filename
    648774