• DocumentCode
    1360508
  • Title

    Autoregressive Models of Amplitude Modulations in Audio Compression

  • Author

    Ganapathy, Sriram ; Motlicek, Petr ; Hermansky, Hynek

  • Author_Institution
    Electr. & Comput. Eng. Dept., Johns Hopkins Univ., Baltimore, MD, USA
  • Volume
    18
  • Issue
    6
  • fYear
    2010
  • Firstpage
    1624
  • Lastpage
    1631
  • Abstract
    We present a scalable medium bit-rate wide-band audio coding technique based on frequency-domain linear prediction (FDLP). FDLP is an efficient method for representing the long-term amplitude modulations of speech/audio signals using autoregressive models. For the proposed audio codec, relatively long temporal segments (1000 ms) of the input audio signal are decomposed into a set of critically sampled sub-bands using a quadrature mirror filter (QMF) bank. The technique of FDLP is applied on each sub-band to model the sub-band temporal envelopes. The residual of the linear prediction, which represents the frequency modulations in the sub-band signal, are encoded and transmitted along with the envelope parameters. These steps are reversed at the decoder to reconstruct the signal. The proposed codec utilizes a simple signal independent nonadaptive compression mechanism for a wide class of speech and audio signals. The subjective and objective quality evaluations show that the reconstruction signal quality for the proposed FDLP codec compares well with the state-of-the-art audio codecs in the 32-64 kbps range.
  • Keywords
    amplitude modulation; audio coding; autoregressive processes; channel bank filters; data compression; frequency modulation; frequency-domain analysis; signal reconstruction; amplitude modulations; audio codec; audio compression; autoregressive models; bit rate 32 kbit/s to 64 kbit/s; decoder; frequency modulations; frequency-domain linear prediction; objective quality evaluations; quadrature mirror filter bank; scalable medium bit-rate wide-band audio coding technique; signal independent nonadaptive compression mechanism; signal quality reconstruction; speech-audio signal decomposition; subband signal; subband temporal envelope modelling; subjective quality evaluations; frequency-domain linear prediction (FDLP); modulation spectrum; objective and subjective evaluation of audio quality; speech and audio coding;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2038813
  • Filename
    5356226