• DocumentCode
    835917
  • Title

    Perceptual coding of narrow-band audio signals at low rates

  • Author

    Najaf-Zadeh, Hossein ; Kabal, Peter

  • Author_Institution
    Adv. Audio Syst., Commun. Res. Centre, Ottawa, Ont., Canada
  • Volume
    14
  • Issue
    2
  • fYear
    2006
  • fDate
    3/1/2006 12:00:00 AM
  • Firstpage
    609
  • Lastpage
    622
  • Abstract
    This paper describes a coding paradigm using coding tools based on the characteristics of the human hearing system so as to accommodate a wide range of narrow-band audio inputs without annoying artifacts at low rates (down to 8 kb/s). The narrow-band perceptual audio coder (NPAC) employs a variety of algorithms to account for the perceptually irrelevant parts of the input signal in addition to statistical redundancies. The new algorithms used in the NPAC coder include a perceptual error measure in training the codebooks and selecting the best codewords which takes into account the audible parts of the quantization noise, a perception-based bit-allocation algorithm and a new predictive scheme to vector quantize the scale factors. The NPAC coder delivers acceptable quality without annoying artifacts for most narrow-band audio signals at around 1 bit/sample. Informal subjective tests have shown that the NPAC coder outperforms a commercial low-rate music coder operating at 8 kb/s.
  • Keywords
    audio coding; vector quantisation; codebooks; narrow-band audio signals; narrow-band perceptual audio coder; perception-based bit-allocation algorithm; perceptual error measure; quantization noise; vector quantization; Audio coding; Bandwidth; Bit rate; Humans; IP networks; Narrowband; Satellite broadcasting; Signal processing; Speech; Vector quantization; Adaptive bit allocation; masking model; modified discrete cosine transform filter bank; narrow-band audio coding; perceptual audio coding; perceptual distortion; perceptual vector quantization; predictive vector quantization;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.855827
  • Filename
    1597264