• DocumentCode
    394683
  • Title

    Fine grain scalable perceptual and lossless audio coding based on IntMDCT

  • Author

    Geiger, Ray ; Herre, Jurgen ; Schuller, Gerald ; Sporer, Thomas

  • Author_Institution
    Fraunhofer AEMT, Ilmenau, Germany
  • Volume
    5
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper presents an embedded fine grain scalable perceptual and lossless audio coding scheme. The enabling technology for this combined perceptual and lossless audio coding approach is the integer modified discrete cosine transform (IntMDCT), which is an integer approximation of the MDCT based on the lifting scheme. It maintains the perfect reconstruction property and therefore enables efficient lossless coding in the frequency domain. The close approximation of the MDCT also allows us to build a perceptual coding scheme based on the IntMDCT. In this paper, a bitsliced arithmetic coding technique is applied to the IntMDCT values. Together with the encoded shape of the masking threshold, a perceptually hierarchical bitstream is obtained, containing several stages of perceptual quality and extending to lossless operation when transmitted completely. A concept of encoding subslices is presented in order to obtain a fine adaptation to the masking threshold, especially in the range of perceptually transparent quality.
  • Keywords
    arithmetic codes; audio coding; data compression; discrete cosine transforms; signal reconstruction; transform coding; variable rate codes; IntMDCT; bitsliced arithmetic coding; encoded shape; fine grain scalable audio coding; integer modified discrete cosine transform; lifting scheme; lossless audio coding; masking threshold; perceptual audio coding; perceptually hierarchical bitstream; perceptually transparent quality; perfect reconstruction property; subslice encoding; Arithmetic; Audio coding; Codecs; Discrete cosine transforms; Frequency domain analysis; MPEG 4 Standard; Masking threshold; Noise shaping; Scalability; Shape;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1200002
  • Filename
    1200002
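
The abstract states that the IntMDCT is an integer approximation of the MDCT built on the lifting scheme and that it keeps the perfect reconstruction property. The sketch below illustrates only that general idea on a single plane rotation, not the transform from the paper: a rotation is factored into three lifting steps, each step is rounded to an integer, and the inverse undoes the steps in reverse order. The function names `lifting_rotate` and `lifting_rotate_inverse` are hypothetical, and the code assumes an angle whose sine is nonzero.

```python
import math


def lifting_rotate(x, y, angle):
    """Integer approximation of a plane rotation via three rounded lifting steps.

    Assumes sin(angle) != 0. Inputs and outputs are integers.
    """
    a = (math.cos(angle) - 1.0) / math.sin(angle)  # shear coefficient
    s = math.sin(angle)
    x = x + round(a * y)  # lifting step 1
    y = y + round(s * x)  # lifting step 2
    x = x + round(a * y)  # lifting step 3
    return x, y


def lifting_rotate_inverse(x, y, angle):
    """Exact inverse: subtract the same rounded terms in reverse order."""
    a = (math.cos(angle) - 1.0) / math.sin(angle)
    s = math.sin(angle)
    x = x - round(a * y)  # undo step 3
    y = y - round(s * x)  # undo step 2
    x = x - round(a * y)  # undo step 1
    return x, y


if __name__ == "__main__":
    angle = 0.3
    x0, y0 = 100, -37
    xr, yr = lifting_rotate(x0, y0, angle)
    # The integer result stays close to the floating-point rotation ...
    print((xr, yr), (x0 * math.cos(angle) - y0 * math.sin(angle),
                     x0 * math.sin(angle) + y0 * math.cos(angle)))
    # ... and the original integers are recovered exactly.
    print(lifting_rotate_inverse(xr, yr, angle) == (x0, y0))
```

Because each lifting step adds a rounded function of the other, unchanged value, the same rounded term can be recomputed and subtracted on the way back. This is why the rounding error does not prevent lossless reconstruction.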